Anonymize (and then de-anonymize) comments in Word documents.

Piotr Czajkowski 9cf0636bee 64bit Windows binary 7 anos atrás
bin 9cf0636bee 64bit Windows binary 7 anos atrás
README.md 909e8574f3 Formatting 7 anos atrás
anonymize.c ff36614bb9 First version 7 anos atrás
comments.c e62c815d0d Small corrections to error messages 7 anos atrás
comments.h ff36614bb9 First version 7 anos atrás
dict.c ff36614bb9 First version 7 anos atrás
dict.h ff36614bb9 First version 7 anos atrás
keyval.c ff36614bb9 First version 7 anos atrás
keyval.h ff36614bb9 First version 7 anos atrás
makefile ff36614bb9 First version 7 anos atrás
stopif.h ff36614bb9 First version 7 anos atrás
test.docx ff36614bb9 First version 7 anos atrás
xmlbuff.c ff36614bb9 First version 7 anos atrás
xmlbuff.h ff36614bb9 First version 7 anos atrás
zip.c e62c815d0d Small corrections to error messages 7 anos atrás
zip.h ff36614bb9 First version 7 anos atrás

README.md

Anonymize DOCX Comments

While doing review in Word documents translators/reviewers often use tracked changes and comments to exchange feedback on translations. Usually these people are from different organizations and shouldn't know about each other. Hence the need to anonymize comments and this is what this tool will do for you.

It'll go through comments in "word/comments.xml" and change each author's name to Authornumber, where number starts from 1. It'll keep track of authors so "John Smith" will always be "Author1" for instance. After it's done it'll print list of authors and their new names.

Usage:

./anonymize test.docx - test.docx will be replaced with anonymized version.

./anonymize test.docx test2.docx - anonymized version will be saved as test2.docx leaving original test.docx intact.

Running it on provided test.docx should produce:

"King, Stephen" is now "Author1"
"Kowalski, Jan" is now "Author2"
"Piotr Fronczewski" is now "Author3"

You'll need libarchive and libxml2 to compile it. It was created as learning project while I was exploring C, so use it freely, but at your own risk. Output was tested with Word 2013 and Libre Office Writer.