Diffs of MS Word docs
Solution 1: convert to ODF
Start the OpenOffice process
soffice "-accept=socket,host=localhost,port=2002;urp;" # openoffice.org -accept="socket,host=localhost,port=2002;urp;StarOffice.ServiceManager"
Open the doc with pyuno and save as *odf*
import uno

# Local context, used only to obtain a resolver for the running office process
local = uno.getComponentContext()
resolver = local.ServiceManager.createInstanceWithContext("com.sun.star.bridge.UnoUrlResolver", local)

# Connect to the soffice instance listening on port 2002
context = resolver.resolve("uno:socket,host=localhost,port=2002;urp;StarOffice.ComponentContext")
desktop = context.ServiceManager.createInstanceWithContext("com.sun.star.frame.Desktop", context)

# Load the Word document and store it in the default ODF text format
# (.odt is the ODF text extension; .odf is reserved for formula documents)
doc = desktop.loadComponentFromURL("file:///home/lm/test.doc", "_blank", 0, ())
doc.storeAsURL("file:///home/lm/test.odt", ())
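Once two versions of a document have been converted, the diff itself could be sketched as follows. This is only a minimal sketch, not a tested workflow: it assumes the converted files are ODF text documents (a zip archive containing content.xml), and the helper names odt_paragraphs and diff_odt are hypothetical. It compares paragraph text only, ignores formatting, and may repeat text when paragraphs are nested (for example inside frames).

```python
import difflib
import zipfile
import xml.etree.ElementTree as ET

# Namespace of ODF text elements (text:p, text:h)
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def odt_paragraphs(path):
    """Return the plain text of every paragraph and heading in an ODF text file."""
    with zipfile.ZipFile(path) as archive:
        root = ET.fromstring(archive.read("content.xml"))
    wanted = {"{%s}p" % TEXT_NS, "{%s}h" % TEXT_NS}
    return ["".join(el.itertext()) for el in root.iter() if el.tag in wanted]


def diff_odt(old_path, new_path):
    """Return a unified diff (list of lines) between two ODF text files."""
    return list(difflib.unified_diff(
        odt_paragraphs(old_path),
        odt_paragraphs(new_path),
        fromfile=old_path,
        tofile=new_path,
        lineterm="",
    ))
```

Usage, with hypothetical file names: print("\n".join(diff_odt("old.odt", "new.odt"))).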
Save as MS Word?
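One possible answer, as an unverified sketch: UNO documents can be exported with storeToURL and an explicit FilterName property. The filter name "MS Word 97" is assumed here to be the export filter for the binary .doc format, and the helpers to_file_url and save_as_word are hypothetical names. The doc argument is a document already loaded through pyuno from a running soffice process.

```python
from pathlib import Path


def to_file_url(path):
    """Turn a local path into the file:// URL form that UNO expects."""
    return Path(path).resolve().as_uri()


def save_as_word(doc, target_path):
    """Export an already loaded UNO document to .doc (assumed filter name)."""
    # Imported here so the module can be loaded without pyuno installed
    import uno  # noqa: F401  (enables the com.sun.star imports below)
    from com.sun.star.beans import PropertyValue

    word_filter = PropertyValue()
    word_filter.Name = "FilterName"
    word_filter.Value = "MS Word 97"  # assumption: export filter for .doc
    # storeToURL writes a copy and leaves the open document's location unchanged
    doc.storeToURL(to_file_url(target_path), (word_filter,))
```

storeToURL is used instead of storeAsURL so that the open document keeps pointing at its original file.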
Links
Other solutions
(see thread: http://mail.python.org/pipermail/python-es/2010-December/028833.html)
- OpenXML Productivity Tool: http://www.microsoft.com/downloads/en/details.aspx?FamilyId=C6E744E5-36E9-45F5-8D8C-331DF206E0D0&displaylang=en
- With IronPython: http://msdn.microsoft.com/en-us/library/microsoft.office.tools.word.document.compare.aspx
- Convert to text: http://code.activestate.com/recipes/279003-converting-word-documents-to-text/
- Deltaview (commercial)