[nycphp-talk] PHP & Word Documents
Tim Gales
tgales at tgaconnect.com
Sat May 7 20:18:32 EDT 2005
Joseph Crawford writes:
> Is there a way to parse a word document?
...
> Any suggestions appreciated.
Open Office will parse Word documents.
It can then save them in its own format, as files with
sxw (document) and stw (template) extensions.
These files use a 'zip' format and can be unzipped into
their XML components.
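A minimal sketch of the unzip step, in Python only because its standard library makes it short. It assumes the document body sits in an archive member named content.xml, which is how OpenOffice.org 1.x Writer files are laid out as far as I know. The function name read_content_xml is made up for illustration.

```python
import zipfile

def read_content_xml(path):
    """Return the raw bytes of content.xml from an .sxw/.stw archive.

    Assumption: the archive contains a member named 'content.xml'.
    """
    with zipfile.ZipFile(path) as archive:
        return archive.read("content.xml")
```

Calling read_content_xml("report.sxw") (a hypothetical file name) would hand back the XML as bytes, ready for whatever parser or transform you prefer.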
Have a look at:
http://books.evc-cit.info/ch01.php
and
http://www.xml.com/pub/a/2005/01/26/hacking-ooo.html
Once you have some raw xml you should be able to
'xslt' it into just about anything you need.
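If a full XSLT stylesheet is more than you need, a rough stand-in is to walk the XML directly. The sketch below is not XSLT; it uses Python's ElementTree to pull the paragraph text out of the XML. It assumes the OpenOffice.org 1.x text namespace URI shown in the code, so check it against your own content.xml. The function name paragraphs is hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumption: namespace URI used for text:* elements in OpenOffice.org 1.x files.
TEXT_NS = "http://openoffice.org/2000/text"

def paragraphs(content_xml):
    """Return the plain text of each text:p element, in document order."""
    root = ET.fromstring(content_xml)
    # itertext() also collects text from nested elements such as text:span.
    return ["".join(p.itertext()) for p in root.iter("{%s}p" % TEXT_NS)]
```

This only collects paragraph text and ignores styles, tables and headings; a real stylesheet would be the better tool once you need to preserve structure.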
--
T. Gales & Associates
'Helping People Connect with Technology'
http://www.tgaconnect.com