NYCPHP Meetup

NYPHP.org

[nycphp-talk] PHP & Word Documents

Tim Gales tgales at tgaconnect.com
Sat May 7 20:18:32 EDT 2005


Joseph Crawford writes:

> Is there a way to parse a word document?
...
> Any suggestions appreciated.

Open Office will parse word documents.
It stores them in   files with  sxw and stw extensions.

They use a 'zip' format and can be unzipped into 
there xml components. 

Have a look at:
http://books.evc-cit.info/ch01.php 
and 
http://www.xml.com/pub/a/2005/01/26/hacking-ooo.html 

Once you have some raw xml you should be able to 
'xslt' it into just about anything you need.

-- 
T. Gales & Associates
'Helping People Connect with Technology'

http://www.tgaconnect.com



More information about the talk mailing list