docbook: MS Word to XML
Subject: Re: MS Word to XML
From: Bob Stayton ####@####.####
Date: 14 Nov 2003 18:53:34 -0000
Message-Id: <20031114100925.F29568@sco.com>
On Thu, Nov 13, 2003 at 04:54:55PM -0800, Tabatha Marshall wrote:
> Hi all,
>
> I've been exploring Windows (and Linux) solutions for transforming MS
> Word documents into XML, preferably DocBook XML 4.2.
>
> I tried XMLSPY, which can only be evaluated for 30 days. I've tried
> Morphon, which is nice for working in XML, but I couldn't figure out
> what to do with the MS Word doc.
>
> When I've used my Linux tools to convert, I've ended up with an XML
> file, but it's so awful thanks to all the junk in MS Word, it makes me
> want to scrap it and just cut/paste everything in, writing the tags in
> myself.
>
> Anybody have better luck finding an easy way to convert? Your
> suggestions are most welcome, and the sooner the better. I have a guide
> that needs conversion to XML before month-end.
>
> For the benefit of our reviewers, many of whom use Windows, please use
> "Reply All" if you have ideas to share on this subject!

You could check the DocBookWiki tools page, which lists
several "up" conversion tools:

http://docbook.org/wiki/moin.cgi/DocBookTools

I've used UpCast with some success. It converts a Word
file to an XML file that conforms to its own generalized
UpCast DTD. You can then get an XSL stylesheet from the
UpCast vendor that converts the UpCast document to a
DocBook document.
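
To illustrate the second step of that two-stage approach (generic
XML in, DocBook out), here is a minimal Python sketch. It is only an
illustration of the idea: the intermediate element names (document,
heading, paragraph) are invented for this example and are not the
UpCast DTD, and the real conversion is done by the vendor's XSL
stylesheet, not by this code.

```python
import xml.etree.ElementTree as ET

# Hypothetical intermediate markup, invented for this example.
# It is NOT the real UpCast DTD.
GENERIC_XML = """\
<document>
  <heading level="1">Installing</heading>
  <paragraph>Run the installer.</paragraph>
  <heading level="1">Configuring</heading>
  <paragraph>Edit the config file.</paragraph>
  <paragraph>Restart the service.</paragraph>
</document>
"""


def generic_to_docbook(source: str) -> ET.Element:
    """Map the toy generic format onto DocBook article/section/title/para."""
    root = ET.fromstring(source)
    article = ET.Element("article")
    current = article  # paragraphs before any heading attach to the article
    for node in root:
        if node.tag == "heading":
            # Each heading opens a new DocBook section with a title.
            current = ET.SubElement(article, "section")
            ET.SubElement(current, "title").text = node.text
        elif node.tag == "paragraph":
            # Paragraphs go into whichever section is currently open.
            ET.SubElement(current, "para").text = node.text
    return article


docbook = generic_to_docbook(GENERIC_XML)
print(ET.tostring(docbook, encoding="unicode"))
```

The sketch handles only flat, single-level headings. Nested sections,
lists, and tables are where a real stylesheet earns its keep.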

Bob Stayton                                 400 Encinal Street
Publications Architect                      Santa Cruz, CA 95060
Technical Publications                      voice: (831) 427-7796
The SCO Group                               fax:   (831) 429-1887
                                            email: ####@####.####