Any way to convert Word .doc to (X)HTML/CSS?
Written By chris on Apr. 4, 2007.
13 Comments
Report Note
+ Clip This
Anyone know any scripts or apps that can convert the crappy Microsoft Word HTML code to decent-formatted HTML so that it can be styled with custom stylesheets?

ryanarrowsmith
Written Apr. 4, 2007 / Report /
Try this.
chris
Written Apr. 5, 2007 / Report /
Thanks ryan - that was on my (short) list of things to try out. I haven't taken a look at the output code yet, but a problem is it's still semi-manual. And there's gonna be a massive amount of Word docs needed to be converted.
ryanarrowsmith
Written Apr. 5, 2007 / Report /
Hmmm...you got me then. Wish I had an easy answer for you. i am curious...what odd job has converting all of these docs?
chris
Written Apr. 5, 2007 / Report /
It's pretty interesting, I'm working on a pretty big project that [EDITED BY LAWYERS DUE TO NON DISCLOSURE AGREEMENT. BYE.]
chris
Written Apr. 16, 2007 / Report /
Update: So far, this seems to be impossible without SOME manual work. But if there's about 1000 documents to convert "some manual work" kind of sucks.
I will say that the Textism thing produced some of the best results out of the 30 or so products/scripts I tried -- it's just not 100% there with all the crazy formatting that MS Word allows.
glenndavid
Written Aug. 3, 2007 / Report /
the fast way would be, is to import them into dreamweaver, and then use the clean up word documents, for about a 1000 times...
OR
put them on the net and not care about it and pretend you never designed that page...
Mike
Written Aug. 3, 2007 / Report /
Haha Chris are you done with that project yet? :)
chris
Written Aug. 3, 2007 / Report /
I have no idea what final solution they used (or if they used one at all). I'll see if I can find out what happened with it.
peroty
Written Aug. 3, 2007 / Report /
I've not done so in a long time but when I worked for the university I used this workflow.
1. Export HTML through Word.
2. Open Crap Code in Dreamweaver
3. Run DW's word cleanup tool.
4. Tweak as needed.
5. Smile.
Cas
Written Aug. 3, 2007 / Report /
Try this little trick if none of the above are floating your boat. Though whatever you try you're going to end up with having to do some clean-up manually.
enc
Written Sep. 22, 2007 / Report /
continuing peroty's list:
6. ???
7. PROFIT!!!
Oli
Written Sep. 22, 2007 / Report /
Dreamweaver has its own cleaner (as I now see peroty said) but I would seriously recommend going back to formula and reformatting the text from scratch.
Either that or opening it in Wordpad (Word might work too) and copy and pasting into DW. Dreamweaver's Rich-Paste isn't all bad.
Pavlusha
Written Sep. 22, 2007 / Report /
"ryanarrowsmith" cool resource , thanks a lot!