I've got over 3,000 HTML pages that were created in Word, and I need to find a suitable clean-up method for them o_0
Dreamweaver cleans Word code to a certain degree.
I've also got to integrate a new HTML editor into my company's back-end and have been looking at
www.xstandard.com, which seems to be the most customizable and the best at cleaning Word code (from what I've looked at so far), so you may want to take a look at that.
If you find anything else useful, let me know.
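
For a bulk job of this size, a small script is another possible route. Below is a rough Python sketch, not a tested solution for real Word output: the function name `clean_word_html` and the regex patterns are my own assumptions about the usual Word leftovers (conditional comments, `<o:p>` tags, `Mso*` classes, `mso-` styles), and regexes are a crude way to handle HTML, so check the results on a few pages first.

```python
import re

def clean_word_html(html: str) -> str:
    """Strip some common Word-specific markup (hypothetical helper, crude by design)."""
    # Remove conditional comments such as <!--[if gte mso 9]> ... <![endif]-->
    html = re.sub(r"<!--\[if[^\]]*\]>.*?<!\[endif\]-->", "", html,
                  flags=re.DOTALL | re.IGNORECASE)
    # Remove Office namespace tags such as <o:p> and </o:p>, keeping any text inside
    html = re.sub(r"</?[ovw]:[^>]*>", "", html, flags=re.IGNORECASE)
    # Remove class attributes whose value starts with "Mso", e.g. class="MsoNormal"
    html = re.sub(r'\s+class="?Mso[^"\s>]*"?', "", html, flags=re.IGNORECASE)
    # Remove whole style attributes that contain mso- declarations
    # (this also drops any non-Word declarations in the same attribute)
    html = re.sub(r'\s+style="[^"]*mso-[^"]*"', "", html, flags=re.IGNORECASE)
    return html

sample = ('<p class="MsoNormal" style="mso-margin-top-alt:auto">Hello<o:p></o:p></p>'
          '<!--[if gte mso 9]><xml>x</xml><![endif]-->')
print(clean_word_html(sample))
```

To run it over a whole folder, you could loop over the files, read each one, pass the text through the function, and write the result to a separate output directory so the originals stay untouched.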