It would be nice to have a python bake script that takes the MS Word "HTML" pages and converts to ASCII and simple HTML. Maybe have some custom classes for the title, subtitle, and the rest of the description. And strip out all the spans and unnecessary tags (like body).