Thank you for being a valued part of the CNET community. As of December 1, 2020, the forums are in read-only format. In early 2021, CNET Forums will no longer be available. We are grateful for the participation and advice you have provided to one another over the years.

Thanks,

CNET Support

General discussion

OFFICE 2003 Filtered HTML - a myth?

Apr 23, 2007 7:44PM PDT

I've done File>Save as Webpage>Filtered HTML but still get code that looks like this:

<p class=MsoNormal style='text-align:justify'><b><span lang=EN-GB> </span></b></p>

<p class=MsoNormal style='text-align:justify'><b><span lang=EN-GB>DOMESTIC
VIOLENCE</span></b></p>

etc...etc...

With a massive style sheet at the top. This doesn;t look filtered to me and was wondering if this is normal or not?

Please help if you can,

Thanks, Steve

Discussion is locked

- Collapse -
Re: Word as html-editor
Apr 23, 2007 8:21PM PDT

Steve,

Word doesn't do a great job as html-editor indeed, at least not to publish the result on the web. And Microsoft has no reason to change that, because they sell Frontpage as such.

Maybe better try another program. There are a lot to choose from.

In the meantime: tidy from http://tidy.sourceforge.net/ can do some cleaning, run with the right switches.

Kees

- Collapse -
What happens when you have to use word?
Apr 23, 2007 8:32PM PDT

Thanks for you reply.

I totally agree that MS Word is not the best tool for web authoring but what happens as part of my job is that people send me formatted word files (and they won;t change that) and want them converted to html for our web site.

I'll try the tidy tool you suggest but it seems bizarre to me that MS provide a html filter which clearly doesn;t work or at least doesn;t work well.

Any other suggestions?

- Collapse -
Other suggestions.
Apr 23, 2007 9:38PM PDT

Just googled, none from my own experience:

http://philip.greenspun.com/wtr/word.html
http://www.solutionsoft.com/w2w.htm
http://blog.devstone.com/aaron/archive/2006/01/10/1426.aspx
http://www.informit.com/articles/article.asp?p=359433&rl=1

If this is your job, you might convince your boss that he needs to buy one of these cleaning tools, or he might convince you that when it looks right, it is right, so don't mind about the html, because nobody will see it.

http://office.microsoft.com/en-us/help/HP030852781033.aspx obviously promises more than it offers.

With Office 2007 using xml to store documents, the conversion to html might be easier, if nog unnecessary. Or somebody will write a nice 'office xml to clean html' conversion utility. In principle, xml is very suited to do that.

Hope this helps.


Kees

- Collapse -
Use GMAIL!
Apr 24, 2007 1:11AM PDT

Ok, its a totally roundabout way but it works for my purposes (text only docs). Email the file to a gmail account and choose "View as HTML" then view the source of the page and copy and paste that into a text file and edit.

this will not work for pages with images (i think).