Clean HTML Tages Added in Word Documents

Published in: Microsoft Word

If you ever tried converting an Excel sheet or a Word document into an HTML web page, you know the amount of useless tags and styles that are added to the output making the HTML file bulky and complex.

Similar is the situation when your copy-paste rich text from an existing web page into another text editor. CSS formatting styles that become part of the copied text actually make no sense once that HTML snippet is used elsewhere.

clean html Word Off removed all the inline styles from the HTML table of Word doc. file

To keep your HTML clean and beautiful, Tom Dyson has created an online utility at that takes your dirty HTML and strips out all the junk while preserving the links and basic formatting.

Word Off will delete all HTML elements that are empty, removes every <span> and <div> tag and reduces the number of line breaks.

Developers can integrate WordOff into their web application using the CURL API. The tool will also come handy for people who use WYSIWYG HTML editors like Dreamweaver as they too can sometimes generate bloated code.

If you are into web design, do check this set of tools to strip junk from CSS Stylesheets.

Published in: Microsoft Word

Looking for something? Find here!

Meet the Author

Web Geek, Tech Columnist
Amit Agarwal

Amit Agarwal is a Google Developer Expert in GSuite and Google Apps Script. He holds an engineering degree in Computer Science (I.I.T.) and is the first professional blogger in India. Read more on Lifehacker and YourStory

Get in touch