...

HtmlCleaner

The Html Cleaner is a product for cleaning web pages for unwanted tags when you only want the plain text extracted from the page. It uses statistical filtering to extract only vital areas from the webpages, removing ads, menus, and other unwanted text.

This is a recommended tool to use with the Summarizer when extracting text from the web. This includes the following features:

  • HTML tag cleanup for extracting text
  • Statistical filtering mechanism
  • Image URL extraction

Contact us for prices and software rights.

...