Intelligent Web Content Categorizer
Netcognize Intelligent Web Content Categorizer (iWCC) learns and categorizes any web content including common web categories and special information hidden in web sites.
The novel approach embedded in iWCC for web content mining is inspired by human brain activities. Employing this approach IWCC accuracy ratio could reach up to 98% in recognizing trained categories with minor than 1% of false alarms. iWCC will use various information on a web page depend on targeted categories. The information being used could be: texts, URL, links, images, profile, hidden data and etc.
iWCC is able to process at least 300 pages simultaneously and it supports IPv6. Besides, iWCC processes CSS and js, and text in 150 different languages and various text encoding.
iWCC Processes different components of the page independently and/or making decision based on combined results. Moreover, iWCC extracts hidden data and processes interactive web pages as well.