Brilliant Technologies Corporation

incorporating LTDnetwork Technology Group

INNOVATION • PARTNERSHIPS • SERVICES

Specializing in the development of innovative Technologies, Software, and Services for online e-tailers, advertising, media and marketing companies.


Free Text Analysis

Free Text Analysis is a URL query system that automatically catalogues the information presented in a web page into a structured, hierarchical view in which data is ranked by how visually important it is on the page. All existing page data is preserved, along with a rich collection of metadata that allows you to further process, manipulate or extract specific information from the results.

Free Text Analysis can be used whenever you need to know something about the content of a given URL, whether that means automatically identifying a particular product or classifying a web page's data for a content filtering system.

Figure 1 - A screenshot of an example application using Free Text Analysis technology.

Technical Overview

The system processes the underlying HTML (Hypertext Markup Language) of a web page to extract different types of information. Depending on the page, this includes different mixes of the HTML itself, the 'visible' text as a web user would see it (inner HTML) and other metadata available in most web pages.

Free Text Analysis is a Microsoft VB.NET component that can run on any Microsoft platform that supports the .NET Framework 1.1.
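The extraction described above can be illustrated with a minimal sketch. It is written in Python purely for illustration and is not the VB.NET component itself. The names PageScanner, analyse and TAG_WEIGHTS are hypothetical, and the weighting scheme is an assumption: the product's actual ranking of 'visually important' data is not documented here.

```python
from html.parser import HTMLParser

# Hypothetical weights: how the real product scores visual importance is not stated.
TAG_WEIGHTS = {"title": 5, "h1": 4, "h2": 3, "h3": 2, "b": 1, "strong": 1}
HIDDEN_TAGS = {"script", "style"}          # content a web user never sees
VOID_TAGS = {"meta", "br", "img", "hr", "input", "link"}  # tags with no closing tag


class PageScanner(HTMLParser):
    """Collects meta tags and visible text, each tagged with a weight."""

    def __init__(self):
        super().__init__()
        self.meta = {}      # meta name -> content
        self.items = []     # (weight, innermost tag, visible text)
        self._stack = []    # currently open tags

    def handle_starttag(self, tag, attrs):
        if tag == "meta":
            attributes = dict(attrs)
            if attributes.get("name") and attributes.get("content") is not None:
                self.meta[attributes["name"].lower()] = attributes["content"]
        if tag not in VOID_TAGS:
            self._stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed tags in between.
        if tag in self._stack:
            while self._stack and self._stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = " ".join(data.split())
        if not text or any(t in HIDDEN_TAGS for t in self._stack):
            return
        weight = max((TAG_WEIGHTS.get(t, 0) for t in self._stack), default=0)
        tag = self._stack[-1] if self._stack else ""
        self.items.append((weight, tag, text))


def analyse(html):
    """Return the page's meta data and its visible text, highest weight first."""
    scanner = PageScanner()
    scanner.feed(html)
    scanner.close()
    ranked = sorted(scanner.items, key=lambda item: -item[0])
    return {"meta": scanner.meta, "ranked": ranked}
```

Calling analyse on a page's HTML returns a dictionary with the meta tags under "meta" and the visible text under "ranked", so text in a title or heading appears before ordinary paragraph text.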

Features

  • Extracts data from unstructured and inconsistent web pages

  • Single URL execution method

  • Pluggable business rules mean the system can be tailored to respond to different situations
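
One way to picture pluggable business rules is as small, independent functions registered with the analyser and run against each page. The sketch below is an assumption about how such a design might look, in Python for illustration only. RuleSet, looks_like_product and blocked_keyword are hypothetical names, not part of the product.

```python
import re
from typing import Callable, Dict, List, Optional

# A rule inspects extracted page data and returns a label, or None if it does not apply.
Rule = Callable[[Dict[str, str]], Optional[str]]


class RuleSet:
    """Holds the rules currently plugged in and applies them in order."""

    def __init__(self) -> None:
        self._rules: List[Rule] = []

    def register(self, rule: Rule) -> Rule:
        self._rules.append(rule)
        return rule  # returning the rule lets register be used as a decorator

    def classify(self, page: Dict[str, str]) -> List[str]:
        labels = []
        for rule in self._rules:
            label = rule(page)
            if label is not None:
                labels.append(label)
        return labels


rules = RuleSet()


@rules.register
def looks_like_product(page):
    # Illustrative heuristic: a currency symbol followed by digits suggests a price.
    if re.search(r"[$£€]\s?\d+(\.\d{2})?", page.get("text", "")):
        return "product"
    return None


@rules.register
def blocked_keyword(page):
    # Illustrative content-filter rule with a made-up keyword list.
    blocked = {"casino", "gambling"}
    if blocked & set(page.get("text", "").lower().split()):
        return "filtered"
    return None
```

New behaviour is added by registering another function, without changing the code that walks the page, which is the sense in which the rules are pluggable in this sketch.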

Benefits

  • Efficiently gather accurate data in real time from any web site to ensure your data is up to date

  • Highly customizable, with support for external reference data lookups

  • Schedule bulk data gathering jobs to utilize off peak resource periods

  • XML Web Services are platform and language independent and easily integrated with existing systems

  • High performance, scalability and reliability

