Information Fusion with ProFusion*

Susan Gauch, Guijun Wang

Department of Electrical Engineering and Computer Science
The University of Kansas, Lawrence, KS 66045

{sgauch, gwang}@eecs.ku.edu

*http://www.designlab.ku.edu/profusion/

This paper will be presented at WebNet'96.

Abstract

The explosive growth of the World Wide Web, and the resulting information overload, has led to a mini-explosion in World Wide Web search engines. This mini-explosion, in turn, led to the development of ProFusion, a meta search engine. Educators, like other users, do not have the time to evaluate multiple search engines to knowledgeably select the best for their uses. Nor do they have the time to submit each query to multiple search engines and wade through the resulting flood of good information, duplicated information, irrelevant information, and missing documents. ProFusion sends user queries to multiple underlying search engines in parallel, retrieves and merges the resulting URLs. It identifies and removes duplicates and creates one relevance-ranked list. If desired, the actual documents can be pre-fetched to remove yet more duplicates and broken links. ProFusion's performance has been compared to the individual search engines and other meta searchers, demonstrating its ability to retrieve more relevant information and present fewer duplicates pages. Future developments include analyzing the documents for improved ranking, automatically submitting queries to the most appropriate search engines, and modifying ProFusion to be an information filtering and dissemination system.

View Paper (PostScript)


If you have any comments, please send your comments to me. sgauch@ittc.ku.edu.