TopicSearch-Personalized Web Clustering Engine Using Semantic Query Expansion, Memetic Algorithms and Intelligent Agents



Document title: TopicSearch-Personalized Web Clustering Engine Using Semantic Query Expansion, Memetic Algorithms and Intelligent Agents
Journal: Polibits
Database: PERIÓDICA
System number: 000373737
ISSN: 1870-9044
Authors: 1
1
2
3
4
Institutions: 1Universidad del Cauca, Cali, Valle del Cauca. Colombia
2Universidad Nacional de Colombia, Bogotá. Colombia
3University of Idaho, Idaho Falls, Idaho. Estados Unidos de América
4Universidad de Granada, Granada. España
Year:
Season: Jul-Dic
Number: 46
Pages: 31-45
Country: México
Language: Inglés
Document type: Artículo
Approach: Experimental, aplicado
English abstract As resources become more and more available on the Web, so the difficulties associated with finding the desired information increase. Intelligent agents can assist users in this task since they can search, filter and organize information on behalf of their users. Web document clustering techniques can also help users to find pages that meet their information requirements. This paper presents a personalized web document clustering called TopicSearch. TopicSearch introduces a novel inverse document frequency function to improve the query expansion process, a new memetic algorithm for web document clustering, and frequent phrases approach for defining cluster labels. Each user query is handled by an agent who coordinates several tasks including query expansion, search results acquisition, preprocessing of search results, cluster construction and labeling, and visualization. These tasks are performed by specialized agents whose execution can be parallelized in certain instances. The model was successfully tested on fifty DMOZ datasets. The results demonstrated improved precision and recall over traditional algorithms (k-means, Bisecting k-means, STC y Lingo). In addition, the presented model was evaluated by a group of twenty users with 90% being in favor of the model
Disciplines: Ciencias de la computación
Keyword: Redes,
Recuperación de información,
Agrupamiento de documentos,
Agentes inteligentes,
Expansión de búsquedas,
Algoritmos,
Perfil de usuario
Keyword: Computer science,
Networks,
Information retrieval,
Document clustering,
Intelligent agents,
Query expansion,
Algorithms,
User profile
Full text: Texto completo (Ver HTML)