TopicSearch-Personalized Web Clustering Engine Using Semantic Query Expansion, Memetic Algorithms and Intelligent Agents



Document title: TopicSearch-Personalized Web Clustering Engine Using Semantic Query Expansion, Memetic Algorithms and Intelligent Agents
Journal: Polibits
Database: PERIÓDICA
System number: 000373737
ISSN: 1870-9044
Authors: 1
1
2
3
4
Institutions: 1 Universidad del Cauca, Cali, Valle del Cauca. Colombia
2 Universidad Nacional de Colombia, Bogotá. Colombia
3 University of Idaho, Idaho Falls, Idaho. United States of America
4 Universidad de Granada, Granada. Spain
Year:
Period: Jul-Dec
Issue: 46
Pages: 31-45
Country: Mexico
Language: English
Document type: Article
Approach: Experimental, applied
English abstract: As more resources become available on the Web, finding the desired information becomes more difficult. Intelligent agents can assist users in this task, since they can search, filter and organize information on behalf of their users. Web document clustering techniques can also help users find pages that meet their information requirements. This paper presents a personalized web document clustering model called TopicSearch. TopicSearch introduces a novel inverse document frequency function to improve the query expansion process, a new memetic algorithm for web document clustering, and a frequent phrases approach for defining cluster labels. Each user query is handled by an agent that coordinates several tasks, including query expansion, search results acquisition, preprocessing of search results, cluster construction and labeling, and visualization. These tasks are performed by specialized agents whose execution can be parallelized in certain instances. The model was successfully tested on fifty DMOZ datasets. The results demonstrated improved precision and recall over traditional algorithms (k-means, Bisecting k-means, STC and Lingo). In addition, the presented model was evaluated by a group of twenty users, with 90% being in favor of the model.
Disciplines: Computer science
Keywords (Spanish): Redes,
Recuperación de información,
Agrupamiento de documentos,
Agentes inteligentes,
Expansión de búsquedas,
Algoritmos,
Perfil de usuario
Keywords: Computer science,
Networks,
Information retrieval,
Document clustering,
Intelligent agents,
Query expansion,
Algorithms,
User profile