Text Analysis Using Different Graph-Based Representations



Título del documento: Text Analysis Using Different Graph-Based Representations
Revue: Computación y Sistemas
Base de datos: PERIÓDICA
Número de sistema: 000423326
ISSN: 1405-5546
Autores: 1
1
2
Instituciones: 1Universidad de las Américas, Departamento de Ciencias de la Computación, Electrónica y Mecatrónica, Cholula, Puebla. México
2Benemérita Universidad Autónoma de Puebla, Facultad de Ciencias de la Computación, Puebla. México
Año:
Periodo: Oct-Dic
Volumen: 21
Número: 4
País: México
Idioma: Inglés
Tipo de documento: Artículo
Enfoque: Aplicado, descriptivo
Resumen en inglés This paper presents an overview of different graph-based representations proposed to solve text classification tasks. The core of this manuscript is to highlight the importance of enriched/non-enriched co-occurrence graphs as an alternative to traditional features representation models like vector representation, where most of the time these models can not map all the richness of text documents that comes from the web (social media, blogs, personal web pages, news, etc). For each text classification task the type of graph created as well as the benefits of using it are presented and discussed. In specific, the type of features/patterns extracted, the implemented classification/similarity methods and the results obtained in datasets are explained. The theoretical and practical implications of using co-occurrence graphs are also discussed, pointing out the contributions and challenges of modeling text document as graphs
Disciplinas: Ciencias de la computación,
Literatura y lingüística
Palabras clave: Lingüística aplicada,
Modelación de textos,
Representaciones gráficas,
Clasificación de textos
Keyword: Applied linguistics,
Text modeling,
Graphic representations,
Text classification
Texte intégral: Texto completo (Ver HTML) Texto completo (Ver PDF)