Kazakh Text Summarization using Fuzzy Logic



Título del documento: Kazakh Text Summarization using Fuzzy Logic
Revista: Computación y sistemas
Base de datos:
Número de sistema: 000560453
ISSN: 1405-5546
Autores: 1
1
2
1
Instituciones: 1L.N.Gumilyov Eurasian National University, Faculty of Information Technologies, Astana. Kazajstán
2Nazarbayev University, National Laboratory Astana, Astana. Kazajstán
Año:
Periodo: Jul-Sep
Volumen: 23
Número: 3
Paginación: 851-859
País: México
Idioma: Inglés
Tipo de documento: Artículo
Resumen en inglés In this paper we present an extractive summarization method for the Kazakh language based on fuzzy logic. We aimed to extract and concatenate important sentences from the primary text to obtain its shorter form. With the rapid growth of information on the Internet there is a demand on its efficient and cost-effective summarization. Therefore the creation of automatic summarization methods is considered as a very important task of natural language processing. Our approach is based on the preprocessing of the sentences by applying morphological analysis and pronoun resolution techniques in order to avoid their early rejections. Afterwards, we determine the features of the processed sentences need for exploiting fuzzy logic methods. Additionally, since there is no available data for the given task, we collected and manually annotated our own dataset from the different Internet resources in the Kazakh language for the experimentation. We also applied our method on CNN/Daily Mail dataset. The ROUGE-N indicators were calculated to assess the quality of the proposed method. The ROUGE-L(f-score) score by the proposed method with pronoun resolution for the former dataset is 0.40, whereas for the latter one it is 0.38.
Disciplinas: Ciencias de la computación
Palabras clave: Inteligencia artificial
Keyword: Extractive text summarization,
Natural language processing,
Fuzzy logic,
Artificial intelligence
Texto completo: Texto completo (Ver HTML) Texto completo (Ver PDF)