Revista: | Computación y sistemas |
Base de datos: | PERIÓDICA |
Número de sistema: | 000411077 |
ISSN: | 1405-5546 |
Autors: | Dandapat, Sandipan1 Way, Andy2 |
Institucions: | 1Microsoft India, Hyderabad, Andhra Pradesh. India 2Dublin City University, ADAPT Centre, Dublín. Irlanda |
Any: | 2016 |
Període: | Jul-Sep |
Volum: | 20 |
Número: | 3 |
Paginació: | 495-504 |
País: | México |
Idioma: | Inglés |
Tipo de documento: | Artículo |
Enfoque: | Experimental, aplicado |
Resumen en inglés | In this paper, we describe a technique to improve named entity recognition in a resource-poor language (Hindi) by using cross-lingual information. We use an on-line machine translation system and a separate word alignment phase to find the projection of each Hindi word into the translated English sentence. We estimate the cross-lingual features using an English named entity recognizer and the alignment information. We use these cross-lingual features in a support vector machine-based classifier. The use of cross-lingual features improves F i score by 2.1 points absolute (2.9% relative) over a good-performing baseline model |
Disciplines | Ciencias de la computación, Literatura y lingüística |
Paraules clau: | Procesamiento de datos, Lingüística aplicada, Lingüística computacional, Traducción automática, Reconocimiento de entidades nombradas |
Keyword: | Computer science, Literature and linguistics, Data processing, Applied linguistics, Computing linguistics, Machine translation, Named entity recognition |
Text complet: | Texto completo (Ver HTML) |