Improved Named Entity Recognition using Machine Translation-based Cross-lingual Information



Título del documento: Improved Named Entity Recognition using Machine Translation-based Cross-lingual Information
Revista: Computación y sistemas
Base de datos: PERIÓDICA
Número de sistema: 000411077
ISSN: 1405-5546
Autors: 1
2
Institucions: 1Microsoft India, Hyderabad, Andhra Pradesh. India
2Dublin City University, ADAPT Centre, Dublín. Irlanda
Any:
Període: Jul-Sep
Volum: 20
Número: 3
Paginació: 495-504
País: México
Idioma: Inglés
Tipo de documento: Artículo
Enfoque: Experimental, aplicado
Resumen en inglés In this paper, we describe a technique to improve named entity recognition in a resource-poor language (Hindi) by using cross-lingual information. We use an on-line machine translation system and a separate word alignment phase to find the projection of each Hindi word into the translated English sentence. We estimate the cross-lingual features using an English named entity recognizer and the alignment information. We use these cross-lingual features in a support vector machine-based classifier. The use of cross-lingual features improves F i score by 2.1 points absolute (2.9% relative) over a good-performing baseline model
Disciplines Ciencias de la computación,
Literatura y lingüística
Paraules clau: Procesamiento de datos,
Lingüística aplicada,
Lingüística computacional,
Traducción automática,
Reconocimiento de entidades nombradas
Keyword: Computer science,
Literature and linguistics,
Data processing,
Applied linguistics,
Computing linguistics,
Machine translation,
Named entity recognition
Text complet: Texto completo (Ver HTML)