Examining the Validity of Cross-Lingual Word Sense Disambiguation



Título del documento: Examining the Validity of Cross-Lingual Word Sense Disambiguation
Revista: Polibits
Base de datos: PERIÓDICA
Número de sistema: 000359001
ISSN: 1870-9044
Autores: 1
2
Instituciones: 1University College Ghent, Gante. Bélgica
2Ghent University, Department of Applied Mathematics and Computer Science, Gante. Bélgica
Año:
Periodo: Ene-Jun
Número: 43
País: México
Idioma: Inglés
Tipo de documento: Artículo
Enfoque: Analítico, descriptivo
Resumen en inglés This paper describes a set of experiments in which the viability of a classification–based Word Sense Disambiguation system that uses evidence from multiple languages is investigated. Instead of using a predefined monolingual sense–inventory such as WordNet, we use a language–independent framework and start from a manually constructed gold standard in which the word senses are made up by the translations that result from word alignments on a parallel corpus. To train and test the classifier, we used English as an input language and we incorporated the translations of our target words in five languages (viz. Spanish, Italian, French, Dutch and German) as features in the feature vectors. Our results show that the multilingual approach outperforms the classification experiments where no additional evidence from other languages is used. These results confirm our initial hypothesis that each language adds evidence to further refine the senses of a given word. This allows us to develop a proof of concept for a multilingual approach to Word Sense Disambiguation
Disciplinas: Ciencias de la computación,
Literatura y lingüística
Palabras clave: Procesamiento de datos,
Lingüística aplicada,
Lingüística computacional,
Ambigüedad,
Semántica,
Multilingüismo
Keyword: Computer science,
Literature and linguistics,
Data processing,
Applied linguistics,
Computing linguistics,
Ambiguity,
Semantics,
Multilingualism
Texto completo: Texto completo (Ver HTML)