Feature Extraction for Token Based Word Alignment for Question Answering Systems

Sharma, Lokesh Kumar; Mittal, Namita; Aggarwal, Anubha


Título del documento:	Feature Extraction for Token Based Word Alignment for Question Answering Systems
Revista:	Computación y sistemas
Base de datos:
Número de sistema:	000560406
ISSN:	1405-5546
Autors:	Sharma, Lokesh Kumar¹ Mittal, Namita² Aggarwal, Anubha³
Institucions:	¹Galgotias College of Engineering and Technology, Department of Computer Science and Engineering, Greater Noida, UP. India ²Malaviya National Institute Of Technology Jaipur, Department of Computer Science and Engineering, Jaipur, RJ. India ³Shri Mata Vaishno Devi University, School of Computer Science and Engineering, Katra, Jammu And Kashmir. India
Any:	2018
Període:	Oct-Dic
Volum:	22
Número:	4
Paginació:	1359-1366
País:	México
Idioma:	Inglés
Tipo de documento:	Artículo
Resumen en inglés	Mapping between the source words and the target words in a set of parallel sentences are a crucial part of Question Answering (QA) systems. If an accurate aligner is used in QA systems then the efficiency of these systems also gets increased. We purpose the aligner which despite using very less lexical resources gives very good results in terms of precision, recall and F1. Previous aligners either uses more lexical resources or uses very less lexical resources. Hence, we have used POS TAG and WordNet as lexical resources. But some words whose meaning we may not know but these occur in a similar distribution and by observing their distribution these words are similar. Consider two sentences ”Lambodar is the son of Parvati” and ”Ganesha is the son of Parvati”. Here we will not find the meaning of Lambodar and Ganesha in Wordnet but since they have similar distributions so they should be aligned. For these words, we used Distribution Similarity Feature in our word aligner. This distributional similarity helps our aligner in broader coverage of words. Previous aligners were having recall in the range of 75-86 but this aligner has recall in the range of 88.4-93.3. Similarly, Exact match of previous aligners was in the range of 21-35.3 but the proposed aligner’s exact match range is 46.1-58.6. Similarly F-measure and precision have also increased.
Disciplines	Literatura y lingüística
Paraules clau:	Lingüística aplicada
Keyword:	Structural feature, Question alignment, Feature score, Alignment score, Applied linguistics
Text complet:	Texto completo (Ver HTML) Texto completo (Ver PDF)

Feature Extraction for Token Based Word Alignment for Question Answering Systems

Esperi un moment...