A Cross-Lingual Pattern Retrieval Framework



Document title: A Cross-Lingual Pattern Retrieval Framework
Journal: Polibits
Database: PERIÓDICA
System number: 000359024
ISSN: 1870-9044
Authors: 1
1
1
1
1
Institutions: 1National Tsing Hua University, Hsinchu. Taiwán
Year:
Season: Ene-Jun
Number: 43
Country: México
Language: Inglés
Document type: Artículo
Approach: Analítico, descriptivo
English abstract We introduce a method for learning to grammatically categorize and organize the contexts of a given query. In our approach, grammatical descriptions, from general word groups to specific lexical phrases, are imposed on the query's contexts aimed at accelerating lexicographers' and language learners' navigation through and GRASP upon the word usages. The method involves lemmatizing, part–of–speech tagging and shallowly parsing a general corpus and constructing its inverted files for monolingual queries, and word–aligning parallel texts and extracting and pruning translation equivalents for cross–lingual ones. At run–time, grammar–like patterns are generated, organized to form a thesaurus index structure on query words' contexts, and presented to users along with their instantiations. Experimental results show that the extracted predominant patterns resemble phrases in grammar books and that the abstract–to–concrete context hierarchy of querying words effectively assists the process of language learning, especially in sentence translation or composition
Disciplines: Ciencias de la computación,
Literatura y lingüística
Keyword: Procesamiento de datos,
Lingüística aplicada,
Lingüística computacional,
Construcciones gramaticales,
Aprendizaje de idiomas,
Archivos invertidos,
Consulta,
Recuperación de patrones
Keyword: Computer science,
Literature and linguistics,
Data processing,
Applied linguistics,
Computing linguistics,
Grammatical constructions,
Language learning,
Inverted files,
Queries,
Pattern retrieval
Full text: Texto completo (Ver HTML)