Revista: | Polibits |
Base de datos: | PERIÓDICA |
Número de sistema: | 000359043 |
ISSN: | 1870-9044 |
Autores: | Han, Chao1 Liu, Yicheng1 Hao, Yu1 Zhu, Xiaoyan1 |
Instituciones: | 1Tsinghua University, Department of Computer Science and Technology, Beijing. China |
Año: | 2011 |
Periodo: | Ene-Jun |
Número: | 43 |
País: | México |
Idioma: | Inglés |
Tipo de documento: | Artículo |
Enfoque: | Analítico, descriptivo |
Resumen en inglés | With the development of Web 2.0, more and more people contribute their knowledge to the Internet. Many general and domain–specific online encyclopedia resources become available, and they are valuable for many Natural Language Processing (NLP) applications, such as summarization and question–answering. We propose a novel encyclopedia–specific method to retrieve passages which are semantically related to a short query (usually comprises of only one word/phrase) from a given article in the encyclopedia. The method captures the expression word features and categorical word features in the surrounding snippets of the aspect words by setting up massive hybrid language models. These local models outperform the global models such as LSA and ESA in our task |
Disciplinas: | Ciencias de la computación, Literatura y lingüística |
Palabras clave: | Procesamiento de datos, Lingüística aplicada, Lingüística computacional, Procesamiento de lenguaje natural, Enciclopedias, Consulta, Recuperación de información |
Keyword: | Computer science, Literature and linguistics, Data processing, Applied linguistics, Computing linguistics, Natural language processing, Encyclopedias, Queries, Information retrieval |
Texto completo: | Texto completo (Ver HTML) |