Revista: | Computación y sistemas |
Base de datos: | |
Número de sistema: | 000560355 |
ISSN: | 1405-5546 |
Autores: | Pathak, Amarnath1 Pakray, Partha2 Gelbukh, Alexander3 |
Instituciones: | 1National Institute of Technology Mizoram, Department of Computer Science and Engineering, Aizawl. India 2National Institute of Technology Silchar, Department of Computer Science and Engineering, Assam. India 3Instituto Politécnico Nacional, Centro de Investigación en Computación, Ciudad de México. México |
Año: | 2018 |
Periodo: | Jul-Sep |
Volumen: | 22 |
Número: | 3 |
Paginación: | 819-833 |
País: | México |
Idioma: | Inglés |
Tipo de documento: | Artículo |
Resumen en inglés | Intricate math formulae, which majorly constitute the content of scientific documents, add to the complexity of scientific document retrieval. Although modifications in conventional indexing and search mechanisms have eased the complexity and exhibited notable performance, the formula embedding approach to scientific document retrieval sounds equally appealing and promising. Formula Embedding Module of the proposed system uses a Bit Position Information Table to transform math formulae, contained inside scientific documents, into binary formulae vectors. Each set bit of a formula vector designates presence of a specific mathematical entity. Mathematical user query is transformed into query vector, in similar fashion, and the corresponding relevant documents are retrieved. Relevance of a search result is characterized by extent of similarity between the indexed formula vector and the query vector. Promising performance, under moderately constrained situation, substantiates competence of the proposed approach. |
Disciplinas: | Ciencias de la computación |
Palabras clave: | Programación, Incrustación de fórmulas, Búsqueda de fórmulas matemáticas, Documento científico, Recuperación de información, Procesamiento de lenguaje natural, Consulta de usuario, Variable de consulta, Indexador, Precisión |
Keyword: | Consultation variable, Indexador, Natural language processing, Scientific document, User consultation, Formula embedding, Math formula search, Precision, Programming, Information retrieval |
Texto completo: | Texto completo (Ver HTML) Texto completo (Ver PDF) |