Extracting Context of Math Formulae Contained inside Scientific Documents

Pathak, Amarnath; Das, Ranjita; Pakray, Partha; Gelbukh, Alexander


Título del documento:	Extracting Context of Math Formulae Contained inside Scientific Documents
Revista:	Computación y sistemas
Base de datos:
Número de sistema:	000560425
ISSN:	1405-5546
Autores:	Pathak, Amarnath¹ Das, Ranjita¹ Pakray, Partha² Gelbukh, Alexander³
Instituciones:	¹National Institute of Technology Mizoram, Department of Computer Science and Engineering, Mizoram, Aizawl. India ²National Institute of Technology Silchar, Department of Computer Science and Engineering, Assam. India ³Instituto Politécnico Nacional, Ciudad de México. México
Año:	2019
Periodo:	Jul-Sep
Volumen:	23
Número:	3
Paginación:	803-818
País:	México
Idioma:	Inglés
Tipo de documento:	Artículo
Resumen en inglés	A math formula present inside a scientific document is often preceded by its textual description, which is commonly referred to as the context of formula. Annotating context to the formula enriches its semantics, and consequently impacts the retrieval of mathematical contents from scientific documents. Also, with a considerable surety, a context can be assumed to be one of the Noun Phrases (NPs) of the sentence in which formula occurs. However, the presence of several different misleading NPs in the sentence necessitates extraction of an NP, which is more precise to the formula than the rest. Although a fair number of methods are developed for precise context extraction, it can be fascinating to prospect other competent techniques which can further their performances. To this end, this paper discusses implementation of an automated context extraction system, which follows certain heuristics in assigning weights to different candidate NPs, and tune those weights using a development set comprising annotated formulae. The implemented system significantly outperforms nearest noun and sentence-pattern based methods on the ground of F-score.
Disciplinas:	Ciencias de la computación
Palabras clave:	Procesamiento de datos, Inteligencia artificial
Keyword:	Context extraction, Math information retrieval, NTCIR, Parser, Noun phrase, Data processing, Artificial intelligence
Texto completo:	Texto completo (Ver HTML) Texto completo (Ver PDF)

Extracting Context of Math Formulae Contained inside Scientific Documents

Espere un momento...