Clinical Text Mining in Spanish Enhanced by NegationDetection and Named Entity Recognition

Tamayo Herrera, Antonio Jesús; Burgos, Diego A.; Gelbukh, Alexander


Título del documento:	Clinical Text Mining in Spanish Enhanced by NegationDetection and Named Entity Recognition
Revista:	Computación y sistemas
Base de datos:
Número de sistema:	000607846
ISSN:	1405-5546
Autores:	Tamayo Herrera, Antonio Jesús¹ Burgos, Diego A.² Gelbukh, Alexander¹
Instituciones:	¹Instituto Politécnico Nacional, Centro de Investigación en Computación, México ²Wake Forest University, Department of Spanish, Winston-Salem, North Carolina. Estados Unidos
Año:	2023
Periodo:	Oct-Dic
Volumen:	27
Número:	4
Paginación:	1169-1181
País:	México
Idioma:	Inglés
Resumen en inglés	Automatic identification of negation, uncertainty, and named entities are tasks of vital importance for clinical text mining. While several works have been published in English, only in recent years Spanish cases have been considered. In this work, we present a transfer learning framework based on a RoBERTa model pre-trained with biomedical documents and on multilingual BERT to identify diseases and organisms mentions as well as negations and uncertainty cues and scopes as a sequence labeling problem, utilizing the fact clinical datasets in Spanish for these four tasks. Our approach achieves results comparable to the state-of-the-art organism mentions identification and negation identification, competitive results in identifying diseases, and establishing state-of-the-art for uncertainty identification. Additionally, to remedy the lack of a unified dataset for the four tasks addressed, models to tackle them have been integrated into a web application that we built to allow effective clinical text mining in Spanish. The source code of this work is publicly available as well as the web application.
Keyword:	Clinical text mining, Negation scope detection, Uncertainty scope detection, Diseases, Organisms mentions identification
Texto completo:	Texto completo (Ver PDF) Texto completo (Ver HTML)

Clinical Text Mining in Spanish Enhanced by NegationDetection and Named Entity Recognition

Espere un momento...