Automatic speech recognizers for Mexican Spanish and its open resources



Título del documento: Automatic speech recognizers for Mexican Spanish and its open resources
Revista: Journal of applied research and technology
Base de datos: PERIÓDICA
Número de sistema: 000427963
ISSN: 1665-6423
Autores: 1
2
1
Instituciones: 1Universidad Nacional Autónoma de México, Laboratorio de Tecnologías del Lenguaje, Ciudad de México. México
2Universidad Nacional Autónoma de México, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Ciudad de México. México
Año:
Periodo: Jun
Volumen: 15
Número: 3
País: México
Idioma: Inglés
Tipo de documento: Artículo
Enfoque: Aplicado, descriptivo
Resumen en inglés Development of automatic speech recognition systems relies on the availability of distinct language resources such as speech recordings, pronunciation dictionaries, and language models. These resources are scarce for the Mexican Spanish dialect. In this work, we present a revision ofthe CIEMPIESS corpus that is a resource for spontaneous speech recognition in Mexican Spanish of Central Mexico. It consists of 17 h of segmented and transcribed recordings, a phonetic dictionary composed by 53,169 unique words, and a language model composed by 1,505,491 words extracted from 2489 university news letters. We also evaluate the CIEMPIESS corpus using three well known state of the art speech recognition engines, having satisfactory results. These resources are open for research and development in the field. Additionally, we present the methodology and the tools used to facilitate the creation of these resources which can be easily adapted to other variants of Spanish, or even other languages
Disciplinas: Ciencias de la computación,
Literatura y lingüística
Palabras clave: Lingüística aplicada,
Procesamiento de datos,
Reconocimiento automático del habla,
Modelos de lenguajes,
Modelos acústicos
Keyword: Applied linguistics,
Data processing,
Automatic speech recognition,
Language models,
Acoustic models
Texto completo: Texto completo (Ver HTML) Texto completo (Ver PDF)