Journal: | Latin-American Journal of Computing (LAJC) |
Database: | |
System number: | 000565094 |
ISSN: | 1390-9134 |
Authors: | Charro, Francisco (1); Herrera, Marco (1); Pozo, Nataly (1); Rosales, Andrés (1) |
Institutions: | (1) Escuela Politécnica Nacional, |
Year: | 2017 |
Volume: | 4 |
Issue: | 3 |
Pages: | 49-54 |
Country: | Ecuador |
Language: | English |
Abstract (English): | This article analyzes vector representations of phonemes as an alternative for improving a language identification (LID) system. The CBOW (Continuous Bag-of-Words) and Skip-gram architectures proposed by Mikolov are studied. These models learn n-dimensional vectors by predicting words within a context. This work analyzes the application of these models to smaller phonetic units or n-grams. |
Keywords: | embeddings, n-grams, Skip-gram, Language Recognition, Vector Representation |
Full text: | Full text (View PDF) |