Journal: | Latin-American Journal of Computing (LAJC) |
Database: | |
System number: | 000565094 |
ISSN: | 1390-9134 |
Authors: | Charro, Francisco1 Herrera, Marco1 Pozo, Nataly1 Rosales, Andrés1 |
Institutions: | 1Escuela Politécnica Nacional, |
Year: | 2017 |
Volumen: | 4 |
Number: | 3 |
Pages: | 49-54 |
Country: | Ecuador |
Language: | Inglés |
English abstract | This article analyzes vector representation of phonemes as an alternative to improve a language identification system (LID). CBOW (Continuous Bag-of-Words) and Skip-gram architectures proposed by Mikolov are studied. These models allow predicting words within a context by generating n-dimensional vectors. In this work we will analyze the application of these models in smaller phonetic units or n-grams. |
Keyword: | embeddings., n-grams, Skip-gram, Language Recognition, Vector Representation |
Full text: | Texto completo (Ver PDF) |