Arabic Dialect Identification based on Probabilistic-Phonetic Modeling



Document title: Arabic Dialect Identification based on Probabilistic-Phonetic Modeling
Journal: Computación y sistemas
Database:
System number: 000560346
ISSN: 1405-5546
Authors: 1
2
2
Institutions: 1 LaTICE laboratory, Tunisia
2 Réseau National Universitaire Tunisien, Faculty of Sciences, Tunisia
Year:
Period: Jul-Sep
Volume: 22
Issue: 3
Pages: 863-870
Country: Mexico
Language: English
Abstract: Identifying the Arabic dialect is considered the first preprocessing component for any natural language processing problem. This task is useful for automatic translation, information retrieval, opinion mining, and sentiment analysis. To this end, we propose a statistical approach based on phonetic modeling to identify the Arabic dialect corresponding to each input acoustic signal. The approach has four steps. First, for each dialect, we compute a reference phonetic model. Second, for every input audio signal, we compute its own phonetic model. Third, we compare the latter to all reference Arabic dialect models. Finally, we assign the input acoustic signal to the dialect whose reference phonetic model is closest to it under the cosine similarity measure. The results are satisfactory: on 117 audio sequences, we attain a classification rate of 93%. Given these results and the coverage of most Arabic dialects, this study can serve as a reference for future work on dialectal speech processing applications.
Disciplines: Computer science
Subject keywords: Artificial intelligence
Keywords: Arabic dialects,
Probabilistic-phonetic model,
Dialect identification,
Cosine similarity,
Artificial intelligence
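The four steps described in the abstract (build a reference model per dialect, build a model for the input, compare, pick a dialect) can be illustrated with a minimal Python sketch. Several things here are assumptions rather than details from the record: the phonetic model is taken to be a relative-frequency distribution over phoneme labels, the phoneme sequences are assumed to come from an upstream phoneme recognizer that is not shown, and the selected dialect is the one whose reference model is closest to the input model, that is, the one with the highest cosine similarity (equivalently, the lowest cosine distance). The function names and the toy data are hypothetical.

```python
from collections import Counter
from math import sqrt


def phonetic_model(phonemes):
    """Relative frequency of each phoneme label (assumed form of the model)."""
    counts = Counter(phonemes)
    total = sum(counts.values())
    return {p: c / total for p, c in counts.items()} if total else {}


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty model is treated as similar to nothing
    return dot / (norm_a * norm_b)


def identify_dialect(phonemes, reference_models):
    """Return (closest_dialect, scores) for an input phoneme sequence."""
    model = phonetic_model(phonemes)
    scores = {
        dialect: cosine_similarity(model, ref)
        for dialect, ref in reference_models.items()
    }
    # Assumption: "closest" means highest cosine similarity.
    return max(scores, key=scores.get), scores


# Toy, made-up phoneme sequences; NOT data from the study.
training = {
    "dialect_A": ["q", "a", "l", "q", "a", "b"],
    "dialect_B": ["g", "a", "l", "g", "a", "b"],
}
reference_models = {d: phonetic_model(seq) for d, seq in training.items()}

best, scores = identify_dialect(["g", "a", "l", "b"], reference_models)
print(best, scores)
```

In this toy setup the input shares the phoneme "g" only with dialect_B, so dialect_B receives the higher score. A real system would need a much richer model than unigram phoneme frequencies, and the 93% rate reported in the abstract says nothing about how this simplified sketch would perform.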