Automatic Opinion Extraction from Short Hebrew Texts Using Machine Learning Techniques



Document title: Automatic Opinion Extraction from Short Hebrew Texts Using Machine Learning Techniques
Journal: Computación y Sistemas
Database:
System number: 000560412
ISSN: 1405-5546
Authors: 1
1
2
Institutions: 1 Bar-Ilan University, Department of Computer Science, Ramat-Gan, Israel
2 Jerusalem College of Technology, Department of Computer Science, Jerusalem, Israel
Year:
Period: Oct-Dec
Volume: 22
Issue: 4
Pages: 1347-1357
Country: Mexico
Language: English
Abstract (English): Sentiment analysis deals with classifying written texts according to their polarity. Previous research on this topic has been conducted mostly for Latin languages, and no research has been done for Hebrew. This matters because text classification turns out to be extremely language-dependent. Furthermore, work on sentiment analysis for English texts has mostly been performed on relatively long documents. In this work, we focus specifically on classifying Modern Hebrew sentences according to their polarity. We compare various machine learning algorithms and classification techniques. We add optimizations and methods that have not previously been used, and adjust commonly used techniques to suit a Hebrew corpus. We elaborate on the differences between classifying short texts and long ones, and on what is unique about working specifically with Hebrew. Finally, our model achieves nearly 93% accuracy, which is higher than accuracies previously achieved in this field.
Disciplines: Computer science
Keywords (translated from Spanish): Artificial intelligence
Keywords (English): Automatic classification,
Machine learning,
Sentiment analysis,
Short Hebrew texts,
Artificial intelligence