Journal: | Computación y sistemas |
Database: | PERIÓDICA |
System number: | 000423269 |
ISSN: | 1405-5546 |
Authors: | Asnani, Kavita (1); Pawar, Jyoti D. (1) |
Institutions: | (1) Goa University, Department of Computer Science and Technology, Panaji, Goa, India |
Year: | 2018 |
Period: | Jan-Mar |
Volume: | 22 |
Issue: | 1 |
Country: | Mexico |
Language: | English |
Document type: | Article |
Approach: | Applied, descriptive |
English abstract: | With the recent advances in and popularity of social networking forums, millions of people connected through the World Wide Web routinely communicate in multiple languages. This has generated large volumes of unstructured code-mixed social media text in which useful aspects of information are highly dispersed. Aspect-based opinion mining relates opinion targets to their polarity values in a specific context. Because aspects are often implicit, detecting and retrieving them is difficult, and the linguistic complexities of code-mixed social media text make the task harder still. As a standard technique, topic modeling has the potential to extract aspects pertaining to opinion data from large text collections, which not only retrieves implicit aspects but also clusters them together. In this paper we propose a knowledge-based, language-independent code-mixed semantic LDA (lcms-LDA) model that aims to improve the coherence of clusters. We find that the proposed lcms-LDA model infers topic distributions without a language barrier, based on the semantics associated with words. Our experimental results show an increase in the UMass and KL divergence scores, indicating improved coherence and distinctiveness of the aspect clusters in comparison with state-of-the-art techniques for aspect extraction from code-mixed data. |
Disciplines: | Computer science, Literature and linguistics |
Keywords: | Applied linguistics, Code mixing, Text analysis, Semantic clustering |