Automatic WordNet Construction Using Markov Chain Monte Carlo



Título del documento: Automatic WordNet Construction Using Markov Chain Monte Carlo
Revista: Polibits
Base de datos: PERIÓDICA
Número de sistema: 000373733
ISSN: 1870-9044
Autores: 1
1
1
1
Instituciones: 1University of Tehran, College of Engineering, Teherán. Irán
Año:
Periodo: Jul-Dic
Número: 46
Paginación: 13-22
País: México
Idioma: Inglés
Tipo de documento: Artículo
Enfoque: Experimental, aplicado
Resumen en inglés WordNet is used extensively as a major lexical resource in information retrieval tasks. However, the qualities of existing Persian WordNets are far from perfect. They are either constructed manually which limits the coverage of Persian words, or automatically which results in unsatisfactory precision. This paper presents a fully-automated approach for constructing a Persian WordNet: A Bayesian Model with Markov chain Monte Carlo (MCMC) estimation. We model the problem of constructing a Persian WordNet by estimating the probability of assigning senses (synsets) to Persian words. By applying MCMC techniques in estimating these probabilities, we integrate prior knowledge in the estimation and use the expected value of generated samples to give the final estimates. This ensures great performance improvement comparing with Maximum-Likelihood and Expectation-Maximization methods. Our acquired WordNet has a precision of 90.46% which is a considerable improvement in comparison with automatically-built WordNets in Persian
Disciplinas: Ciencias de la computación
Palabras clave: Lingüística computacional,
Redes semánticas,
Ontología,
Inferencia bayesiana
Keyword: Computer science,
Computing linguistics,
Semantic networks,
Ontology,
Bayesian inference
Texto completo: Texto completo (Ver HTML)