Extract Reliable Relations from Wikipedia Texts for Practical Ontology Construction



Document title: Extract Reliable Relations from Wikipedia Texts for Practical Ontology Construction
Journal: Computación y sistemas
Database: PERIÓDICA
System number: 000411065
ISSN: 1405-5546
Authors: 1
2
3
1
Institutions: 1Electronics and Telecommunications Research Institute, Automatic Speech Translation and Language Intelligence Research Department, Daejeon. Corea del Sur
2Chonbuk National University, Division of Computer Science and Engineering, Jeonju. Corea del Sur
3Korea Advanced Institute of Science and Technology, School of Computing, Daejeon. Corea del Sur
Year:
Season: Jul-Sep
Volumen: 20
Number: 3
Pages: 467-476
Country: México
Language: Inglés
Document type: Artículo
Approach: Experimental, aplicado
English abstract A feature based relation classification approach is presented in this paper. We aimed to exact relation candidates from Wikipedia texts. A probabilistic and a semantic relatedness features are employed with other linguistic information for the purpose. The experiments show that, relation classification using the proposed relatedness features with surface information like word and part-of-speech tags is competitive with or even outperforms the one of using deep syntactic information. Meanwhile, an approach is proposed to distinguish reliable relation candidates from others, so that these reliable results can be accepted for knowledge building without human verification. The experiments show that, with the relation classification approach presented in this paper, more than 40% of the classification results are reliable, which means, at least 40% of the human and time costs can be saved in practice
Disciplines: Ciencias de la computación,
Literatura y lingüística
Keyword: Procesamiento de datos,
Lingüística aplicada,
Lingüística computacional,
Análisis de textos,
Extracción de información,
Ontología
Keyword: Computer science,
Literature and linguistics,
Data processing,
Applied linguistics,
Computing linguistics,
Text analysis,
Information extraction,
Ontology
Full text: Texto completo (Ver HTML)