The Forest Lion and the Bull: Morphosyntactic Annotation of the Panchatantra



Título del documento: The Forest Lion and the Bull: Morphosyntactic Annotation of the Panchatantra
Revista: Computación y sistemas
Base de datos:
Número de sistema: 000560394
ISSN: 1405-5546
Autors: 1
Institucions: 1Charles University, Faculty of Mathematics and Physics, Praha. Czechia
Any:
Període: Oct-Dic
Volum: 22
Número: 4
Paginació: 1377-1384
País: México
Idioma: Inglés
Resumen en inglés We present the first freely available dependency treebank of Sanskrit. It is based on text from Panchatantra, an ancient Indian collection of fables. The annotation scheme we chose is that of Universal Dependencies, a current de-facto standard for cross-linguistically comparable morphological and syntactic annotation. In the present paper, we discuss word segmentation issues, morphological inventory and certain interesting syntactic constructions in the light of the Universal Dependencies guidelines. We also present an initial parsing experiment.
Keyword: Dependency syntax,
Morphology,
Word segmentation,
Tokenization,
Treebank,
Sanskrit
Text complet: Texto completo (Ver HTML) Texto completo (Ver PDF)