Journal: | Polibits |
Database: | PERIÓDICA |
System number: | 000359036 |
ISSN: | 1870-9044 |
Authors: | Parveen, Daraksha¹; Sanyal, Ratna¹; Ansari, Afreen¹ |
Institutions: | ¹Indian Institute of Information Technology, Allahabad, Uttar Pradesh, India |
Year: | 2011 |
Period: | Jan-Jun |
Issue: | 43 |
Country: | Mexico |
Language: | English |
Document type: | Article |
Approach: | Analytical, descriptive |
Abstract (English): | This paper presents the identification of clause boundaries for the Urdu language. We use Conditional Random Fields as the classification method, together with clause markers. The clause markers serve to detect the type of subordinate clause, which occurs with or within the main clause. If any misclassification remains after testing with different sentences, more rules are identified to achieve high recall and precision. The results show that this approach efficiently determines the type of subordinate clause and its boundary. |
Disciplines: | Computer science, Literature and linguistics |
Keywords: | Computer science, Literature and linguistics, Data processing, Applied linguistics, Computational linguistics, Clause markers, Conditional random field |
Full text: | Full text (View HTML) |