Interactive Predictive Parsing Framework for the Spanish Language

Ricardo Sánchez Sáez; Luis A. Leiva Torres; Joan Andreu Sánchez; José Miguel Benedí Ruiz

Ayuda

Interactive Predictive Parsing Framework for the Spanish Language

Autores: Ricardo Sánchez Sáez, Luis A. Leiva Torres, Joan Andreu Sánchez , José Miguel Benedí Ruiz
Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 45, 2010, págs. 121-128
Idioma: inglés
Enlaces
- Texto completo
Resumen
- español
  El marco teórico de Parsing Predictivo Interactivo (IPP) permite construir sistemas de anotación sintáctica interactivos. Los anotadores humanos pueden utilizar estos sistemas de ayuda para crear árboles sintácticos con muy poco esfuerzo (en comparación con el trabajo requerido para corregir manualmente árboles obtenidos a partir de un analizador sintáctico completamente automático). En este artículo se presenta la adaptación a la lengua castellana del marco IPP y su herramienta de anotación IPP-Ann, usando modelos obtenidos a partir del UAM Spanish Treebank. Hemos llevado a cabo experimentación simulando al usuario para obtener métricas de evaluación objetivas para nuestro sistema. Estos resultados muestran que el marco IPP aplicado al UAM Spanish Treebank se traduce en una importante cantidad de esfuerzo ahorrado, comparable con el obtenido al aplicar el marco IPP para analizar la lengua inglesa mediante el Penn Treebank.
- English
  The Interactive Predictive Parsing (IPP) framework allows us the construction of interactive tree annotation systems. These can help human annotators in creating error-free parse trees with little effort (compared to manually post-editing the trees obtained from a completely automatic parser). In this paper we adapt the IPP framework and the IPP-Ann annotation tool for parse of the Spanish language, by using models obtained from the UAM Spanish Treebank. We performed user simulation experimentation and obtained objective evaluation metrics. The results establish that the IPP framework over the UAM Treebank shows important amounts of user effort reduction, comparable to the gains obtained when applying IPP to the English language on the Penn Treebank.
Referencias bibliográficas
- Alabau, V., D. Ortiz, V. Romero, and J. Ocampo. 2009. A multimodal predictive-interactive application for computer assisted transcription...
- Barrachina, S., O. Bender, F. Casacuberta, J. Civera, E. Cubel, S. Khadivi, A. Lagarda, H. Ney, J. Tom´as, E. Vidal, and J.M. Vilar. 2009....
- Carter, D. 1997. The TreeBanker. A tool for supervised training of parsed corpora. In Proceedings of the Workshop on Computational Environments...
- Collins, M. 2003. Head-driven statistical models for natural language parsing. Computational linguistics, 29(4):589–637.
- de la Clergerie, E.V., O. Hamon, D. Mostefa, C. Ayache, P. Paroubek, and A. Vilnat. 2008. Passage: from French parser evaluation to large...
- Hiroshi, I., N. Masaki, H. Taiichi, T. Takenobu, and T. Hozumi. 2005. eBonsai: An integrated environment for annotating treebanks. In Second...
- Huang, L. 2008. Forest reranking: Discriminative parsing with non-local features. In Proc. of ACL. Citeseer.
- Klein, D. and C.D. Manning. 2001. Parsing with treebank grammars: Empirical bounds, theoretical models, and the structure of the Penn treebank....
- Klein, D. and C.D. Manning. 2003. Accurate unlexicalized parsing. In Proceedings of the 41st Annual Meeting on Association for Computational...
- McClosky, D., E. Charniak, and M. Johnson. 2006. Effective self-training for parsing. In Proceedings of the Human Language Technology Conference...
- Moreno, A., R. Grishman, S. López, F. Sanchez, and S. Sekine. 2000. A treebank of Spanish and its application to parsing. In Proceedings of...
- Oepen, S., D. Flickinger, K. Toutanova, and C.D. Manning. 2004. LinGO Redwoods. Research on Language & Computation, 2(4):575–596.
- Ortiz, D., L.A. Leiva, V. Alabau, and F. Casacuberta. 2010. Interactive machine translation using a web-based architecture. In Procedings...
- Petrov, S. and D. Klein. 2007. Improved inference for unlexicalized parsing. In Proceedings of NAACL HLT 2007, pages 404– 411.
- Romero, V., L.A. Leiva, A.H. Toselli, and E. Vidal. 2009. Interactive multimodal transcription of text imagse using a webbased demo system....
- Sánchez-Sáez, R., L.A. Leiva, J.A. S´anchez, and J.M. Bened´ı. 2010. Interactive predictive parsing using a web-based architecture. In Proceedings...
- Sánchez-Sáez, R., J.A. Sánchez, and J.M. Benedí. 2009. Interactive predictive parsing. In Proceedings of the 11th International Conference...
- Toselli, A.H., V. Romero, and E. Vidal. 2008. Computer assisted transcription of text images and multimodal interaction. In Proceedings of...