Ir al contenido

Documat


A machine learning method for identifying impersonal constructions and zero pronouns in Spanish

  • Autores: Luz Rello Sánchez Árbol académico, Pablo Suárez García, Ruslan Mitkov Árbol académico
  • Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 45, 2010, págs. 281-286
  • Idioma: inglés
  • Títulos paralelos:
    • Un método de aprendizaje automático para la identificación de construcciones impersonales y pronombres cero en español
  • Enlaces
  • Resumen
    • español

      En este trabajo se presenta un método basado en aprendizaje automático para la clasificación de la elipsis del sujeto como referencial o no referencial en español. Se trata, tal como se desprende de la revisión bibliográfica realizada, del primer intento de identificar construcciones impersonales no referenciales en esta lengua. Una evaluación del sistema con un corpus de entrenamiento formado por 6.827 verbos anotados ha mostrado que alcanza una exactitud del 87%.

    • English

      In this paper, we present a machine learning system for classifying subject ellipsis in Spanish as either referential or non-referential. To the best of our knowledge, this is the first attempt to automatically identify non-referential ellipsis in Spanish. An evaluation of our system against 6,827 finite verbs shows an accuracy of 87%.

  • Referencias bibliográficas
    • Bergsma, S., D. Lin, and R. Goebel. 2008. Distributional identication of nonreferential pronouns. In Proceedings of the 46th Annual Meeting...
    • Brucart, J. M. 1999. La elipsis. In I. Bosque and V. Demonte, editors, Gramatica descriptiva de la lengua espa~nola, volume 2. Espasa-Calpe,...
    • Chinchor, N. and L. Hirschman. 1997. MUC- 7 Coreference task denition (version 3.0). In Proceedings of the MUC-97. Chomsky, N. 1981. Lectures...
    • Cleary, J.G. and L.E. Trigg. 1995. K*: an instance-based learner using an entropic distance measure. In Proceedings of the 12th ICML-95, pages...
    • Danlos, L. 2005. Automatic recognition of French expletive pronoun occurrences. In Robert Dale, Kam-Fai Wong, Jiang Su, and Oi Yee Kwong,...
    • Ferrández, A., A. Palomar, and L. Moreno. 1999. An empirical approach to Spanish anaphora resolution. Machine Translation, 14(3/4):191{216. Ferrández,...
    • Hall, M., E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten. 2009. The WEKA data mining software: an update. SIGKDD Explorations,...
    • Han, N. 2004. Korean null pronouns: classification and annotation. In Proceedings of the Workshop on Discourse Annotation. 42nd Annual Meeting...
    • Mitkov, R. 2002. Anaphora resolution. Longman, London. Mitkov, R. 2010. Discourse processing. In Alexander Clark, Chris Fox, and Shalom Lappin,...
    • Okumura, M. and K. Tamura. 1996. Zero pronoun resolution in Japanese discourse based on centering theory. In Proceedings of the 16th COLING-96,...
    • Peral, J. and A. Ferrandez. 2000. Generation of Spanish zero-pronouns into English. In D. N. Christodoulakis, editor, Natural Language Processing....
    • Real Academia Espa~nola. 2001. Diccionario de la lengua espa~nola. Espasa-Calpe, Madrid, 22 edition.
    • Real Academia Espa~nola. 2009. Nueva gramática de la lengua espa~nola. Espasa- Calpe, Madrid.
    • Recasens, M. and E. Hovy. 2009. A deeper look into features for coreference resolution. In Lalitha Devi Sobha,
    • Antonio Branco, and Ruslan Mitkov, editors, Anaphora Processing and Applications. Proceedings of the 7th DAARC-09. Springer, Berlin, Heidelberg,...
    • Rello, L. 2010. Elliphant: A machine learning method for identifying subject ellipsis and impersonal constructions in spanish. Master's...
    • Rello, L. and I. Illisei. 2009. A rule-based approach to the identication of Spanish zero pronouns. In Student Research Workshop. RANLP-09,...
    • Steinberger, J., M. Poesio, M. A. Kabadjov, and K. Jeek. 2007. Two uses of anaphora resolution in summarization. Information Processing and...
    • Tapanainen, P. and T. Jarvinen. 1997. A non-projective dependency parser. In Proceedings of the 5th Conference on ANLP-97, pages 64{71.
    • Witten, I. H. and E. Frank. 2005. Data mining: practical machine learning tools and techniques. Morgan Kaufmann, London, 2 edition.
    • Yeh, C. and Y. Chen. 2003. Zero anaphora resolution in Chinese with partial parsing based on centering theory. In Proceedings of the International...
    • Zhao, S. and H.T. Ng. 2007. Identication and resolution of Chinese zero pronouns: a machine learning approach. In Proceedings of the 2007...

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno