Ir al contenido

Documat


A simple approach to use bilingual information sources for word alignment

  • Autores: Miquel Esplà Gomis, Felipe Sánchez Martínez Árbol académico, Mikel L. Forcada Zubizarreta Árbol académico
  • Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 49, 2012, págs. 93-100
  • Idioma: inglés
  • Enlaces
  • Resumen
    • español

      En este artculo se describe un metodo nuevo y sencillo para utilizar fuentes de informacion bilingue para el alineamiento de palabras en segmentos de texto paralelos. Este metodo puede ser utilizado al vuelo, ya que no requiere de entrenamiento. Ademas, puede ser utilizado con corpus comparables. Hemos comparado los resultados de nuestro metodo con los obtenidos por la herramienta GIZA++, ampliamente utilizada para el alineamiento de palabras, obteniendo unos resultados bastante similares.

    • English

      In this paper we present a new and simple method for using sources of bilingual information for word alignment between parallel segments of text. This method can be used on the y, since it does not need to be trained. In addition, it can also be applied on comparable corpora. We compare our method to the state-of-the-art tool GIZA++, widely used for word alignment, and we obtain very similar results.

  • Referencias bibliográficas
    • Al-Onaizan, Y. and K. Knight. 2002. Translating named entities using monolingual and bilingual resources. In Proceedings of the 40th Annual...
    • Brown, P.F., S.A. Della Pietra, V.J. Della Pietra, and R.L. Mercer. 1993. The mathematics of statistical machine translation: Parameter estimation....
    • Dagan, I., K.W. Church, and W.A. Gale. 1993. Robust bilingual word alignment for machine aided translation. In Proceedings of the Workshop...
    • Dempster, A.P., N.M. Laird, and D.B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. volume 39 of Series B. Blackwell...
    • Esplà, M., F. Sánchez-Martínez, and M.L. Forcada. 2011. Using word alignments to assist computer-aided translation users by marking which...
    • Esplà-Gomis, M., F. Sánchez-Martínez, and M.L. Forcada. 2011. Using machine translation in computer-aided translation to suggest the target-side...
    • Esplà-Gomis, M., F. Sánchez-Martínez, and M.L. Forcada. 2012. UAlacant: using online machine translation for crosslingual textual entailment....
    • Forcada, M.L., M. Ginestí-Rosell, J. Nordfalk, J. O'Regan, S. Ortiz-Rojas, J. Pérez-Ortiz, F. Sánchez-Martínez, G. Ramírez-Sánchez, and...
    • Fung, P. and K. McKeown. 1997. Finding terminology translations from non-parallel corpora. pages 192-202.
    • Koehn, P. 2005. Europarl: A parallel corpus for statistical machine translation. In Proceedings of Machine Translation Summit X, pages 79-86,...
    • Koehn, P., H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin,...
    • Koehn, P., F.J. Och, and D. Marcu. 2003. Statistical phrase-based translation. In Proceedings of the 2003 Conference of the North American...
    • Kranias, L. and A. Samiotou. 2004. Automatic translation memory fuzzy match post-editing: A step beyond traditional TM/MT integration. In...
    • Lambert, P., A. De Gispert, R. Banchs, and J. Mari~no. 2005. Guidelines for word alignment evaluation and manual alignment. Language Resources...
    • Och, F.J. and H. Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1):19-51.
    • Rapp, R. 1999. Automatic identification of word translations from unrelated English and German corpora. In Proceedings of the 37th Annual...
    • Schulz, S., K. Markó, E. Sbrissia, P. Nohama, and U. Hahn. 2004. Cognate mapping: a heuristic strategy for the semi-supervised acquisition...
    • Vogel, S., H. Ney, and C. Tillmann. 1996. HMM-based word alignment in statistical translation. In Proceedings of the 16th International Conference...

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno