A trigram part-of-speech tagger for the Apertium free/open-source machine translation platform

  • Authors: Zaid Md Abdul Wahab Sheikh, Felipe Sánchez Martínez
  • Published in: Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation: 2-3 November 2009, Universitat d'Alacant / edited by Juan Antonio Pérez Ortiz, Felipe Sánchez Martínez, Francis M. Tyers, 2009, ISBN 978-84-613-6188-5
  • Language: English
  • Abstract
    • This paper describes the implementation of a second-order hidden Markov model (HMM) based part-of-speech tagger for the Apertium free/open-source rule-based machine translation platform. We describe the part-of-speech (PoS) tagging approach in Apertium and how it is parametrised through a tagger definition file that defines: (1) the set of tags to be used and (2) constraint rules that can be used to forbid certain PoS tag sequences, thus refining the HMM parameters and increasing tagging accuracy. The paper also reviews the Baum-Welch algorithm used to estimate the HMM parameters and compares the tagging accuracy of the new tagger with that of the original, first-order HMM-based PoS tagger in Apertium.

