Ir al contenido

Documat


Evaluation of an automatic process for specialized web corpora collection and term extraction for Basque

  • Autores: Antton Gurrutxaga Hernaiz, Igor Leturia Azkarate Árbol académico, Elisabete Pociello Irigoyen Árbol académico, Xavier Saralegi Urizar, Iñaki San Vicente
  • Localización: E-lexicography in the 21st century: New challenges, new applications : proceedings of eLex 2009, Louvain-la Neuve, 22-24 october 2009 / Sylviane Granger (ed. lit.), Magali Paquot (ed. lit.), 2010, ISBN 978-2-87463-211-2, págs. 97-107
  • Idioma: inglés
  • Enlaces
  • Resumen
    • In this paper we describe the processes for collecting Basque specialized corpora in different domains from the Internet and subsequently extracting terminology out of them, using automatic tools in both cases. We evaluate the results of corpus compiling and term extraction by making use of a specialized dictionary recently updated by experts. We also compare the results of the automatically collected web corpus with those of a traditionally collected corpus, in order to analyze the usefulness of the Internet as a reliable source of information for terminology tasks.

  • Referencias bibliográficas

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno