EusHeidelTime: Time Expression Extraction and Normalisation for Basque

  • Authors: Begoña Altuna, María Jesús Aranzabe Urruzola, Arantza Díaz de Ilarraza Sánchez
  • Published in: Procesamiento del lenguaje natural, ISSN 1135-5948, No. 59, 2017, pp. 15-22
  • Language: English
  • Parallel titles:
    • EusHeidelTime: extracción y normalización de expresiones temporales para el euskera
  • Abstract

      Temporal information helps organise the information in texts by placing actions and states in time. It is therefore important to identify the time points and intervals in a text, as well as the times they refer to. We developed EusHeidelTime for time expression extraction and normalisation in Basque. To this end, we analysed time expressions in Basque, created the rules and resources for the tool, and built corpora for development and testing. Finally, we ran an experiment to evaluate EusHeidelTime's performance. We achieved satisfactory results for a morphologically rich language.

