Ir al contenido

Documat


Cross-Document Event Ordering through Temporal Relation Inference and Distributional Semantic Models

  • Autores: Estela Saquete Boró Árbol académico, Borja Navarro Colorado Árbol académico
  • Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 58, 2017, págs. 61-68
  • Idioma: inglés
  • Títulos paralelos:
    • Ordenación de eventos multidocumento usando inferencia de relaciones temporales y modelos semánticos distribucionales
  • Enlaces
  • Resumen
    • español

      Este artículo se centra en estudiar la contribución que la inferencia de relaciones temporales y los modelos semánticos distribucionales hacen a la tarea de ordenación de eventos. Nuestro sistema construye automáticamente líneas de tiempo con eventos extraídos de diferentes documentos escritos en inglés. Para ello realiza primero una agrupación temporal y posteriormente una agrupación semántica. Para determinar la compatibilidad temporal se realiza una inferencia sobre las relaciones temporales entre los eventos extraídos de un sistema automático de procesamiento de información temporal. Para la compatibilidad semántica entre eventos hemos analizado dos modelos semánticos distribucionales distintos: LDA Topic Modeling y Word2Vec Word Embeddings. Ambos modelos semánticos junto con la inferencia temporal han sido evaluados bajo el marco de evaluación de SemEval 2015 Task 4 Track B. Los experimentos muestran que, usando ambos modelos se mejora el estado del arte actual, implicando un avance importante en la tarea de ordenación de eventos multidocumento.

    • English

      This paper focuses on the contribution of temporal relations inference and distributional semantic models to the event ordering task. Our system automatically builds ordered timelines of events from different written texts in English by performing first temporal clustering and then semantic clustering. In order to determine temporal compatibility, an inference from the temporal relationships between events –automatically extracted from a Temporal Information Processing system– is applied. Regarding semantic compatibility between events, we analyze two different distributional semantic models: LDA Topic modeling and Word2Vec word embeddings. Both semantic models together with the temporal inference have been evaluated within the framework of SemEval 2015 Task 4 Track B. Experiments show that, using both models, the current State of the Art is improved, showing significant advance in the Cross-Document Event Ordering task.

  • Referencias bibliográficas
    • Bagga, A. and B. Baldwin. 1999. Cross document event coreference: Annotations, experiments, and observations. In In Proc. ACL-99 Workshop...
    • Baroni, M., G. Dinu, and G. Kruszewski. 2014. Don’t count, predict! a systematic comparison of context-counting vs. contextpredicting semantic...
    • Bejan, C. A. and S. Harabagiu. 2014. Unsupervised Event Coreference Resolution. Computational Linguistics, 40(2):311–347.
    • Blei, D. M., A. Y. Ng, and M. I. Jordan. 2003. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3:993–1022.
    • Caselli, T., A. Fokkens, R. Morante, and P. Vossen. 2015. SPINOZA VU: An NLP Pipeline for Cross Document TimeLines. In Proceedings of the...
    • Collobert, R., J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu, and P. Kuksa. 2011. Natural Language Processing (Almost) from Scratch. Journal...
    • Cybulska, A. and P. Vossen. 2013. Semantic relations between events and their time, locations and participants for event coreference resolution....
    • Goyal, K., S. K. Jauhar, H. Li, M. Sachan, S. Srivastava, and E. H. Hovy. 2013. A structured distributional semantic model for event co-reference....
    • Ji, H., R. Grishman, Z. Chen, and P. Gupta. 2009. Cross-document event extraction and tracking: Task, evaluation, techniques and challenges....
    • Landauer, T. K. and S. T. Dumais. 1997. A solution to Plato’s problem: the latent semantic analysis theory of acquisition, induction and representation...
    • Laparra, E., I. Aldabe, and G. Rigau. 2015. Document level time-anchoring for timeline extraction. In Proceedings of the 53rd Annual Meeting...
    • Lee, H., M. Recasens, A. Chang, M. Surdeanu, and D. Jurafsky. 2012. Joint entity and event coreference resolution across documents. In Proceedings...
    • Li, P., Q. Zhu, and X. Zhu. 2011. A clustering and ranking based approach for multidocument event fusion. In Software Engineering, Artificial...
    • Llorens, H., E. Saquete, and B. Navarro Colorado. 2012. Automatic System for Identifying and Categorizing Temporal Relations in Natural...
    • Lu, J. and V. Ng. 2016. Event coreference resolution with multi-pass sieves. In Proceedings of the Tenth International Conference on Language...
    • Manning, C. D., M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, and D. McClosky. 2014. The Stanford CoreNLP natural language processing toolkit....
    • Mikolov, T., I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. 2013. Distributed representations of words and phrases and their compositionality....
    • Minard, A.-L., M. Speranza, E. Agirre, I. A. adn Marieke van Erp, B. Magnini, G. Rigau, and R. Urizar. 2015. Semeval-2015 task 4: Timeline:...
    • Mitchell, J. and M. Lapata. 2010. Composition in Distributional Models of Semantics. Cognitive Science, 34:1388–1429.
    • Moulahi, B., J. Strötgen, M. Gertz, and L. Tamine. 2015. Heideltoul: A baseline approach for cross-document event ordering. In Proceedings...
    • Navarro-Colorado, B. and E. Saquete. 2015. GPLSIUA: Combining Temporal Information and Topic Modeling for Cross-Document Event Ordering. In...
    • Navarro-Colorado, B. and E. Saquete. 2016. Cross-document event ordering through temporal, lexical and distributional knowledge. Knowledge-Based...
    • Palmer, M., D. Gildea, and P. Kingsbury. 2005. The Proposition Bank: An Annotated Corpus of Semantic Roles. Computational Linguistics, 31.
    • Saurí, R., J. Littman, R. Knippen, R. Gaizauskas, A. Setzer, and J. Pustejovsky, 2006. TimeML Annotation Guidelines 1.2.1 (http://www.timeml.org/).
    • Sun, W., A. Rumshisky, and O. Uzuner. 2013. Evaluating temporal relations in clinical text: 2012 i2b2 challenge. In J Am Med Inform Assoc.,...
    • UzZaman, N., H. Llorens, L. Derczynski, J. Allen, M. Verhagen, and J. Pustejovsky. 2013. Semeval-2013 task 1: Tempeval-3: Evaluating time...
    • Verhagen, M., R. Gaizauskas, M. Hepple, F. Schilder, G. Katz, and J. Pustejovsky. 2007. Semeval-2007 task 15: Tempeval temporal relation...
    • Verhagen, M., R. Saurí, T. Caselli, and J. Pustejovsky. 2010. Semeval-2010 task 13: Tempeval-2. In Proceedings of the 5th International Workshop...
    • Yang, B., C. Cardie, and P. I. Frazier. 2015. A hierarchical distance-dependent bayesian model for event coreference resolution. TACL, 3:517–528.

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno