From Sentences to Documents: Extending Abstract Meaning Representation for Understanding Documents

Paloma Moreda Pozo; Armando Suárez Cueto; Elena Lloret Pastor; Estela Saquete Boró; Isabel Moreno

Ayuda

From Sentences to Documents: Extending Abstract Meaning Representation for Understanding Documents

Autores: Paloma Moreda Pozo , Armando Suárez Cueto , Elena Lloret Pastor , Estela Saquete Boró , Isabel Moreno
Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 60, 2018, págs. 61-68
Idioma: inglés
Títulos paralelos:
- De Oraciones a Documentos: extendiendo Abstract Meaning Representation para la comprensión de textos
Enlaces
- Texto completo
Resumen
- español
  The overabundance of information and its heterogeneity requires new ways to access, process and generate knowledge according to the user's needs. To define an appropriate formalism to represent textual information capable to allow machines to perform language understanding and generation will be crucial for achieving these tasks. Abstract Meaning Representation (AMR) is foreseen as a standard knowledge representation that can capture the information encoded in a sentence at various linguistic levels. However, its scope only limits to a single sentence, and it does not benefit from additional semantic information that could help the generation of different types of texts. Therefore, the aim of this paper is to address this limitation by proposing and outlining a method that can extend the information provided by AMR and use it to represent entire documents. Based on our proposal, we will determine a unique, invariant and independent standard text representation, called canonical representation. From it and through a transformational process, we will obtain different text variants that will be appropriate to the users' needs.
- English
  La sobreabundancia de información y su heterogeneidad requieren nuevas formas de acceder, procesar y generar conocimiento de acuerdo con las necesidades del usuario. Por ello, definir un formalismo adecuado para representar la información textual capaz de permitir a los ordenadores comprender y generar el lenguaje, es crucial para lograr esta tarea. Abstract Meaning Representation (AMR) es una representación del conocimiento estándar que puede capturar la información codificada en una oración en varios niveles lingüísticos. Sin embargo, su alcance se limita a una sola oración, y no se beneficia de la información semántica adicional que podría ayudar a la generación de diferentes tipos de textos. En este artículo propondremos un método que amplía la información proporcionada por AMR y la utiliza para representar documentos completos. En base a nuestra propuesta, definiremos una representación de texto estándar única, invariable e independiente, llamada representación canónica. A partir de la cual, y mediante un proceso de transformación, obtendremos diferentes variantes de texto que serán apropiadas para las necesidades de los usuarios
Referencias bibliográficas
- Banarescu, L., C. Bonial, S. Cai, M. Georgescu, K. Griffitt, U. Hermjakob, K. Knight, P. Koehn, M. Palmer, and N. Schneider. 2013. Abstract...
- Clark, K. and C. D. Manning. 2015. Entitycentric coreference resolution with model stacking. In Proceedings of the 53rd Annual Meeting of...
- Damonte, M., S. B. Cohen, and G. Satta. 2017. An incremental parser for abstract meaning representation. In Proceedings of the 15th Conference...
- Dohare, S. and H. Karnick. 2017. Text summarization using abstract meaning representation. CoRR, abs/1706.01678.
- Flanigan, J., C. Dyer, N. A. Smith, and J. Carbonell. 2016. Generation from abstract meaning representation using tree transducers. In Proceedings...
- Flanigan, J., S. Thomson, J. Carbonell, C. Dyer, and N. A. Smith. 2014. A discriminative graph-based parser for the abstract meaning representation....
- Goodman, J., A. Vlachos, and J. Naradowsky. 2016. Noise reduction and targeted exploration in imitation learning for abstract meaning representation...
- Langkilde, I. and K. Knight. 1998. Generation that exploits corpus-based statistical knowledge. In COLING-ACL ’98, Proceedings of the Conference,...
- Liu, F., J. Flanigan, S. Thomson, N. Sadeh, and N. A. Smith. 2015. Toward abstractive summarization using semantic representations. In Proceedings...
- Llorens, H., E. Saquete, and B. NavarroColorado. 2013. Applying Semantic Knowledge to the Automatic Processing of Temporal Expressions and...
- Martínez-Barco, P., A. F. Rodríguez, D. TomaÌs, E. Lloret, E. Saquete, F. Llopis, J. Peral, M. Palomar, J. M. G. Soriano, and M. T. Romà-Ferri....
- Moro, A., F. Cecconi, and R. Navigli. 2014. Multilingual word sense disambiguation and entity linking for everybody. In Proceedings of the...
- Navarro-Colorado, B. and E. Saquete. 2016. Cross-document event ordering through temporal, lexical and distributional knowledge. Knowl.-Based...
- Pourdamghani, N., K. Knight, and U. Hermjakob. 2016. Generating english from abstract meaning representations. In Proceedings of the 9th International...
- Saphra, N. and A. Lopez. 2015. Amrica: an amr inspector for cross-language alignments. In Proceedings of the 2015 Conference of the North...
- Schuler, K. K. 2006. VerbNet: A BroadCoverage, Comprehensive Verb Lexicon. Ph.D. thesis, University of Pennsylvania.
- UzZaman, N., H. Llorens, L. Derczynski, J. Allen, M. Verhagen, and J. Pustejovsky. 2013. Semeval-2013 task 1: Tempeval-3: Evaluating time...
- Vanderwende, L., A. Menezes, and C. Quirk. 2015. An amr parser for english, french, german, spanish and japanese and a new amr-annotated corpus....
- Zhou, J., F. Xu, H. Uszkoreit, W. QU, R. Li, and Y. Gu. 2016. Amr parsing with an incremental joint model. In Proceedings of the 2016 Conference...