Extractive multi-document summarization using semantic graphs (original title: Generación de resúmenes extractivos de múltiples documentos usando grafos semánticos)

José Ángel Olivas Varela; Francisco Pascual Romero Chicharro; Oleyda del Camino Valle; Alfredo J. Simón Cuevas; Eduardo Valladares Valdés

Authors: José Ángel Olivas Varela, Francisco Pascual Romero Chicharro, Oleyda del Camino Valle, Alfredo J. Simón Cuevas, Eduardo Valladares Valdés
Published in: Procesamiento del lenguaje natural, ISSN 1135-5948, No. 63, 2019, pp. 103-110
Language: Spanish
Parallel titles:
- Multi-document extractive summarization using semantic graph
Abstract
  Automatic text summarization condenses the most relevant information contained in a set of documents into a short text, and helps to reduce the problems caused by information overload. This paper presents an unsupervised method for extractive multi-document summarization. In this proposal, the conceptualization and underlying semantic structure of the textual content are represented in a semantic graph built using WordNet, and a concept clustering algorithm is applied to identify the topics covered in the document set. These topics are then used to evaluate the relevance of the sentences when building the summary. The method was evaluated on text corpora from MultiLing 2015, and ROUGE metrics were used to measure the quality of the generated summaries. The results were compared with those of other systems that participated in MultiLing 2015, showing improvements in most cases.
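The pipeline outlined in the abstract (concept graph, concept clustering, topic-based sentence scoring, ROUGE evaluation) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: `TOY_RELATED` is a hand-written table standing in for a WordNet-based relatedness measure, connected components stand in for the clustering algorithm (which the abstract does not specify), the scoring weights are arbitrary, and `rouge_1_recall` is a simplified unigram recall rather than the full ROUGE package.

```python
from collections import Counter
from itertools import combinations

# Hypothetical stand-in for a WordNet-based relatedness measure:
# a tiny hand-written table of related concept pairs (not real WordNet data).
TOY_RELATED = {
    frozenset({"storm", "rain"}),
    frozenset({"rain", "flood"}),
    frozenset({"election", "vote"}),
}


def build_concept_graph(sentences):
    """Return an adjacency dict linking words that TOY_RELATED marks as related."""
    concepts = {w for s in sentences for w in s.lower().split()}
    graph = {c: set() for c in concepts}
    for a, b in combinations(sorted(concepts), 2):
        if frozenset({a, b}) in TOY_RELATED:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def cluster_concepts(graph):
    """Group concepts into topics: connected components with at least two concepts."""
    seen, topics = set(), []
    for start in sorted(graph):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(graph[node] - component)
        seen |= component
        if len(component) > 1:
            topics.append(component)
    return topics


def score_sentence(sentence, topics):
    """Score a sentence by the topic concepts it contains, weighted by topic size."""
    words = set(sentence.lower().split())
    return sum(len(topic) * len(words & topic) for topic in topics)


def summarize(sentences, max_sentences=1):
    """Pick the highest-scoring sentences and return them in their original order."""
    topics = cluster_concepts(build_concept_graph(sentences))
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score_sentence(sentences[i], topics),
                    reverse=True)
    return [sentences[i] for i in sorted(ranked[:max_sentences])]


def rouge_1_recall(candidate, reference):
    """Simplified ROUGE-1 recall: clipped unigram overlap over reference length."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum(min(count, cand[word]) for word, count in ref.items())
    return overlap / max(sum(ref.values()), 1)


docs = [
    "the storm brought heavy rain and a flood",
    "rain closed roads downtown",
    "the election vote was delayed",
]
summary = summarize(docs, max_sentences=1)
print(summary)
print(rouge_1_recall(summary[0], "storm and rain hit the city"))
```

In this toy run the first sentence is selected because it covers all three concepts of the largest topic. A closer approximation of the described method would replace `TOY_RELATED` with a similarity measure computed over WordNet synsets after word sense disambiguation.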