Enfoque de simplificación léxica utilizando recursos de lectura fácil

Isabel Segura Bedmar; Paloma Martínez Fernández; Rodrigo Alarcón; Lourdes Moreno López

Ayuda

Enfoque de simplificación léxica utilizando recursos de lectura fácil

Autores: Isabel Segura Bedmar , Paloma Martínez Fernández , Rodrigo Alarcón, Lourdes Moreno López
Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 63, 2019, págs. 95-102
Idioma: español
Títulos paralelos:
- Lexical simplification approach using easy-to-read resources
Enlaces
- Texto completo

Dialnet Métricas: 1 Cita

Resumen
- español
  Este trabajo tiene como objetivo facilitar la comprensión y legibilidad de textos en español en un dominio genérico a través del diseño de un sistema de simplificación léxica que da soporte a la tarea de Complex Word Identification (CWI) y elección de sustituto más sencillo. Considerando la limitación de recursos disponibles en español, exploramos diferentes características que nos permitan discernir entre una palabra compleja y una simple. Algunas de estas características son obtenidas de lectura fácil. La evaluación muestra buenos resultados al obtener 0.7497 en F1-score en la tarea de CWI con el dataset de la competición de BEA Workshop 2018.
- English
  This work aims to facilitate the understanding and readability of Spanish texts in a generic domain through the design of a lexical simplification system that provides support to the task of Complex Word Identification (CWI) and selection of a simpler substitute. Considering the limited resources available in Spanish, we explore different features that allow us to discern between a complex word and a simpler one. Some of these features are obtained from easy-to-read resources. The evaluation shows good results by obtaining an F1-Score of 0.7497 on the CWI Task with the BEA Workshop 2018 competition’s dataset. |
Referencias bibliográficas
- Aroyehun, S. T., J. Angel, D. A. P. Alvarez, and A. Gelbukh. 2018. Complex word identification: Convolutional neural network vs. feature engineering....
- Baeza-Yates, R., L. Rello, and J. Dembowski. 2015. Cassa: A context-aware synonym simplification algorithm. In Proceedings of the 2015 Conference...
- Bott, S., L. Rello, B. Drndarevic, and H. Saggion. 2012. Can spanish be simpler? lexsis: Lexical simplification for spanish. Proceedings of...
- Burstein, J., J. Shore, J. Sabatini, Y.-W. Lee, and M. Ventura. 2007. The automated text adaptation tool. In Proceedings of Human Language...
- Cardellino, C. 2016. Spanish Billion Words Corpus and Embeddings, March. https://crscardellino.github.io/SBWCE/.
- Chayle, C., C. M. Herrera, M. A. Barrera, A. Pauletto, and S. Blanco. 2017. EvaluacioÌn de la accesibilidad web. In XIX Workshop de Investigadores...
- De Hertog, D. and A. Tack. 2018. Deep learning architecture for complex word identification. In Proceedings of the Thirteenth Workshop on...
- Ferrés, D., H. Saggion, and X. G. Guinovart. 2017. An adaptable lexical simplification architecture for major ibero-romance languages. In...
- Freyhoff, G., G. Hess, L. Kerr, B. Tronbacke, and K. Van Der Veken. 1998. Make it simple.
- Glavaš, G. and S. Štajnerr. 2015. Simplifying lexical simplification: do we need simplified corpora? In Proceedings of the 53rd Annual Meeting...
- Gonzalez-Dios, I. 2017. AnaÌlisis de la complejidad y simplificacioÌn automaÌtica de textos. el anaÌlisis de las estructuras complejas...
- Grave, E., P. Bojanowski, P. Gupta, A. Joulin, and T. Mikolov. 2018. Learning word vectors for 157 languages. In Proceedings of the International...
- Hartmann, N. and L. B. dos Santos. 2018. Nilc at cwi 2018: Exploring feature engineering and feature learning. In Proceedings of the Thirteenth...
- Kajiwara, T. and M. Komachi. 2018. Complex word identification based on frequency in a learner corpus. In Proceedings of the Thirteenth Workshop...
- Lal, P. and S. Ruger. 2002. Extract-based summarization with simplification. In Proceedings of the ACL.
- Mitkov, R. and S. SÌtajner. 2014. The fewer, the better? a contrastive study about ways to simplify. In Proceedings of the Workshop on Automatic...
- Moreno, L., P. MartÌÄ±nez, J. Muguerza, and J. Abascal. 2018. Support resource based on standards for accessible e-government transactional...
- Navigli, R. and S. P. Ponzetto. 2010. Babelnet: Building a very large multilingual semantic network. In Proceedings of the 48th annual meeting...
- Paetzold, G. and L. Specia. 2015. Lexenstein: A framework for lexical simplification. Proceedings of ACL-IJCNLP 2015 System Demonstrations,...
- Paetzold, G. and L. Specia. 2016a. Semeval 2016 task 11: Complex word identification. In Proceedings of the 10th International Workshop on...
- Paetzold, G. H. and L. Specia. 2016b. Unsupervised lexical simplification for nonnative speakers. In Thirtieth AAAI Conference on Artificial...
- Paetzold, G. H. and L. Specia. 2017. A survey on lexical simplification. Journal of Artificial Intelligence Research, 60:549– 593.
- Saggion, H. 2017. Automatic text simplification. Synthesis Lectures on Human Language Technologies, 10(1):1–137.
- Saggion, H., E. GoÌmez-MartÌÄ±nez, E. Etayo, A. Anula, and L. Bourg. 2011. Text simplification in simplext: Making texts more accessible....
- Shardlow, M. 2013. A comparison of techniques to automatically identify complex words. In 51st Annual Meeting of the Association for Computational...
- Shardlow, M. 2014. A survey of automated text simplification. International Journal of Advanced Computer Science and Applications, 4(1):58–70.
- Smith, K., G. Hallam, and S. Ghosh. 2012. Guidelines for professional library/information educational programs2012. IFLA Education and Training...
- Štajner, S., I. Calixto, and H. Saggion. 2015. Automatic text simplification for spanish: Comparative evaluation of various simplification...
- Štajner, S., H. Saggion, and S. P. Ponzetto. 2019. Improving lexical coverage of text simplification systems for spanish. Expert Systems with...
- W3C, W. 2019. Web content accessibility guidelines (wcag) overview. https://www.w3.org/WAI/standardsguidelines/wcag/.
- Yimam, S. M., C. Biemann, S. Malmasi, G. H. Paetzold, L. Specia, S. SÌtajner, A. Tack, and M. Zampieri. 2018. A report on the complex word...
- Yimam, S. M., S. Stajner, M. Riedl, and C. Biemann. 2017. Multilingual and cross-lingual complex word identification. In RANLP, pages 813–822.