Ir al contenido

Documat


Enfoque de simplificación léxica utilizando recursos de lectura fácil

  • Autores: Isabel Segura Bedmar Árbol académico, Paloma Martínez Fernández Árbol académico, Rodrigo Alarcón, Lourdes Moreno Árbol académico
  • Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 63, 2019, págs. 95-102
  • Idioma: español
  • Títulos paralelos:
    • Lexical simplification approach using easy-to-read resources
  • Enlaces
  • Resumen
    • español

      Este trabajo tiene como objetivo facilitar la comprensión y legibilidad de textos en español en un dominio genérico a través del diseño de un sistema de simplificación léxica que da soporte a la tarea de Complex Word Identification (CWI) y elección de sustituto más sencillo. Considerando la limitación de recursos disponibles en español, exploramos diferentes características que nos permitan discernir entre una palabra compleja y una simple. Algunas de estas características son obtenidas de lectura fácil. La evaluación muestra buenos resultados al obtener 0.7497 en F1-score en la tarea de CWI con el dataset de la competición de BEA Workshop 2018.

    • English

      This work aims to facilitate the understanding and readability of Spanish texts in a generic domain through the design of a lexical simplification system that provides support to the task of Complex Word Identification (CWI) and selection of a simpler substitute. Considering the limited resources available in Spanish, we explore different features that allow us to discern between a complex word and a simpler one. Some of these features are obtained from easy-to-read resources. The evaluation shows good results by obtaining an F1-Score of 0.7497 on the CWI Task with the BEA Workshop 2018 competition’s dataset. |

  • Referencias bibliográficas
    • Aroyehun, S. T., J. Angel, D. A. P. Alvarez, and A. Gelbukh. 2018. Complex word identification: Convolutional neural network vs. feature engineering....
    • Baeza-Yates, R., L. Rello, and J. Dembowski. 2015. Cassa: A context-aware synonym simplification algorithm. In Proceedings of the 2015 Conference...
    • Bott, S., L. Rello, B. Drndarevic, and H. Saggion. 2012. Can spanish be simpler? lexsis: Lexical simplification for spanish. Proceedings of...
    • Burstein, J., J. Shore, J. Sabatini, Y.-W. Lee, and M. Ventura. 2007. The automated text adaptation tool. In Proceedings of Human Language...
    • Cardellino, C. 2016. Spanish Billion Words Corpus and Embeddings, March. https://crscardellino.github.io/SBWCE/.
    • Chayle, C., C. M. Herrera, M. A. Barrera, A. Pauletto, and S. Blanco. 2017. Evaluación de la accesibilidad web. In XIX Workshop de Investigadores...
    • De Hertog, D. and A. Tack. 2018. Deep learning architecture for complex word identification. In Proceedings of the Thirteenth Workshop on...
    • Ferrés, D., H. Saggion, and X. G. Guinovart. 2017. An adaptable lexical simplification architecture for major ibero-romance languages. In...
    • Freyhoff, G., G. Hess, L. Kerr, B. Tronbacke, and K. Van Der Veken. 1998. Make it simple.
    • Glavaš, G. and S. Štajnerr. 2015. Simplifying lexical simplification: do we need simplified corpora? In Proceedings of the 53rd Annual Meeting...
    • Gonzalez-Dios, I. 2017. Análisis de la complejidad y simplificación automática de textos. el análisis de las estructuras complejas...
    • Grave, E., P. Bojanowski, P. Gupta, A. Joulin, and T. Mikolov. 2018. Learning word vectors for 157 languages. In Proceedings of the International...
    • Hartmann, N. and L. B. dos Santos. 2018. Nilc at cwi 2018: Exploring feature engineering and feature learning. In Proceedings of the Thirteenth...
    • Kajiwara, T. and M. Komachi. 2018. Complex word identification based on frequency in a learner corpus. In Proceedings of the Thirteenth Workshop...
    • Lal, P. and S. Ruger. 2002. Extract-based summarization with simplification. In Proceedings of the ACL.
    • Mitkov, R. and S. Štajner. 2014. The fewer, the better? a contrastive study about ways to simplify. In Proceedings of the Workshop on Automatic...
    • Moreno, L., P. Mart́ınez, J. Muguerza, and J. Abascal. 2018. Support resource based on standards for accessible e-government transactional...
    • Navigli, R. and S. P. Ponzetto. 2010. Babelnet: Building a very large multilingual semantic network. In Proceedings of the 48th annual meeting...
    • Paetzold, G. and L. Specia. 2015. Lexenstein: A framework for lexical simplification. Proceedings of ACL-IJCNLP 2015 System Demonstrations,...
    • Paetzold, G. and L. Specia. 2016a. Semeval 2016 task 11: Complex word identification. In Proceedings of the 10th International Workshop on...
    • Paetzold, G. H. and L. Specia. 2016b. Unsupervised lexical simplification for nonnative speakers. In Thirtieth AAAI Conference on Artificial...
    • Paetzold, G. H. and L. Specia. 2017. A survey on lexical simplification. Journal of Artificial Intelligence Research, 60:549– 593.
    • Saggion, H. 2017. Automatic text simplification. Synthesis Lectures on Human Language Technologies, 10(1):1–137.
    • Saggion, H., E. Gómez-Mart́ınez, E. Etayo, A. Anula, and L. Bourg. 2011. Text simplification in simplext: Making texts more accessible....
    • Shardlow, M. 2013. A comparison of techniques to automatically identify complex words. In 51st Annual Meeting of the Association for Computational...
    • Shardlow, M. 2014. A survey of automated text simplification. International Journal of Advanced Computer Science and Applications, 4(1):58–70.
    • Smith, K., G. Hallam, and S. Ghosh. 2012. Guidelines for professional library/information educational programs2012. IFLA Education and Training...
    • Štajner, S., I. Calixto, and H. Saggion. 2015. Automatic text simplification for spanish: Comparative evaluation of various simplification...
    • Štajner, S., H. Saggion, and S. P. Ponzetto. 2019. Improving lexical coverage of text simplification systems for spanish. Expert Systems with...
    • W3C, W. 2019. Web content accessibility guidelines (wcag) overview. https://www.w3.org/WAI/standardsguidelines/wcag/.
    • Yimam, S. M., C. Biemann, S. Malmasi, G. H. Paetzold, L. Specia, S. Štajner, A. Tack, and M. Zampieri. 2018. A report on the complex word...
    • Yimam, S. M., S. Stajner, M. Riedl, and C. Biemann. 2017. Multilingual and cross-lingual complex word identification. In RANLP, pages 813–822.

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno