Legibilidad del texto, métricas de complejidad y la importancia de las palabras

Fernando Martínez Santiago; Manuel Carlos Díaz Galiano; Rocío López Anguita; Arturo Montejo Ráez

Ayuda

Legibilidad del texto, métricas de complejidad y la importancia de las palabras

Autores: Fernando Martínez Santiago , Manuel Carlos Díaz Galiano , Rocío López Anguita, Arturo Montejo Ráez
Localización: Procesamiento del lenguaje natural, ISSN 1135-5948, Nº. 61, 2018, págs. 101-108
Idioma: español
Títulos paralelos:
- Text readability, complexity metrics and the importance of words
Enlaces
- Texto completo

Dialnet Métricas: 1 Cita

Resumen
- español
  El presente trabajo expone un estudio sobre la determinación de la edad recomendada de lectura sobre un conjunto de textos infantiles. Se ha evaluado el mismo con 12 medidas de complejidad propuestas por distintos autores. Usando estas medidas como características, hemos modelado los textos y aplicado una validación cruzada con varios clasificadores automáticos. Los resultados se han comparado con otras formas de representación de los textos, como vectores de palabras y vectores TF.IDF. Nuestras conclusiones indican que el rasgo más determinante para la determinación de la edad de lectura recomendada no radica tanto en factores como la complejidad sintáctica o léxica, sino en el uso de determinado vocabulario.
- English
  This article describes our study on the identification of the recommended age for readers in texts written for children. They have been evaluated over 12 complexity metrics proposed by different authors. By using these metrics as features, we have trained several automatic classifiers and cross-validated their performances to detect recommended reader level. The results have been compared with the classification performance obtained from other document models, like word embeddings and TF.IDF vectors. Our conclusions are that the most relevant facet to identify the recommended reader age is not on lexical or syntactical complexities, but strongly related with the vocabulary involved.
Referencias bibliográficas
- Alliende González, F. 1994. La legibilidad de los textos. Santiago de Chile: Andrés Bello, 24.
- Anula, A. 2008. Lecturas adaptadas a la enseñanza del español como l2: variables lingüísticas para la determinación del nivel de legibilidad....
- Blanco Pérez, A. y U. Gutiérrez Couto. 2002. Legibilidad de las páginas web sobre salud dirigidas a pacientes y lectores de la población general....
- Cain, K., J. Oakhill, y P. Bryant. 2004. Children’s reading comprehension ability: Concurrent prediction by working memory, verbal ability,...
- Contreras, A., R. Garcia-Alonso, M. Echenique, y F. Daye-Contreras. 1999. The sol Legibilidad del texto, métricas de complejidad y la importancia...
- De Granada Barrio-Cantalejo, D. S., P. Simón-Lorda, M. Melguizo, I. Escalona, M. Marijuán, P. Hernándo, y others. 2008. Validación de la escala...
- Flesch, R. 1948. A new readability yardstick. Journal of applied psychology, 32(3):221. García López, J. 2001. Legibilidad de los folletos...
- Larson, J. y J. Marsh. 2014. Making literacy real: Theories and practices for learning and teaching. Sage.
- Mc Laughlin, G. H. 1969. Smog gradinga new readability formula. Journal of reading, 12(8):639–646.
- Mikolov, T., I. Sutskever, K. Chen, G. S. Corrado, y J. Dean. 2013. Distributed representations of words and phrases and their compositionality....
- Montejo-Ráez, A. y M. C. Díaz-Galiano. 2016. Participación de sinai en tass 2016. En TASS@ SEPLN, páginas 41–45.
- Muñoz, M. 2006. Legibilidad y variabilidad de los textos. Boletín de Investigación Educacional, Pontificia Universidad Católica de Chile,...
- Padró, L. y E. Stanilovsky. 2012. Freeling 3.0: Towards wider multilinguality. En LREC2012.
- Pedregosa, F., G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, y others. 2011....
- Ramírez-Puerta, M., R. FernándezFernández, J. Frías-Pareja, M. YusteOssorio, S. Narbona-Galdó, y L. PeñasMaldonado. 2013. Análisis de legibilidad de...
- Rehurek, R. y P. Sojka. 2011. Gensim– python framework for vector space modelling. NLP Centre, Faculty of Informatics, Masaryk University,...
- Rello, L., R. Baeza-Yates, S. Bott, y H. Saggion. 2013. Simplify or help?: text simplification strategies for people with dyslexia. En Proceedings...
- Rodríguez, T. 1980. Determinación de la comprensibilidad de materiales de lectura por medio de variables lingüísticas. Lectura y vida, 1(1):29–32.
- Saggion, H., S. Stajner, S. Bott, S. Mille, ˇ L. Rello, y B. Drndarevic. 2015. Making it simplext: Implementation and evaluation of a text...
- Salton, G., A. Wong, y C.-S. Yang. 1975. A vector space model for automatic indexing. Communications of the ACM, 18(11):613–620.
- Senter, R. y E. A. Smith. 1967. Automated readability index. Informe técnico, CINCINNATI UNIV OH. Spache, G. 1953. A new readability formula...
- Spaulding, S. 1956. A spanish readability formula. The Modern Language Journal, 40(8):433–441.
- Stahl, S. A. 2003. Vocabulary and readability: How knowing word meanings affects comprehension. Topics in Language Disorders, 23(3):241–247.