Ir al contenido

Documat


Can Generative AI Solve Geometry Problems? Strengths and Weaknesses of LLMs for Geometric Reasoning in Spanish

  • Verónica Parra [1] ; Patricia Sureda [1] ; Ana Corica [1] ; Silvia Schiaffino [1] Árbol académico ; Daniela Godoy [1]
    1. [1] Universidad Nacional del Centro de la Provincia de Buenos Aires; CONICET
  • Localización: IJIMAI, ISSN-e 1989-1660, Vol. 8, Nº. 5, 2024, págs. 65-74
  • Idioma: inglés
  • DOI: 10.9781/ijimai.2024.02.009
  • Enlaces
  • Resumen
    • Generative Artificial Intelligence (AI) has emerged as a disruptive technology that is challenging traditional teaching and learning practices. Question-answering in natural language fosters the use of chatbots, such as ChatGPT, Bard and others, that generate text based on pre-trained Large Language Models (LLMs). The performance of these models in certain areas, like Math problem solving is receiving a crescent attention as it directly impacts on its potential use in educational settings. Most of these evaluations, however, concentrate on the construction and use of benchmarks comprising diverse Math problems in English. In this work, we discuss the capabilities of most used LLMs within the subfield of Geometry, in view of the relevance of this subject in high-school curricula and the difficulties exhibited by even most advanced multimodal LLMs to deal with geometric notions. This work focuses on Spanish, which is additionally a less resourced language. The answers of three major chatbots, based on different LLMs, were analyzed not only to determine their capacity to provide correct solutions, but also to categorize the errors found in the reasoning processes described. Understanding LLMs strengths and weaknesses in a field like Geometry can be a first step towards the design of more informed methodological proposals to include these technologies in classrooms as well as the development of more powerful automatic assistance tools based on generative AI.

  • Referencias bibliográficas
    • ] S. Frieder, L. Pinchetti, A. Chevalier, R.-R. Griffiths, T. Salvatori, T. Lukasiewicz, P. C. Petersen, J. Berner, “Mathematical capabilities...
    • D. Hendrycks, C. Burns, S. Kadavath, A. Arora, S. Basart, E. Tang, D. Song, J. Steinhardt, “Measuring mathematical problem solving with the...
    • P. Shakarian, A. Koyyalamudi, N. Ngu, L. Mareedu, “An independent evaluation of ChatGPT on mathematical word problems (MWP),” in Proceedings...
    • A. Chowdhery, S. Narang, J. Devlin, M. Bosma, G. Mishra, A. Roberts, P. Barham, H. W. Chung, C. Sutton, S. Gehrmann, P. Schuh, K. Shi, S....
    • K. Cobbe, V. Kosaraju, M. Bavarian, M. Chen, H. Jun, L. Kaiser, M. Plappert, J. Tworek, J. Hilton, R. Nakano, C. Hesse, J. Schulman, “Training...
    • T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. HerbertVoss,...
    • OpenAI, “GPT-4 technical report,” ArXiv, vol. abs/2303.08774, 2023.
    • F. J. García Pen¨ alvo, F. Llorens-Largo, J. Vidal, “La nueva realidad de la educación ante los avances de la inteligencia artificial generativa,”...
    • B. Memarian, T. Doleck, “ChatGPT in education: Methods, potentials, and limitations,” Computers in Human Behavior: Artificial Humans, vol....
    • B. Han, S. Nawaz, G. Buchanan, D. McKay, “Ethical and pedagogical impacts of AI in education,” in Artificial Intelligence in Education, Tokyo, Japan,...
    • J. Flores-Vivar, F. García-Pen¨ alvo, “Reflections on the ethics, potential, and challenges of artificial intelligence in the framework of...
    • R. Hadi Mogavi, C. Deng, J. Juho Kim, P. Zhou, Y. D. Kwon, A. Hosny Saleh Metwally, A. Tlili, S. Bassanelli, A. Bucchiarone, S. Gujar, L. E....
    • S. S. Gill, M. Xu, P. Patros, H. Wu, R. Kaur, K. Kaur, S. Fuller, M. Singh, P. Arora, A. K. Parlikad, V. Stankovski, A. Abraham, S. K. Ghosh,...
    • C. K. Lo, “What is the impact of ChatGPT on education? A rapid review of the literature,” Education Sciences, vol. 13, no. 4, 2023, doi: 10.3390/educsci13040410.
    • S. Chithrananda, G. Grand, B. Ramsundar, “ChemBERTa: Large-scale self-supervised pretraining for molecular property prediction,” ArXiv, vol....
    • Y. Wu, F. Jia, S. Zhang, H. Li, E. Zhu, Y. Wang, Y. T. Lee, R. Peng, Q. Wu, C. Wang, “An empirical study on challenging math problem solving...
    • R. T. McCoy, S. Yao, D. Friedman, M. Hardy, T. L. Griffiths, “Embers of autoregression: Understanding large language models through the problem...
    • P. Nguyen, P. Nguyen, Bruneau, L. Cao, Wang, H. Truong, “Evaluation of mathematics performance of Google Bard on the mathematics test of the vietnamese...
    • V. Plevris, G. Papazafeiropoulos, A. Jiménez Rios, “Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of...
    • J. Wei, X. Wang, D. Schuurmans, M. Bosma, B. Ichter, F. Xia, E. Chi, Q. Le, D. Zhou, “Chain-of-thought prompting elicits reasoning in large...
    • J. Gao, R. Pi, J. Zhang, J. Ye, W. Zhong, Y. Wang, L. Hong, J. Han, H. Xu, Z. Li, L. Kong, “G-LLaVA: Solving geometric problem with multi-modal large...
    • H. Liu, C. Li, Q. Wu, Y. J. Lee, “Visual instruction tuning,” in NeurIPS, 2023.
    • Ministerio de Educación, Argentina, Núcleos de Aprendizajes Prioritarios. Matemática. Ciclo Básico Educación Secundaria 1° y 2° / 2° y 3°...
    • R. S. Abrate, G. I. Delgado, M. D. Pochulu, “Caracterización de las actividades de geometría que proponen los textos de matemática,” Revista Iberoamericana...
    • M. B. López, I. B. Fernández, “Tendencias actuales de la ensenanzaaprendizaje de la geometría en educación secundaria,” Revista Internacional...
    • A. M. Bressan, K. Crego, B. Bogisic, Razones para ensenar geometría en la educación básica: mirar, construir, decir y pensar (1a. ed.). Novedades educativas,...
    • C. R. Suárez, T. Ángel Sierra Delgado, “Spatial problems: An alternative proposal to teach geometry in compulsory secondary education,” Educaçao Matemática...
    • L. Santalo, “Olimpíadas matemáticas,” Revista de Educación Matemática, vol. 6, ago. 2021, doi: 10.33044/revem.11101.
    • P. Fauring, F. Gutierrez Eds., Olimpiadas de Mayo - XVII a XXIV. Buenos Aires, Argentina: Red Olimpica, 2020.
    • B. Glass, C. Maher, “Students problem solving and justification,” in Proceedings of the 28th Conference of the International Group for the Psychology...
    • Y. S. Eko, S. Prabawanto, A. Jupri, “The role of writing justification in mathematics concept: the case of trigonometry,” Journal of Physics: Conference...
    • E. Pavlick, “Symbols and grounding in large language models,” Philosophical Transactions of the Royal Society A: Mathematical, Physical and...
    • G. M. Zunzarren, “The error as a problem or as teaching strategy,” Procedia - Social and Behavioral Sciences, vol. 46, pp. 3209–3214, 2012,...

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno