Prototipo de plataforma educativa basada en modelos de lenguaje para el apoyo en el aprendizaje de matemáticas básicas

Pabón Correa, David Alejandro

dc.contributor.advisor	Restrepo Calle, Felipe
dc.contributor.author	Pabón Correa, David Alejandro
dc.contributor.orcid	Pabon Correa, David Alejandro [0009000824194336]
dc.contributor.researchgroup	Plas Programming languages And Systems
dc.coverage.country	Colombia
dc.date.accessioned	2025-12-18T12:37:12Z
dc.date.available	2025-12-18T12:37:12Z
dc.date.issued	2025
dc.description	ilustraciones a color, diagramas	spa
dc.description.abstract	El presente trabajo desarrolla un prototipo de plataforma educativa de código abierto orientada a la enseñanza de matemáticas básicas, integrando modelos de lenguaje para ofrecer tutoría personalizada. La propuesta surge como respuesta a la brecha de aprendizaje matemático en Colombia y a la necesidad de contar con herramientas capaces de operar en entornos con recursos limitados. Se plantea la adaptación de modelos de lenguaje pequeños (Small Language Models) al dominio de las matemáticas elementales, con el propósito de generar explicaciones paso a paso y fomentar el aprendizaje activo. El documento describe las fases de diseño pedagógico, la construcción de un conjunto de datos en español, el ajuste fino de los modelos y la implementación de un prototipo con interfaz de usuario. Los resultados obtenidos muestran la factibilidad técnica y pedagógica de esta aproximación en escenarios de baja conectividad, y se plantea su potencial escalabilidad como alternativa inclusiva para fortalecer la enseñanza de las matemáticas en el sistema educativo colombiano (Texto tomado de la fuente).	spa
dc.description.abstract	This work develops an open-source educational platform prototype aimed at teaching basic mathematics, integrating language models to provide personalized tutoring. The proposal arises in response to the mathematics learning gap in Colombia and the need for tools capable of operating in resource-constrained environments. The approach involves adapting Small Language Models to the domain of elementary mathematics, with the goal of generating step-by-step explanations and fostering active learning. The document describes the phases of pedagogical design, the construction of a Spanish dataset, the fine-tuning of the models, and the implementation of a user interface prototype. The results obtained demonstrate the technical and pedagogical feasibility of this approach in low-connectivity scenarios, and highlight its potential scalability as an inclusive alternative to strengthen mathematics education within the Colombian educational system.	eng
dc.description.degreelevel	Maestría
dc.description.degreename	Magíster en Ingeniería de Sistemas
dc.description.methods	Presents a methodology for developing a platform that uses language models running in a local environment. It also describes a process for synthetically generating datasets and provides links to the repositories produced by the project. It includes a fine-tuning study of language models, with a detailed description of the methodology and results, as well as the design of the platform and a discussion of its results.
dc.description.researcharea	Sistemas Inteligentes
dc.description.technicalinfo	N/A	spa
dc.format.extent	108 páginas
dc.format.mimetype	application/pdf
dc.identifier.instname	Universidad Nacional de Colombia	spa
dc.identifier.reponame	Repositorio Institucional Universidad Nacional de Colombia	spa
dc.identifier.repourl	https://repositorio.unal.edu.co/	spa
dc.identifier.uri	https://repositorio.unal.edu.co/handle/unal/89227
dc.language.iso	spa
dc.publisher	Universidad Nacional de Colombia
dc.publisher.branch	Universidad Nacional de Colombia - Sede Bogotá
dc.publisher.faculty	Facultad de Ingeniería
dc.publisher.place	Bogotá, Colombia
dc.publisher.program	Bogotá - Ingeniería - Maestría en Ingeniería - Ingeniería de Sistemas y Computación
dc.relation.references	Abdin, M., Aneja, J., Awadalla, H., Awadallah, A., Awan, A. A., Bach, N., Bahree, A., Bakhtiari, A., Bao, J., Behl, H., Benhaim, A., Bilenko, M., Bjorck, J., Bubeck, S., Cai, M., Cai, Q., Chaudhary, V., Chen, D., Chen, D., y Zhou, X. (2024). Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone. http://arxiv.org/abs/2404.14219
dc.relation.references	Bonilla-Mejía, L., y Londoño-Ortega, E. (2021). Geographic Isolation and Learning in Rural Schools [Spatial regression discontinuity study showing negative impact of distance on student learning in Colombia]. Banco de la República, Documentos de Trabajo
dc.relation.references	Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D. M., Wu, J., Winter, C., … y Amodei, D. (2020). Language Models are Few-Shot Learners. http://arxiv.org/abs/2005.14165
dc.relation.references	Bulathwela, S., Pérez-Ortiz, M., Holloway, C., Cukurova, M., y Shawe-Taylor, J. (2024). Artificial Intelligence Alone Will Not Democratise Education: On Educational Inequality, Techno-Solutionism and Inclusive Tools. Sustainability (Switzerland), 16. https://doi.org/10.3390/su16020781
dc.relation.references	Chen, W., Yin, M., Ku, M., Lu, P., Wan, Y., Ma, X., Xu, J., Wang, X., y Xia, T. (2023). TheoremQA: A Theorem-driven Question Answering Dataset. arXiv preprint arXiv:2305.12524
dc.relation.references	Cobbe, K., Kosaraju, V., Bavarian, M., Chen, M., y Jun, H. (2021). Training Verifiers to Solve Math Word Problems
dc.relation.references	Cobbe, K., Kosaraju, V., Bavarian, M., Chen, M., Jun, H., Kaiser, L., Plappert, M., Tworek, J., Hilton, J., Nakano, R., Hesse, C., y Schulman, J. (2021). Training verifiers to solve math word problems. arXiv preprint arXiv:2110.14168.
dc.relation.references	Departamento Administrativo Nacional de Estadística (DANE). (2023). Boletín técnico: Educación Formal (EDUC) Año 2023 [Disponible en: https://www.dane.gov.co/files/operaciones/EDUC/bol-EDUC-2023.pdf]
dc.relation.references	European Commission. (2023). Netherlands AI Strategy Report (inf. téc.) (Accessed: 2025-06-03). European Commission. Brussels. https://ai-watch.ec.europa.eu/countries/netherlands/netherlands-ai-strategy-report_en
dc.relation.references	Farfán Betancourt, A. M., y Correal Romero, T. F. (2025). Brechas Educativas en el Campo Colombiano: Accesibilidad, Permanencia y Calidad en la Educación Rural [Narrative review identifying lack of infrastructure, connectivity, and curricular relevance as key barriers in rural education]. Línea Imaginaria, 1(20). https://doi.org/10.56219/lneaimaginaria.v1i20.3695
dc.relation.references	García Cuéllar, D. A., Rojas Carvajal, J. S., y Coronado, A. (2024). Desarrollo de competencias matemáticas en estudiantes rurales: una estrategia didáctica de aprendizaje. Praxis, 20(3), 585-601. https://doi.org/10.21676/23897856.5948
dc.relation.references	Gemini Team, G. (2023). Gemini: A Family of Highly Capable Multimodal Models. arXiv preprint arXiv:2312.11805. https://arxiv.org/abs/2312.11805
dc.relation.references	Gemma Team, Kamath, A., Ferret, J., Pathak, S., Vieillard, N., Merhej, R., Perrin, S., Matejovicova, T., Ramé, A., Rivière, M., Rouillard, L., Mesnard, T., Cideron, G., Grill, J.-B., Ramos, S., Yvinec, É., Casbon, M., Pot, E., Penchev, I., … y Hussenot, L. (2025, marzo). Gemma 3 Technical Report. https://arxiv.org/abs/2503.19786
dc.relation.references	Goodfellow, I., Bengio, Y., y Courville, A. (2016). Deep Learning [Book in preparation for MIT Press]. MIT Press. http://www.deeplearningbook.org
dc.relation.references	Hendrycks, D., Burns, C., Kadavath, S., Arora, A., Basart, S., Tang, E., Song, D., y Steinhardt, J. (2021a). Measuring Mathematical Problem Solving With the MATH Dataset. NeurIPS.
dc.relation.references	Hendrycks, D., Burns, C., Kadavath, S., Arora, A., Basart, S., Tang, E., Song, D., y Steinhardt, J. (2021b). Measuring Mathematical Problem Solving with the MATH Dataset. Advances in Neural Information Processing Systems (NeurIPS), 34, 24241-24253.
dc.relation.references	House of Lords Library. (2023). Educational Technology: Digital Innovation and AI in Schools [Accessed: 2025-06-03].
dc.relation.references	ICFES, I. (2020). Marco de Referencia: Matemáticas Saber 3, 5, 7 y 9 (inf. téc.) (Versión en línea, consultado el 12 de julio de 2025). Instituto Colombiano para la Evaluación de la Educación - Icfes. https://www.icfes.gov.co/wp-content/uploads/2024/11/Marco-de-Referencia-Matematicas-Saber-3579.pdf
dc.relation.references	International Trade Administration. (2023). South Korea: Artificial Intelligence in Public Schools [Accessed: 2025-06-03].
dc.relation.references	Javaid, M., Haleem, A., Singh, R. P., Khan, S., y Khan, I. H. (2023). Unlocking the opportunities through ChatGPT Tool towards ameliorating the education system. BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 3(2), 100115. https://doi.org/10.1016/j.tbench.2023.100115
dc.relation.references	Kasneci, E., Sessler, K., Klinker, G., Schuller, B., y Hauck, T. (2023). ChatGPT for Good? On Opportunities and Challenges of Large Language Models for Education. Learning and Individual Differences, 103, 102274. https://doi.org/10.1016/j.lindif.2023.102274
dc.relation.references	Khan, S. (2023). Harnessing GPT-4 so that all students benefit. A nonprofit approach for equal access [Disponible en https://blog.khanacademy.org/harnessing-ai-so-that-all-students-benefit-a-nonprofit-approach-for-equal-access/].
dc.relation.references	Kumar, H., Rothschild, D. M., Goldstein, D. G., y Hofman, J. M. (s.f.). Math Education With Large Language Models: Peril or Promise? (Inf. téc.). https://aspredicted.org/H34_SZX
dc.relation.references	Labadze, L., Grigolia, M., y Machaidze, L. (2023). Role of AI chatbots in education: systematic literature review. International Journal of Educational Technology in Higher Education, 20(56). https://doi.org/10.1186/s41239-023-00426-1
dc.relation.references	Laboratorio de Economía de la Educación. (2023). Características y retos de la educación rural en Colombia (Radiografía de la educación rural) [Disponible en https://lee.javeriana.edu.co/noticia-colegios-rurales-y-el-estado-2023]
dc.relation.references	Laranjo, L., Dunn, A. G., Tong, H.-T., Kocaballi, A. B., Chen, J., Bashir, R., Surian, D., Gallego, B., Magrabi, F., Lau, A. Y., y Coiera, E. (2018). Conversational agents in healthcare: a systematic review. Journal of the American Medical Informatics Association, 25(9), 1248-1258. https://doi.org/10.1093/jamia/ocy072
dc.relation.references	Laun, M., y Wolff, F. (2025). Chatbots in education: Hype or help? A meta-analysis. Learning and Individual Differences, 119, 102646. https://doi.org/10.1016/j.lindif.2025.102646
dc.relation.references	Lewkowycz, A., Andreassen, A., Dohan, D., Dyer, E., Michalewski, H., Ramasesh, V., Slone, A., Anil, C., Schlag, I., Gutman-Solo, T., Wu, Y., Neyshabur, B., Gur-Ari, G., y Misra, V. (2022). Solving Quantitative Reasoning Problems with Language Models. arXiv preprint arXiv:2206.14858. https://arxiv.org/abs/2206.14858
dc.relation.references	Liu, W., Hu, H., Zhou, J., Ding, Y., Li, J., Zeng, J., He, M., Chen, Q., Jiang, B., Zhou, A., y He, L. (2023). Mathematical Language Models: A Survey. http://arxiv.org/abs/2312.07622
dc.relation.references	Liu, Z., Liu, T., Chen, Z., ZhenshengFang, Tian, M., y Luo, W. (2025). MathEval: A Comprehensive Benchmark for Evaluating Large Language Models on Mathematical Reasoning Capabilities. https://openreview.net/forum?id=DexGnh0EcB
dc.relation.references	Minaee, S., Mikolov, T., Nikzad, N., Chenaghlu, M., Socher, R., Amatriain, X., y Gao, J. (2024). Large Language Models: A Survey [Version 3, March 23, 2025]. arXiv preprint arXiv:2402.06196. https://arxiv.org/abs/2402.06196
dc.relation.references	Ministerio de Educación Nacional. (2010). Matemáticas. Escuela Nueva. Cartilla 1. Grado 5. MEN.
dc.relation.references	Ministerio de Educación Nacional. (2011). Nivelemos Matemáticas 3. Guía del estudiante. MEN.
dc.relation.references	Ministerio de Tecnologías de la Información y las Comunicaciones de Colombia (MinTIC). (2024). Informe de conectividad rural 2024 [Disponible en: https://mintic.gov.co/portal/604/w3-channel.html].
dc.relation.references	Mladenova, T., Kalmus, V., y Sukk, M. (2021). Distance Learning in the COVID-19 Era: Promises and Pitfalls. Handbook of Research on Managing and Designing Online Courses in Synchronous and Asynchronous Environments, 104, 233-250. https://doi.org/10.4018/978-1-7998-8701-0.ch012
dc.relation.references	Sonkar, S., Liu, N., Mallick, D. B., y Baraniuk, R. G. (2023). CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principles. https://arxiv.org/abs/2305.13272
dc.relation.references	Sonkar, S., Ullman, J., Nye, M., Chi, H., Wu, J., Raffel, C., Ziegler, D., Liang, P., y Steinhardt, J. (2024). Pedagogical Alignment of Large Language Models. arXiv preprint arXiv:2403.00843. https://arxiv.org/abs/2403.00843
dc.relation.references	Zhu, X., Li, J., Liu, Y., Ma, C., y Wang, W. (2024). Distilling Mathematical Reasoning Capabilities into Small Language Models. arXiv preprint arXiv:2401.11864. https://arxiv.org/abs/2401.11864
dc.relation.references	Nguyen, C. V., Shen, X., Aponte, R., Xia, Y., Basu, S., Hu, Z., Chen, J., Parmar, M., Kunapuli, S., Barrow, J., Wu, J., Singh, A., Wang, Y., Gu, J., Dernoncourt, F., Ahmed, N. K., Lipka, N., Zhang, R., Chen, X., y Nguyen, T. H. (2024). A Survey of Small Language Models. http://arxiv.org/abs/2410.20011
dc.relation.references	Núñez, R. P., Procopio, M. V. R., Fernández-Cézar, R., y Solano-Pinto, N. (2023). Affective domain and mathematics achievement of Colombian students under multiple correspondence analysis. Frontiers in Education, 8. https://doi.org/10.3389/feduc.2023.1261829
dc.relation.references	Obando-Zapata, G., Pontón-Ladino, T., Parada-Rico, S. E., y Villa-Ochoa, J. A. (2020). Research into cognition and numerical thinking in Colombia. Estudios de Psicologia, 41, 319-347. https://doi.org/10.1080/02109395.2020.1748841
dc.relation.references	OCDE. (2023). Resultados PISA 2022 (Volumen I): El estado del aprendizaje y la equidad en la educación. https://doi.org/10.1787/53f23881-en
dc.relation.references	OECD. (2023). Education GPS: Colombia - Student Performance (PISA 2022) [Disponible en https://gpseducation.oecd.org/CountryProfile?primaryCountry=COL&topic=PI]
dc.relation.references	Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C. L., Mishkin, P., Zhang, C., Agarwal, S., Slama, K., Ray, A., Schulman, J., Hilton, J., Kelton, F., Miller, L., Simens, M., Askell, A., Welinder, P., Christiano, P., Leike, J., y Lowe, R. (2022). Training language models to follow instructions with human feedback. https://arxiv.org/abs/2203.02155
dc.relation.references	Perez, M. A. P., Orozco, B. L., Soto, J. T. C., Hernandez, M. B., Gonzalez, M. A. A., y Malagon, S. (2025). AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models. https://arxiv.org/abs/2505.18978
dc.relation.references	Rafailov, R., Mondal, S., Chowdhery, A., Du, Y., Jha, S., Zhang, X., Zoph, B., Hou, L., Chi, E. H., Le, Q. V., et al. (2023). Direct Preference Optimization: Your Language Model is Secretly a Reward Model. arXiv preprint arXiv:2305.18290. https://arxiv.org/abs/2305.18290
dc.relation.references	Red Hat. (2025). SLMs vs LLMs: What are small language models? [Accessed: 2025-06-03]. https://www.redhat.com/en/topics/ai/llm-vs-slm
dc.relation.references	Rodríguez Orgales, C., Sánchez Torres, F. J., y Márquez Zúñiga, J. (2011). Impacto del programa Computadores para Educar en la deserción estudiantil, el logro escolar y el ingreso a la educación superior (inf. téc. N.o 2011-36). Documento CEDE, Universidad de los Andes. https://doi.org/10.57784/1992/8254
dc.relation.references	Shuyo, N. (2010). Language Detection Library for Java. http://code.google.com/p/language-detection/
dc.relation.references	Toshniwal, S., Du, W., Moshkov, I., Kisacanin, B., Ayrapetyan, A., y Gitman, I. (2024). OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data. https://arxiv.org/abs/2410.01560
dc.relation.references	Sossa, K., y Puertas, E. (2024). Dataset of Math Word Problems in Spanish and MathML [150 problemas en español con anotación MathML; licencia CC BY 4.0].
dc.relation.references	Zhao, W., Shang, M., Liu, Y., Wang, L., y Liu, J. (2020). Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems. arXiv preprint arXiv:2009.11506.
dc.relation.references	World Bank. (2019). What are the main lessons from the latest results from PISA 2018 for Latin America? [Disponible en https://blogs.worldbank.org/latinamerica/que-nos-deja-pisa-2018-america-latina]
dc.relation.references	U.S. Department of Education, Office of Educational Technology. (2023). Artificial Intelligence and the Future of Teaching and Learning: Insights and Recommendations (inf. téc.) (Accessed: 2025-06-03). U.S. Department of Education. Washington, DC. https://www.ed.gov/sites/ed/files/documents/ai-report/ai-report.pdf
dc.relation.references	Touvron, H., Lavril, T., Izacard, G., Martinet, X., Lachaux, M.-A., Lacroix, T., Rozière, B., Goyal, N., Hambro, E., Azhar, F., Rodriguez, A., Joulin, A., Grave, E., y Lample, G. (2023, febrero). LLaMA: Open and Efficient Foundation Language Models. https://doi.org/10.48550/arXiv.2302.13971
dc.rights.accessrights	info:eu-repo/semantics/openAccess
dc.rights.license	Reconocimiento 4.0 Internacional
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject.ddc	000 - Ciencias de la computación, información y obras generales::003 - Sistemas
dc.subject.ddc	370 - Educación::371 - Escuelas y actividades; educación especial
dc.subject.lemb	METODOS DE SIMULACION	spa
dc.subject.lemb	Simulation methods	eng
dc.subject.lemb	PROTOTIPOS	spa
dc.subject.lemb	Prototype	eng
dc.subject.lemb	DESARROLLO DE PROTOTIPOS	spa
dc.subject.lemb	Prototype development	eng
dc.subject.lemb	INGENIERIA DE SISTEMAS	spa
dc.subject.lemb	Systems engineering	eng
dc.subject.lemb	MATEMATICAS-ENSENANZA BASICA	spa
dc.subject.lemb	Mathematics - study and teaching (elementary)	eng
dc.subject.lemb	METODOS DE ENSEÑANZA	spa
dc.subject.lemb	Educational method	eng
dc.subject.lemb	ENSEÑANZA PROGRAMADA	spa
dc.subject.lemb	Programmed instruction	eng
dc.subject.proposal	Modelos de lenguaje	spa
dc.subject.proposal	Educación	spa
dc.subject.proposal	Matemáticas básicas	spa
dc.subject.proposal	Inteligencia artificial	spa
dc.subject.proposal	Active learning	eng
dc.subject.proposal	Language models	eng
dc.subject.proposal	Education	eng
dc.subject.proposal	Basic mathematics	eng
dc.subject.proposal	Artificial intelligence	eng
dc.title	Prototipo de plataforma educativa basada en modelos de lenguaje para el apoyo en el aprendizaje de matemáticas básicas	spa
dc.title.translated	Prototype of an educational platform based on language models to support the learning of basic mathematics	eng
dc.type	Trabajo de grado - Maestría
dc.type.coar	http://purl.org/coar/resource_type/c_bdcc
dc.type.coarversion	http://purl.org/coar/version/c_ab4af688f83e57aa
dc.type.content	Workflow
dc.type.content	Software
dc.type.content	Text
dc.type.driver	info:eu-repo/semantics/masterThesis
dc.type.redcol	http://purl.org/redcol/resource_type/TM
dc.type.version	info:eu-repo/semantics/acceptedVersion
dcterms.audience.professionaldevelopment	Investigadores
dcterms.audience.professionaldevelopment	Estudiantes
dcterms.audience.professionaldevelopment	Maestros
dcterms.audience.professionaldevelopment	Público general
oaire.accessrights	http://purl.org/coar/access_right/c_abf2

Files

Original bundle

Name: DavidPabon1020814474.pdf
Size: 1.69 MB
Format: Adobe Portable Document Format
Description: Master's thesis in Systems and Computing Engineering

License bundle

Name: license.txt
Size: 5.74 KB
Format: Item-specific license agreed upon to submission

Collections

Maestría en Ingeniería - Sistemas y Computación