Prototipo de plataforma educativa basada en modelos de lenguaje para el apoyo en el aprendizaje de matemáticas básicas
| dc.contributor.advisor | Restrepo Calle, Felipe | |
| dc.contributor.author | Pabón Correa, David Alejandro | |
| dc.contributor.orcid | Pabon Correa, David Alejandro [0009000824194336] | |
| dc.contributor.researchgroup | Plas Programming languages And Systems | |
| dc.coverage.country | Colombia | |
| dc.date.accessioned | 2025-12-18T12:37:12Z | |
| dc.date.available | 2025-12-18T12:37:12Z | |
| dc.date.issued | 2025 | |
| dc.description | ilustraciones a color, diagramas | spa |
| dc.description.abstract | El presente trabajo desarrolla un prototipo de plataforma educativa de código abierto orientada a la enseñanza de matemáticas básicas, integrando modelos de lenguaje para ofrecer tutoría personalizada. La propuesta surge como respuesta a la brecha de aprendizaje matemático en Colombia y a la necesidad de contar con herramientas capaces de operar en entornos con recursos limitados. Se plantea la adaptación de modelos de lenguaje pequeños (Small Language Models) al dominio de las matemáticas elementales, con el propósito de generar explicaciones paso a paso y fomentar el aprendizaje activo. El documento describe las fases de diseño pedagógico, la construcción de un conjunto de datos en español, el ajuste fino de los modelos y la implementación de un prototipo con interfaz de usuario. Los resultados obtenidos muestran la factibilidad técnica y pedagógica de esta aproximación en escenarios de baja conectividad, y se plantea su potencial escalabilidad como alternativa inclusiva para fortalecer la enseñanza de las matemáticas en el sistema educativo colombiano (Texto tomado de la fuente). | spa |
| dc.description.abstract | This work develops an open-source educational platform prototype aimed at teaching basic mathematics, integrating language models to provide personalized tutoring. The proposal arises in response to the mathematics learning gap in Colombia and the need for tools capable of operating in resource-constrained environments. The approach involves adapting Small Language Models to the domain of elementary mathematics, with the goal of generating step-by-step explanations and fostering active learning. The document describes the phases of pedagogical design, the construction of a Spanish dataset, the fine-tuning of the models, and the implementation of a user interface prototype. The results obtained demonstrate the technical and pedagogical feasibility of this approach in low-connectivity scenarios, and highlight its potential scalability as an inclusive alternative to strengthen mathematics education within the Colombian educational system. | eng |
| dc.description.degreelevel | Maestría | |
| dc.description.degreename | Magíster en Ingeniería de Sistemas | |
| dc.description.methods | Contiene una metodología para el desarrollo de una plataforma que utiliza modelos de lenguaje aplicados en un entorno local. Describe adicionalmente un procesamiento para generar conjuntos de datos de manera sintética, aportando los enlaces a los repositorios producidos por el proyecto. Incluye un estudio de ajuste fino de modelos de lenguaje, describiendo de manera detallada la metodología, resultados. E incluye el diseño de la plataforma y su discusión de resultados. | |
| dc.description.researcharea | Sistemas Inteligentes | |
| dc.description.technicalinfo | N/A | spa |
| dc.format.extent | 108 páginas | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.instname | Universidad Nacional de Colombia | spa |
| dc.identifier.reponame | Repositorio Institucional Universidad Nacional de Colombia | spa |
| dc.identifier.repourl | https://repositorio.unal.edu.co/ | spa |
| dc.identifier.uri | https://repositorio.unal.edu.co/handle/unal/89227 | |
| dc.language.iso | spa | |
| dc.publisher | Universidad Nacional de Colombia | |
| dc.publisher.branch | Universidad Nacional de Colombia - Sede Bogotá | |
| dc.publisher.faculty | Facultad de Ingeniería | |
| dc.publisher.place | Bogotá, Colombia | |
| dc.publisher.program | Bogotá - Ingeniería - Maestría en Ingeniería - Ingeniería de Sistemas y Computación | |
| dc.relation.references | Abdin, M., Aneja, J., Awadalla, H., Awadallah, A., Awan, A. A., Bach, N., Bahree, A., Bakhtiari, A., Bao, J., Behl, H., Benhaim, A., Bilenko, M., Bjorck, J., Bubeck, S., Cai, M., Cai, Q., Chaudhary, V., Chen, D., Chen, D., y Zhou, X. (2024). Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone. http://arxiv.org/abs/2404.14219 | |
| dc.relation.references | Bonilla-Mejía, L., y Londoño-Ortega, E. (2021). Geographic Isolation and Learning in Rural Schools [Spatial regression discontinuity study showing negative impact of distance on student lear- ning in Colombia]. Banco de la República, Documentos de Trabajo | |
| dc.relation.references | Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D. M., Wu, J., Winter, C., … y Amodei, D. (2020). Language Models are Few-Shot Learners. http://arxiv.org/abs/2005.14165 | |
| dc.relation.references | Bulathwela, S., Pérez-Ortiz, M., Holloway, C., Cukurova, M., y Shawe-Taylor, J. (2024). Artificial Inte- lligence Alone Will Not Democratise Education: On Educational Inequality, Techno-Solutionism and Inclusive Tools. Sustainability (Switzerland), 16. https://doi.org/10.3390/su16020781 | |
| dc.relation.references | Chen, W., Yin, M., Ku, M., Lu, P., Wan, Y., Ma, X., Xu, J., Wang, X., y Xia, T. (2023). TheoremQA: A Theorem-driven Question Answering Dataset. arXiv preprint arXiv:2305.12524 | |
| dc.relation.references | Cobbe, K., Kosaraju, V., Bavarian, M., Chen, M., y Jun, H. (2021). Training Verifiers to Solve Math Word Problems | |
| dc.relation.references | Cobbe, K., Kosaraju, V., Bavarian, M., Chen, M., Jun, H., Kaiser, L., Plappert, M., Tworek, J., Hilton, J., Nakano, R., Hesse, C., y Schulman, J. (2021). Training verifiers to solve math word problems. arXiv preprint arXiv:2110.14168. | |
| dc.relation.references | Departamento Administrativo Nacional de Estadística (DANE). (2023). Boletín técnico: Educación Formal (EDUC) Año 2023 [Disponible en: https : / / www . dane . gov . co / files / operaciones / EDUC/bol-EDUC-2023.pdf] | |
| dc.relation.references | European Commission. (2023). Netherlands AI Strategy Report (inf. téc.) (Accessed: 2025-06-03). Eu- ropean Commission. Brussels. https : / / ai - watch . ec . europa . eu / countries / netherlands / netherlands-ai-strategy-report_en | |
| dc.relation.references | Farfán Betancourt, A. M., y Correal Romero, T. F. (2025). Brechas Educativas en el Campo Colombiano: Accesibilidad, Permanencia y Calidad en la Educación Rural [Narrative review identifying lack of infrastructure, connectivity, and curricular relevance as key barriers in rural education]. Línea Imaginaria, 1(20). https://doi.org/10.56219/lneaimaginaria.v1i20.3695 | |
| dc.relation.references | García Cuéllar, D. A., Rojas Carvajal, J. S., y Coronado, A. (2024). Desarrollo de competencias matemá- ticas en estudiantes rurales: una estrategia did’actica de aprendizaje. Praxis, 20(3), 585-601. https://doi.org/10.21676/23897856.5948 | |
| dc.relation.references | Gemini Team, G. (2023). Gemini: A Family of Highly Capable Multimodal Models. arXiv preprint ar- Xiv:2312.11805. https://arxiv.org/abs/2312.11805 | |
| dc.relation.references | Gemma Team, Kamath, A., Ferret, J., Pathak, S., Vieillard, N., Merhej, R., Perrin, S., Matejovicova, T., Ramé, A., Rivière, M., Rouillard, L., Mesnard, T., Cideron, G., Grill, J.-B., Ramos, S., Yvinec, É., Casbon, M., Pot, E., Penchev, I., … y Hussenot, L. (2025, marzo). Gemma 3 Technical Report. https://arxiv.org/abs/2503.19786 | |
| dc.relation.references | Goodfellow, I., Bengio, Y., y Courville, A. (2016). Deep Learning [Book in preparation for MIT Press]. MIT Press. http://www.deeplearningbook.org | |
| dc.relation.references | Hendrycks, D., Burns, C., Kadavath, S., Arora, A., Basart, S., Tang, E., Song, D., y Steinhardt, J. (2021a). Measuring Mathematical Problem Solving With the MATH Dataset. NeurIPS. | |
| dc.relation.references | Hendrycks, D., Burns, C., Kadavath, S., Arora, A., Basart, S., Tang, E., Song, D., y Steinhardt, J. (2021b). Measuring Mathematical Problem Solving with the MATH Dataset. Advances in Neural Informa- tion Processing Systems (NeurIPS), 34, 24241-24253. | |
| dc.relation.references | House of Lords Library. (2023). Educational Technology: Digital Innovation and AI in Schools [Acces- sed: 2025-06-03]. | |
| dc.relation.references | ICFES, I. (2020). Marco de Referencia: Matemáticas Saber 3, 5, 7 y 9 (inf. téc.) (Versión en línea, con- sultado el 12 de julio de 2025). Instituto Colombiano para la Evaluación de la Educación - Icfes. https://www.icfes.gov.co/wp- content/uploads/2024/11/Marco- de- Referencia- Matematicas-Saber-3579.pdf | |
| dc.relation.references | International Trade Administration. (2023). South Korea: Artificial Intelligence in Public Schools [Ac- cessed: 2025-06-03]. | |
| dc.relation.references | Javaid, M., Haleem, A., Singh, R. P., Khan, S., y Khan, I. H. (2023). Unlocking the opportunities th- rough ChatGPT Tool towards ameliorating the education system. BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 3(2), 100115. https : / / doi . org / https : / / doi . org / 10 . 1016/j.tbench.2023.100115 | |
| dc.relation.references | Kasneci, E., Sessler, K., Klinker, G., Schuller, B., y Hauck, T. (2023). ChatGPT for Good? On Opportunities and Challenges of Large Language Models for Education. Learning and Individual Differences, 103, 102274. https://doi.org/10.1016/j.lindif.2023.102274 | |
| dc.relation.references | Khan, S. (2023). Harnessing GPT-4 so that all students benefit. A nonprofit approach for equal access [Disponible en https://blog.khanacademy.org/harnessing-ai-so-that-all-students-benefit-a- nonprofit-approach-for-equal-access/]. | |
| dc.relation.references | Kumar, H., Rothschild, D. M., Goldstein, D. G., y Hofman, J. M. (s.f.). Math Education With Large Language Models: Peril or Promise? (Inf. téc.). https://aspredicted.org/H34_SZX | |
| dc.relation.references | Labadze, L., Grigolia, M., y Machaidze, L. (2023). Role of AI chatbots in education: systematic lite- rature review. International Journal of Educational Technology in Higher Education, 20(56). https: //doi.org/10.1186/s41239-023-00426-1 | |
| dc.relation.references | Laboratorio de Economía de la Educación. (2023). Características y retos de la educación rural en Colombia (Radiografía de la educación rural) [Disponible en https://lee.javeriana.edu.co/ noticia-colegios-rurales-y-el-estado-2023] | |
| dc.relation.references | Laranjo, L., Dunn, A. G., Tong, H.-T., Kocaballi, A. B., Chen, J., Bashir, R., Surian, D., Gallego, B., Magrabi, F., Lau, A. Y., y Coiera, E. (2018). Conversational agents in healthcare: a systematic review. Journal of the American Medical Informatics Association, 25(9), 1248-1258. https://doi.org/10. 1093/jamia/ocy072 | |
| dc.relation.references | Laun, M., y Wolff, F. (2025). Chatbots in education: Hype or help? A meta-analysis. Learning and Indi- vidual Differences, 119, 102646. https://doi.org/10.1016/j.lindif.2025.102646 | |
| dc.relation.references | Lewkowycz, A., Andreassen, A., Dohan, D., Dyer, E., Michalewski, H., Ramasesh, V., Slone, A., Anil, C., Schlag, I., Gutman-Solo, T., Wu, Y., Neyshabur, B., Gur-Ari, G., y Misra, V. (2022). Sol- ving Quantitative Reasoning Problems with Language Models. arXiv preprint arXiv:2206.14858. https://arxiv.org/abs/2206.14858 | |
| dc.relation.references | Liu, W., Hu, H., Zhou, J., Ding, Y., Li, J., Zeng, J., He, M., Chen, Q., Jiang, B., Zhou, A., y He, L. (2023). Mathematical Language Models: A Survey. http://arxiv.org/abs/2312.07622 | |
| dc.relation.references | Liu, Z., Liu, T., Chen, Z., ZhenshengFang, Tian, M., y Luo, W. (2025). MathEval: A Comprehensive Benchmark for Evaluating Large Language Models on Mathematical Reasoning Capabilities. https://openreview.net/forum?id=DexGnh0EcB | |
| dc.relation.references | Minaee, S., Mikolov, T., Nikzad, N., Chenaghlu, M., Socher, R., Amatriain, X., y Gao, J. (2024). Lar- ge Language Models: A Survey [Version 3, March 23, 2025]. arXiv preprint arXiv:2402.06196. https://arxiv.org/abs/2402.06196 | |
| dc.relation.references | Ministerio de Educación Nacional. (2010). Matemáticas. Escuela Nueva. Cartilla 1. Grado 5. MEN. | |
| dc.relation.references | Ministerio de Educación Nacional. (2011). Nivelemos Matemáticas 3. Guía del estudiante. MEN. | |
| dc.relation.references | Ministerio de Tecnologías de la Información y las Comunicaciones de Colombia (MinTIC). (2024). Informe de conectividad rural 2024 [Disponible en: https://mintic.gov.co/portal/604/w3- channel.html]. | |
| dc.relation.references | Ministerio de Tecnologías de la Información y las Comunicaciones de Colombia (MinTIC). (2024). Informe de conectividad rural 2024 [Disponible en: https://mintic.gov.co/portal/604/w3- channel.html]. | |
| dc.relation.references | Mladenova, T., Kalmus, V., y Sukk, M. (2021). Distance Learning in the COVID-19 Era: Promises and Pitfalls. Handbook of Research on Managing and Designing Online Courses in Synchronous and Asyn- chronous Environments, 104, 233-250. https://doi.org/10.4018/978-1-7998-8701-0.ch012 | |
| dc.relation.references | Sonkar, S., Liu, N., Mallick, D. B., y Baraniuk, R. G. (2023). CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principles. https : / / arxiv . org / abs / 2305.13272 | |
| dc.relation.references | Sonkar, S., Ullman, J., Nye, M., Chi, H., Wu, J., Raffel, C., Ziegler, D., Liang, P., y Steinhardt, J. (2024). Pedagogical Alignment of Large Language Models. arXiv preprint arXiv:2403.00843. https:// arxiv.org/abs/2403.00843 | |
| dc.relation.references | Zhu, X., Li, J., Liu, Y., Ma, C., y Wang, W. (2024). Distilling Mathematical Reasoning Capabilities into Small Language Models. arXiv preprint arXiv:2401.11864. https://arxiv.org/abs/2401.11864 | |
| dc.relation.references | Nguyen, C. V., Shen, X., Aponte, R., Xia, Y., Basu, S., Hu, Z., Chen, J., Parmar, M., Kunapuli, S., Barrow, J., Wu, J., Singh, A., Wang, Y., Gu, J., Dernoncourt, F., Ahmed, N. K., Lipka, N., Zhang, R., Chen,X., y Nguyen, T. H. (2024). A Survey of Small Language Models. http://arxiv.org/abs/2410. 20011 | |
| dc.relation.references | Núñez, R. P., Procopio, M. V. R., Fernández-Cézar, R., y Solano-Pinto, N. (2023). Affective domain and mathematics achievement of Colombian students under multiple correspondence analysis. Frontiers in Education, 8. https://doi.org/10.3389/feduc.2023.1261829 | |
| dc.relation.references | Obando-Zapata, G., Pontón-Ladino, T., Parada-Rico, S. E., y Villa-Ochoa, J. A. (2020). Research into cognition and numerical thinking in Colombia. Estudios de Psicologia, 41, 319-347. https://doi. org/10.1080/02109395.2020.1748841 | |
| dc.relation.references | OCDE. (2023). Resultados PISA 2022 (Volumen I): El estado del aprendizaje y la equidad en la educación. https://doi.org/10.1787/53f23881-en | |
| dc.relation.references | OECD. (2023). Education GPS: Colombia - Student Performance (PISA 2022) [Disponible en https: //gpseducation.oecd.org/CountryProfile?primaryCountry=COL&topic=PI | |
| dc.relation.references | Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C. L., Mishkin, P., Zhang, C., Agarwal, S., Slama, K., Ray, A., Schulman, J., Hilton, J., Kelton, F., Miller, L., Simens, M., Askell, A., Welinder, P., Christiano, P., Leike, J., y Lowe, R. (2022). Training language models to follow instructions with human feedback. https://arxiv.org/abs/2203.02155 | |
| dc.relation.references | Perez, M. A. P., Orozco, B. L., Soto, J. T. C., Hernandez, M. B., Gonzalez, M. A. A., y Malagon, S. (2025). AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models. https://arxiv.org/abs/2505.18978 | |
| dc.relation.references | Rafailov, R., Mondal, S., Chowdhery, A., Du, Y., Jha, S., Zhang, X., Zoph, B., Hou, L., Chi, E. H., Le, Q. V., et al. (2023). Direct Preference Optimization: Your Language Model is Secretly a Reward Model. arXiv preprint arXiv:2305.18290. https://arxiv.org/abs/2305.18290 | |
| dc.relation.references | Red Hat. (2025). SLMs vs LLMs: What are small language models? [Accessed: 2025-06-03]. https : / / www.redhat.com/en/topics/ai/llm-vs-slm | |
| dc.relation.references | Rodríguez Orgales, C., Sánchez Torres, F. J., y Márquez Zúñiga, J. (2011). Impacto del programa Compu- tadores para Educar en la deserción estudiantil, el logro escolar y el ingreso a la educación superior (inf. téc. N.o 2011-36). Documento CEDE, Universidad de los Andes. https : / / doi . org / 10 . 57784/1992/8254 | |
| dc.relation.references | Shuyo, N. (2010). Language Detection Library for Java. http : / / code . google . com / p / language - detection/ | |
| dc.relation.references | Toshniwal, S., Du, W., Moshkov, I., Kisacanin, B., Ayrapetyan, A., y Gitman, I. (2024). OpenMathInstruct- 2: Accelerating AI for Math with Massive Open-Source Instruction Data. https://arxiv.org/ abs/2410.01560 | |
| dc.relation.references | Sossa, K., y Puertas, E. (2024). Dataset of Math Word Problems in Spanish and MathML [150 proble- mas en español con anotación MathML; licencia CC BY 4.0]. | |
| dc.relation.references | Zhao, W., Shang, M., Liu, Y., Wang, L., y Liu, J. (2020). Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems. arXiv preprint arXiv:2009.11506. | |
| dc.relation.references | World Bank. (2019). What are the main lessons from the latest results from PISA 2018 for Latin America? [Disponible en https : / / blogs . worldbank . org / latinamerica / que - nos - deja - pisa - 2018-america-latina] | |
| dc.relation.references | U.S. Department of Education, Office of Educational Technology. (2023). Artificial Intelligence and the Future of Teaching and Learning: Insights and Recommendations (inf. téc.) (Accessed: 2025-06- 03). U.S. Department of Education. Washington, DC. https://www.ed.gov/sites/ed/files/ documents/ai-report/ai-report.pdf | |
| dc.relation.references | Touvron, H., Lavril, T., Izacard, G., Martinet, X., Lachaux, M.-A., Lacroix, T., Rozière, B., Goyal, N., Ham- bro, E., Azhar, F., Rodriguez, A., Joulin, A., Grave, E., y Lample, G. (2023, febrero). LLaMA: Open and Efficient Foundation Language Models. https://doi.org/10.48550/arXiv.2302.13971 | |
| dc.rights.accessrights | info:eu-repo/semantics/openAccess | |
| dc.rights.license | Reconocimiento 4.0 Internacional | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject.ddc | 000 - Ciencias de la computación, información y obras generales::003 - Sistemas | |
| dc.subject.ddc | 370 - Educación::371 - Escuelas y actividades; educación especial | |
| dc.subject.lemb | METODOS DE SIMULACION | spa |
| dc.subject.lemb | Simulation methods | eng |
| dc.subject.lemb | PROTOTIPOS | spa |
| dc.subject.lemb | Prototype | eng |
| dc.subject.lemb | DESARROLLO DE PROTOTIPOS | spa |
| dc.subject.lemb | Prototype development | eng |
| dc.subject.lemb | INGENIERIA DE SISTEMAS | spa |
| dc.subject.lemb | Systems engineering | eng |
| dc.subject.lemb | MATEMATICAS-ENSENANZA BASICA | spa |
| dc.subject.lemb | Mathematics - study and teaching (elementary) | eng |
| dc.subject.lemb | METODOS DE ENSEÑANZA | spa |
| dc.subject.lemb | Educational method | eng |
| dc.subject.lemb | ENSEÑANZA PROGRAMADA | spa |
| dc.subject.lemb | Programmed instruction | eng |
| dc.subject.proposal | Modelos de lenguaje | spa |
| dc.subject.proposal | Educación | spa |
| dc.subject.proposal | Matemáticas básicas | spa |
| dc.subject.proposal | Inteligencia artificial | spa |
| dc.subject.proposal | Active learning | eng |
| dc.subject.proposal | Language models | eng |
| dc.subject.proposal | Education | eng |
| dc.subject.proposal | Basic mathematics | eng |
| dc.subject.proposal | Artificial intelligence | eng |
| dc.title | Prototipo de plataforma educativa basada en modelos de lenguaje para el apoyo en el aprendizaje de matemáticas básicas | spa |
| dc.title.translated | Prototype of an educational platform based on language models to support the learning of basic mathematics | eng |
| dc.type | Trabajo de grado - Maestría | |
| dc.type.coar | http://purl.org/coar/resource_type/c_bdcc | |
| dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | |
| dc.type.content | Workflow | |
| dc.type.content | Software | |
| dc.type.content | Text | |
| dc.type.driver | info:eu-repo/semantics/masterThesis | |
| dc.type.redcol | http://purl.org/redcol/resource_type/TM | |
| dc.type.version | info:eu-repo/semantics/acceptedVersion | |
| dcterms.audience.professionaldevelopment | Investigadores | |
| dcterms.audience.professionaldevelopment | Estudiantes | |
| dcterms.audience.professionaldevelopment | Maestros | |
| dcterms.audience.professionaldevelopment | Público general | |
| oaire.accessrights | http://purl.org/coar/access_right/c_abf2 |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- DavidPabon1020814474.pdf
- Tamaño:
- 1.69 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Tesis de Maestría en Ingeniería de Sistemas y Computación
Bloque de licencias
1 - 1 de 1
Cargando...
- Nombre:
- license.txt
- Tamaño:
- 5.74 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción:

