Pronóstico de la precipitación acumulada en Colombia, utilizando técnicas de aprendizaje automático
| dc.contributor.advisor | Camargo Mendoza, Jorge Eliecer | spa |
| dc.contributor.author | Mogollón Oviedo, Juan Diego | spa |
| dc.contributor.referee | González Osorio, Fabio Augusto | spa |
| dc.coverage.country | Colombia | spa |
| dc.coverage.tgn | http://vocab.getty.edu/page/tgn/1000050 | |
| dc.date.accessioned | 2025-11-21T16:43:14Z | |
| dc.date.available | 2025-11-21T16:43:14Z | |
| dc.date.issued | 2025 | |
| dc.description | ilustraciones, diagramas | spa |
| dc.description.abstract | Este trabajo aborda el desafío del pronóstico de precipitación en Colombia, un aspecto fundamental para sectores estratégicos como la agricultura, la gestión de recursos hídricos, la energía y la mitigación de riesgos naturales. En un país caracterizado por su alta variabilidad climática y diversidad geográfica, la predicción exacta de la precipitación representa no solo un reto académico, sino una necesidad estratégica. En este contexto, se llevó a cabo un procesamiento automatizado de datos abiertos de precipitación acumulada, obtenidos a partir de las estaciones meteorológicas del Instituto de Hidrología, Meteorología y Estudios Ambientales (IDEAM), distribuidas a lo largo del territorio colombiano. El análisis incluyó la aplicación de técnicas de descomposición de señales, como Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) y Variational Mode Decomposition (VMD). Se evaluó el desempeño de diversos algoritmos de aprendizaje automático, incluyendo modelos basados en árboles de decisión (XGBoost), redes neuronales recurrentes (LSTM, BiLSTM), arquitecturas Transformer (Lag-Llama) y métodos híbridos con descomposición de señales y máquinas de soporte vectorial (CEEMDAN-VMD-BiLSTM, SVM-BiLSTM). La evaluación se realizó mediante métricas utilizadas en predicción climática como Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Nash-Sutcliffe Efficiency (NSE) y Forecast Anomaly Correlation (AC). Los resultados demostraron que el modelo XGBoost presentó un rendimiento superior, con el menor error cuadrático medio y el coeficiente de Nash-Sutcliffe más alto. Los modelos basados en redes neuronales recurrentes mostraron un desempeño ligeramente inferior pero consistente, mientras que las arquitecturas más complejas como Lag-Llama y CEEMDAN-VMD-BiLSTM exhibieron limitaciones significativas. Finalmente, se diseñó una arquitectura de implementación incorporando prácticas de operaciones de aprendizaje automático (MLOps), definiendo componentes técnicos, principios y herramientas útiles para implementar los modelos en entornos productivos. Este trabajo contribuye al desarrollo científico y tecnológico en la predicción hidrometeorológica en Colombia, ofreciendo alternativas innovadoras frente a los modelos numéricos tradicionales y evaluando su viabilidad en el contexto local. (Texto tomado de la fuente). | spa |
| dc.description.abstract | This work addresses the challenge of precipitation forecasting in Colombia, a fundamental aspect for strategic sectors such as agriculture, water resource management, energy, and natural disaster risk mitigation. In a country characterized by high climatic variability and geographic diversity, accurate precipitation prediction represents not only an academic challenge but also a strategic necessity. In this context, an automated processing of open data on accumulated precipitation was carried out, obtained from the meteorological stations of the Instituto de Hidrología, Meteorología y Estudios Ambientales (IDEAM), distributed throughout the Colombian territory. The analysis included the application of signal decomposition techniques such as Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) and Variational Mode Decomposition (VMD). The performance of various machine learning algorithms was evaluated, including decision tree–based models (XGBoost), recurrent neural networks (LSTM, BiLSTM), Transformer architectures (Lag-Llama), and hybrid methods combining signal decomposition and support vector machines (CEEMDAN-VMD-BiLSTM, SVM-BiLSTM). Evaluation was carried out using metrics commonly applied in climate prediction, such as Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Nash-Sutcliffe Efficiency (NSE), and Forecast Anomaly Correlation (AC). The results showed that the XGBoost model achieved the best performance, with the lowest root mean square error and the highest Nash-Sutcliffe coefficient. Recurrent neural network–based models showed slightly lower but consistent performance, while more complex architectures such as Lag-Llama and CEEMDAN-VMD-BiLSTM exhibited significant limitations. Finally, an implementation architecture was designed incorporating machine learning operations (MLOps) practices, defining technical components, principles, and useful tools for deploying the models in production environments. This work contributes to the scientific and technological development of hydrometeorological forecasting in Colombia, offering innovative alternatives to traditional numerical models and evaluating their feasibility in the local context. | eng |
| dc.description.degreelevel | Maestría | spa |
| dc.description.degreename | Magíster en Ingeniería - Ingeniería de Sistemas y Computación | spa |
| dc.description.researcharea | Machine learning - data science | spa |
| dc.format.extent | x, 50 páginas | spa |
| dc.format.mimetype | application/pdf | |
| dc.identifier.instname | Universidad Nacional de Colombia | spa |
| dc.identifier.reponame | Repositorio Institucional Universidad Nacional de Colombia | spa |
| dc.identifier.repourl | https://repositorio.unal.edu.co/ | spa |
| dc.identifier.uri | https://repositorio.unal.edu.co/handle/unal/89142 | |
| dc.language.iso | spa | |
| dc.publisher | Universidad Nacional de Colombia | spa |
| dc.publisher.branch | Universidad Nacional de Colombia - Sede Bogotá | spa |
| dc.publisher.department | Departamento de Ingeniería de Sistemas e Industrial | spa |
| dc.publisher.faculty | Facultad de Ingeniería | spa |
| dc.publisher.place | Bogotá, Colombia | spa |
| dc.publisher.program | Bogotá - Ingeniería - Maestría en Ingeniería - Ingeniería de Sistemas y Computación | spa |
| dc.relation.references | Ben Bouallègue, Z., Clare, M. C. A., Magnusson, L., Gascón, E., Maier-Gerber, M., Janoušek, M., Rodwell, M., Pinault, F., Dramsch, J. S., Lang, S. T. K., Raoult, B., Rabier, F., Chevallier, M., Sandu, I., Dueben, P., Chantry, M., y Pappenberger, F. (2024). The Rise of Data-Driven Weather Forecasting: A First Statistical Assessment of Machine Learning–Based Weather Forecasts in an Operational-Like Context. Bulletin of the American Meteorological Society, 105(6), E864–E883. https://doi.org/10.1175/bams-d-23-0162.1 | |
| dc.relation.references | Bi, K., Xie, L., Zhang, H., Chen, X., Gu, X., y Tian, Q. (2022). Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast. https://arxiv.org/abs/2211.02556 | |
| dc.relation.references | Bochenek, B., y Ustrnul, Z. (2022). Machine Learning in Weather Prediction and Climate Analyses—Applications and Perspectives. Atmosphere, 13(2), 180. https://doi.org/10.3390/atmos13020180 | |
| dc.relation.references | Cárdenas, L. L., Mezher, A. M., Barbecho Bautista, P. A., Astudillo León, J. P., y Igartua, M. A. (2021). A Multimetric Predictive ANN-Based Routing Protocol for Vehicular Ad Hoc Networks. IEEE Access, 9, 86037–86053. https://doi.org/10.1109/ACCESS.2021.3088474 | |
| dc.relation.references | Cloud, G. (2023). MLOps: Entrega continua y automatización de pipelines en machine learning. https://cloud.google.com/architecture/mlops-continuous-delivery-and-automation-pipelines-in-machine-learning | |
| dc.relation.references | Evgeniou, T., y Pontil, M. (2001). Support Vector Machines: Theory and Applications, 2049, 249–257. https://doi.org/10.1007/3-540-44673-7_12 | |
| dc.relation.references | Gerrity, J., Gyakum, J., Anthes, R., Bosart, L., y O’Lenic, E. (2023, julio). Weather forecasting and prediction [Retrieved July 14, 2025]. https://doi.org/10.1036/1097-8542.742600 | |
| dc.relation.references | Guerrero-Álvarez, J. A. (2021). Predicción niveles de precipitación para la identificación de zonas agrícolas. https://reunir.unir.net/handle/123456789/11338 | |
| dc.relation.references | Jaramillo R., A. (2005). Clima andino y café en Colombia: Andean climate and coffee in Colombia [Serie: Climatología]. CENICAFÉ. http://hdl.handle.net/10778/859 | |
| dc.relation.references | Kizito, R., Scruggs, P., Li, X., Devinney, M., Jansen, J., y Kress, R. (2021). Long Short-Term Memory Networks for Facility Infrastructure Failure and Remaining Useful Life Prediction. IEEE Access, PP, 1–1. https://doi.org/10.1109/ACCESS.2021.3077192 | |
| dc.relation.references | Kreuzberger, D., Kühl, N., y Hirschl, S. (2022). Machine Learning Operations (MLOps): Overview, Definition, and Architecture. https://arxiv.org/abs/2205.02302 | |
| dc.relation.references | Lam, R., Sanchez-Gonzalez, A., Willson, M., Wirnsberger, P., Fortunato, M., Alet, F., Ravuri, S., Ewalds, T., Eaton-Rosen, Z., Hu, W., Merose, A., Hoyer, S., Holland, G., Vinyals, O., Stott, J., Pritzel, A., Mohamed, S., y Battaglia, P. (2023). GraphCast: Learning skillful medium-range global weather forecasting. https://arxiv.org/abs/2212.12794 | |
| dc.relation.references | Lang, S., Alexe, M., Chantry, M., Dramsch, J., Pinault, F., Raoult, B., Bouallègue, Z. B., Clare, M., Lessig, C., Magnusson, L., y Nemesio, A. P. (2024, enero). AIFS: a new ECMWF forecasting system [European Centre for Medium-Range Weather Forecasts]. https://www.ecmwf.int/en/newsletter/178/news/aifs-new-ecmwf-forecasting-system | |
| dc.relation.references | Latif, S. D., Hazrin, N. A. B., Koo, C. H., Ng, J. L., Chaplot, B., Huang, Y. F., El-Shafie, A., y Ahmed, A. N. (2023). Assessing rainfall prediction models: Exploring the advantages of machine learning and remote sensing approaches. Alexandria Engineering Journal, 82, 16–25. https://doi.org/10.1016/j.aej.2023.09.060 | |
| dc.relation.references | Liu, X., Sang, X., Chang, J., Zheng, Y., y Han, Y. (2022). Rainfall prediction optimization model in ten-day time step based on sliding window mechanism and zero sum game. AQUA - Water Infrastructure, Ecosystems and Society, 71(1), 1–18. https://doi.org/10.2166/aqua.2021.086 | |
| dc.relation.references | Mejía, J. A. G., y O., F. G. (2022). Machine Learning Operations y sus desafíos de implementación en Colombia. Revista SISTEMAS, 165, a4. https://doi.org/10.29236/sistemas.n165a4 | |
| dc.relation.references | Microsoft. (s.f.). Machine Learning Operations (MLOps). https://learn.microsoft.com/en-us/azure/cloud-adoption-framework/ready/azure-best-practices/ai-machine-learning-mlops | |
| dc.relation.references | Mienye, I. D., y Jere, N. (2024). A Survey of Decision Trees: Concepts, Algorithms, and Applications. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3416838 | |
| dc.relation.references | Murcia, J. F. R., y Franco, J. Y. M. (2020, marzo). Aspectos Metodológicos de la Predicción Climática Mensual en Colombia (Nota Técnica N.º IDEAM-METEO/001-2020). IDEAM. https://bart.ideam.gov.co/wrfideam/new_modelo/DOCUMENTOS/2020/NT_001_2020.pdf | |
| dc.relation.references | Nguyen, T., Brandstetter, J., Kapoor, A., Gupta, J. K., y Grover, A. (2023). ClimaX: A foundation model for weather and climate. https://arxiv.org/abs/2301.10343 | |
| dc.relation.references | Pathak, J., Subramanian, S., Harrington, P., Raja, S., Chattopadhyay, A., Mardani, M., Kurth, T., Hall, D., Li, Z., Azizzadenesheli, K., Hassanzadeh, P., Kashinath, K., y Anandkumar, A. (2022). FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators. https://arxiv.org/abs/2202.11214 | |
| dc.relation.references | Poornima, S., y Pushpalatha, M. (2019). Prediction of Rainfall Using Intensified LSTM Based Recurrent Neural Network with Weighted Linear Units. Atmosphere, 10(11), 668. https://doi.org/10.3390/atmos10110668 | |
| dc.relation.references | Rasul, K., Ashok, A., Williams, A. R., Ghonia, H., Bhagwatkar, R., Khorasani, A., Bayazi, M. J. D., Adamopoulos, G., Riachi, R., Hassen, N., Biloš, M., Garg, S., Schneider, A., Chapados, N., Drouin, A., Zantedeschi, V., Nevmyvaka, Y., y Rish, I. (2024). Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting. https://arxiv.org/abs/2310.08278 | |
| dc.relation.references | Tkachuk, T. (2023, octubre). Predicción de condiciones meteorológicas en aeropuertos mediante deep learning. https://upcommons.upc.edu/handle/2117/395797 | |
| dc.relation.references | Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., y Polosukhin, I. (2023). Attention Is All You Need. https://arxiv.org/abs/1706.03762 | |
| dc.relation.references | Zhang, X., Shi, J., Chen, H., Xiao, Y., y Zhang, M. (2023). Precipitation prediction based on CEEMDAN–VMD–BILSTM combined quadratic decomposition model. Water Supply, 23(9), 3597–3613. https://doi.org/10.2166/ws.2023.212 | |
| dc.rights.accessrights | info:eu-repo/semantics/openAccess | |
| dc.rights.license | Reconocimiento 4.0 Internacional | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject.ddc | 000 - Ciencias de la computación, información y obras generales::005 - Programación, programas, datos de computación | spa |
| dc.subject.ddc | 550 - Ciencias de la tierra::551 - Geología, hidrología, meteorología | spa |
| dc.subject.proposal | Precipitación | spa |
| dc.subject.proposal | Aprendizaje automático | spa |
| dc.subject.proposal | Predicción climática | spa |
| dc.subject.proposal | Series temporales | spa |
| dc.subject.proposal | MLOps | spa |
| dc.subject.proposal | Precipitation | eng |
| dc.subject.proposal | Machine Learning | eng |
| dc.subject.proposal | Climate Prediction | eng |
| dc.subject.proposal | Time Series | eng |
| dc.subject.proposal | MLOps | eng |
| dc.subject.unesco | Hidrología | spa |
| dc.subject.unesco | Hydrology | eng |
| dc.subject.unesco | Recursos energéticos | spa |
| dc.subject.unesco | Energy resources | eng |
| dc.subject.unesco | Datos abiertos | spa |
| dc.subject.unesco | Open data | eng |
| dc.subject.unesco | Gestión de los recursos hídricos | spa |
| dc.subject.unesco | Water resources management | eng |
| dc.title | Pronóstico de la precipitación acumulada en Colombia, utilizando técnicas de aprendizaje automático | spa |
| dc.title.translated | Forecast of accumulated precipitation in Colombia using machine learning techniques | eng |
| dc.type | Trabajo de grado - Maestría | |
| dc.type.coar | http://purl.org/coar/resource_type/c_bdcc | |
| dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | |
| dc.type.content | Text | |
| dc.type.driver | info:eu-repo/semantics/masterThesis | |
| dc.type.redcol | http://purl.org/redcol/resource_type/TM | |
| dc.type.version | info:eu-repo/semantics/acceptedVersion | |
| dcterms.audience.professionaldevelopment | Investigadores | spa |
| dcterms.audience.professionaldevelopment | Público general | spa |
| dcterms.audience.professionaldevelopment | Responsables políticos | spa |
| oaire.accessrights | http://purl.org/coar/access_right/c_abf2 |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- TrabajoFinalMaestria.pdf
- Tamaño:
- 4.66 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Tesis de Maestría en Ingeniería - Ingeniería de Sistemas y Computación
Bloque de licencias
1 - 1 de 1
Cargando...
- Nombre:
- license.txt
- Tamaño:
- 5.74 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción:

