Machine Learning Operations aplicado al proceso de desarrollo y aprovisionamiento de modelos
dc.contributor.advisor | Camargo Mendoza, Jorge Eliécer | spa |
dc.contributor.advisor | Flórez Fernández, Héctor Arturo | spa |
dc.contributor.author | Mendez Aguirre, Oscar Alexander | spa |
dc.contributor.researchgroup | UnSecureLab | spa |
dc.date.accessioned | 2024-05-06T20:27:00Z | |
dc.date.available | 2024-05-06T20:27:00Z | |
dc.date.issued | 2024 | |
dc.description | ilustraciones, diagramas | spa |
dc.description.abstract | En la actual era de la ingeniería de software, donde el Machine Learning (ML) desempeña un papel crucial en la innovación tecnológica, la aplicación efectiva de prácticas de desarrollo y operación es esencial. El enfoque de DevSecOps (Development Security Operations) se ha popularizado por su capacidad para integrar la seguridad y la calidad en todas las etapas del ciclo de vida del desarrollo seguro de software. Sin embargo, en el contexto específico del Machine Learning, surge la necesidad de un enfoque especializado que considere las particula- ridades de los modelos y algoritmos utilizados. El Machine Learning Operations (MLOps), a pesar de su relativa novedad, busca establecer un marco para caracterizar el ciclo de vida del desarrollo de ML, desacoplarlo del desarrollo de software y garantizar atributos de calidad como escalabilidad, mantenibilidad y seguridad. También se enfrenta al desafío de gestionar datos de entrenamiento, la seguridad en el proceso de análisis y desarrollo de modelos, y la necesidad de una cultura orientada a la calidad. Este trabajo se centra en investigar cómo la implementación de MLOps puede impactar positivamente en la gestión del ciclo de vida del desarrollo de ML, con el objetivo de contribuir al conocimiento en este campo emergente y promover la adopción de las mejores prácticas en soluciones basadas en ML. (Texto tomado de la fuente). | spa |
dc.description.abstract | In the current era of software engineering, where Machine Learning (ML) plays a pivotal role in technological innovation, the effective implementation of development and opera- tions practices is essential. The DevSecOps (Development Security Operations) approach has gained popularity due to its ability to integrate security and quality at every stage of the software development lifecycle. However, in the specific context of Machine Learning, there arises a need for a specialized approach that takes into account the peculiarities of the models and algorithms used. Machine Learning Operations (MLOps), despite its relative immaturity, aims to establish a framework for characterizing the ML development lifecycle, decoupling it from software development, and ensuring quality attributes such as scalability, maintainability, and security. It also grapples with challenges related to managing training data, security throughout the model analysis, development and deployment process, and the need for a quality-oriented culture. This thesis focuses on investigating how the implementa- tion of MLOps can positively impact the management of the ML development lifecycle, with the goal of contributing to knowledge in this emerging field and promoting the adoption of best practices in ML-based solutions. | eng |
dc.description.degreelevel | Maestría | spa |
dc.description.degreename | Magíster en Ingeniería - Ingeniería de Sistemas y Computación | spa |
dc.description.researcharea | Aplicaciones del machine learning operations | spa |
dc.format.extent | xiv, 103 páginas | spa |
dc.format.mimetype | application/pdf | spa |
dc.identifier.instname | Universidad Nacional de Colombia | spa |
dc.identifier.reponame | Repositorio Institucional Universidad Nacional de Colombia | spa |
dc.identifier.repourl | https://repositorio.unal.edu.co/ | spa |
dc.identifier.uri | https://repositorio.unal.edu.co/handle/unal/86035 | |
dc.language.iso | spa | spa |
dc.publisher | Universidad Nacional de Colombia | spa |
dc.publisher.branch | Universidad Nacional de Colombia - Sede Bogotá | spa |
dc.publisher.faculty | Facultad de Ingeniería | spa |
dc.publisher.place | Bogotá, Colombia | spa |
dc.publisher.program | Bogotá - Ingeniería - Maestría en Ingeniería - Ingeniería de Sistemas y Computación | spa |
dc.relation.references | Amershi, S., Begel, A., Bird, C., DeLine, R., Gall, H., Kamar, E., Maddox, W., Maarek, Y., McDirmid, S., New, M., et al. (2022). Software engineering for machine learning: An experience report. IEEE Software, 39(5):68–75. | spa |
dc.relation.references | Banerjee, A., Chen, C.-C., Hung, C.-C., Huang, X., Wang, Y., and Chevesaran, R. (2020). Challenges and experiences with mlops for performance diagnostics in hybrid-cloud enter- prise software deployments. | spa |
dc.relation.references | Baylor, D., Haas, K., Katsiapis, K., Leong, S., Liu, R., Menwald, C., Miao, H., Polyzotis, N., Trott, M., and Zinkevich, M. (2019). Continuous training for production tMLu in the tTensorFlowu extended (tttttTFXuuuuu) platform. In 2019 USENIX Conference on Operational Machine Learning (OpML 19), pages 51–53. | spa |
dc.relation.references | Benjumea, J., Ropero, J., Rivera, J., Mavrogiannopoulos, N., and Kamhoua, C. (2022). Catalog of requirements for artificial intelligence products. Future Internet, 14(5):118. | spa |
dc.relation.references | Bodor, A., Hnida, M., and Najima, D. (2023). MLOps: Overview of Current State and Future Directions, volume 629 LNNS. | spa |
dc.relation.references | Chatterjee, A., Ahmed, B., Hallin, E., and Engman, A. (2022a). Quality assurance in mlops setting: An industrial perspective. volume 3362. | spa |
dc.relation.references | Chatterjee, A., Ahmed, B. S., Hallin, E., and Engman, A. (2022b). Quality assurance in mlops setting: An industrial perspective. arXiv preprint arXiv:2211.12706. | spa |
dc.relation.references | Díaz, J., Pérez, J. E., Lopez-Peña, M. A., Mena, G. A., and Yagüe, A. (2019). Self-service cybersecurity monitoring as enabler for devsecops. Ieee Access, 7:100283–100295. | spa |
dc.relation.references | DrivenData (2023). Cookiecutter data science. https://drivendata.github.io/cookiecutter- data-science/. Último acceso: [Fecha de último acceso]. | spa |
dc.relation.references | Eck, B., Kabakci-Zorlu, D., Chen, Y., Savard, F., and Bao, X. (2022). A monitoring frame- work for deployed machine learning models with supply chain examples. arXiv preprint arXiv:2211.06239. | spa |
dc.relation.references | Fujii, T. Y., Hayashi, V. T., Arakaki, R., Ruggiero, W. V., Bulla Jr, R., Hayashi, F. H., and Khalil, K. A. (2021). A digital twin architecture model applied with mlops techniques to improve short-term energy consumption prediction. Machines, 10(1):23. | spa |
dc.relation.references | Gartner (2016). Gartner Says Business Intelligence and Analytics Leaders Must Focus on Mindsets and Culture to Kick Start Advanced Analytics. Technical report, Gartner. | spa |
dc.relation.references | Garzas, J. and Piattini, M. (2022). Failure factors in machine learning projects. In Advances in Intelligent Systems and Computing, volume 1197, pages 21–31. Springer. | spa |
dc.relation.references | Ghanta, S., Subramanian, S., Khermosh, L., Sundararaman, S., Shah, H., Goldberg, Y., Roselli, D., and Talagala, N. (2019). Ml health monitor: Taking the pulse of machine learning algorithms in production. volume 11139. | spa |
dc.relation.references | Google Cloud (2022). Mlops: Continuous delivery and automation pipelines in machine learning. | spa |
dc.relation.references | Gärtler, M., Khaydarov, V., Klöpper, B., and Urbas, L. (2021). The machine learning life cycle in chemical operations – status and open challenges. Chemie-Ingenieur-Technik, 93:2063–2080. | spa |
dc.relation.references | Hernandez, J., Daza, K., and Florez, H. (2022). Spiking neural network approach based on caenorhabditis elegans worm for classification. IAENG International Journal of Computer Science, 49(4). | spa |
dc.relation.references | Hewage, N. and Meedeniya, D. (2022). Machine learning operations: A survey on mlops tool support. arXiv. | spa |
dc.relation.references | Humble, J. and Kim, G. (2018). Accelerate: The science of lean software and devops: Building and scaling high performing technology organizations. IT Revolution. | spa |
dc.relation.references | John, M., Olsson, H., and Bosch, J. (2021). Towards mlops: A framework and maturity model. pages 334–341. | spa |
dc.relation.references | Kitchenham, B. and Brereton, P. (2013). A systematic review of systematic review process research in software engineering. Information and software technology, 55(12):2049–2075. | spa |
dc.relation.references | Kreuzberger, D., Kühl, N., and Hirschl, S. (2023). Machine learning operations (mlops): Overview, definition, and architecture. IEEE Access. | spa |
dc.relation.references | Lim, J., Lee, H., Won, Y., and Yeon, H. (2019). Mlop lifecycle scheme for vision-based inspection process in manufacturing. pages 9–11. | spa |
dc.relation.references | Lima, A., Monteiro, L., and Furtado, A. (2022). Mlops: Practices, maturity models, roles, tools, and challenges - a systematic literature review. volume 1, pages 308–320. | spa |
dc.relation.references | Liu, L. T., Wang, S., Britton, T., and Abebe, R. (2023). Reimagining the machine learning life cycle to improve educational outcomes of students. Proceedings of the National Academy of Sciences, 120(9):e2204781120. | spa |
dc.relation.references | Makinen, S., Skogstrom, H., Laaksonen, E., and Mikkonen, T. (2021). Who needs mlops: What data scientists seek to accomplish and how can mlops help? pages 109–112. | spa |
dc.relation.references | Malhotra, Y. (2022). How you can implement well-architected ‘zero trust’hybrid-cloud com- puting beyond ‘lift and shift’: cloud-enabled digital innovation at scale with infrastructure as code (iac), devsecops and mlops. In 2022 New York State Cyber Security Conference: Invited Presentations, Albany, New York: https://its. ny. gov/2022-nyscsc. | spa |
dc.relation.references | Martel, Y., Roßmann, A., Sultanow, E., Weiß, O., Wissel, M., Pelzel, F., and Seßler, M. (2021). Software architecture best practices for enterprise artificial intelligence. INFOR- MATIK 2020. | spa |
dc.relation.references | Mboweni, T., Masombuka, T., and Dongmo, C. (2022). A systematic review of machine learning devops. In 2022 International Conference on Electrical, Computer and Energy Technologies (ICECET), pages 1–6. IEEE. | spa |
dc.relation.references | Mejía, J. A. G. and González, F. (2022). Encuesta nacional “machine learning operations y sus desafíos de implementación en colombia”. Revista Sistemas, (165):20–26. | spa |
dc.relation.references | Mirza, B., Li, X., Lauwers, K., Reddy, B., Muller, A., Wozniak, C., and Djali, S. (2023). A clinical site workload prediction model with machine learning lifecycle. Healthcare Analytics, 3:100159. | spa |
dc.relation.references | Parihar, A. S., Gupta, U., Srivastava, U., Yadav, V., and Trivedi, V. K. (2023). Automa- ted machine learning deployment using open-source ci/cd tool. In Proceedings of Data Analytics and Management: ICDAM 2022, pages 209–222. Springer. | spa |
dc.relation.references | Paul, S. K., Riaz, S., and Das, S. (2022). A conceptual architecture for ai in supply chain risk management. In TENCON 2022-2022 IEEE Region 10 Conference (TENCON), pages 1–5. IEEE. | spa |
dc.relation.references | Recupito, G., Pecorelli, F., Catolino, G., Moreschini, S., Nucci, D., Palomba, F., and Tam- burri, D. (2022). A multivocal literature review of mlops tools and features. pages 84–91. | spa |
dc.relation.references | Reddy, M., Dattaprakash, B., Kammath, S., Kn, S., Manokaran, S., and Be, R. (2022). Application of mlops in prediction of lifestyle diseases. ECS Transactions, 107(1):1191. | spa |
dc.relation.references | Robertson, J. and Robertson, S. (2000). Volere. Requirements Specification Templates. | spa |
dc.relation.references | Saeed, W. and Omlin, C. (2023). Explainable ai (xai): A systematic meta-survey of current challenges and future opportunities. Knowledge-Based Systems, page 110273. | spa |
dc.relation.references | Schulz, C., Gao, J., and Sun, T. (2020). Mlops: A literature review. In 2020 IEEE 1st International Workshop on Machine Learning in Business Process Management (BPML). IEEE. | spa |
dc.relation.references | Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., Chaudhary, V., Young, M., Crespo, J.-F., and Dennison, D. (2015). Hidden technical debt in machine learning systems. Advances in neural information processing systems, 28. | spa |
dc.relation.references | Treveil, M., Omont, N., Stenac, C., Lefevre, K., Phan, D., Zentici, J., Lavoillotte, A., Miya- zaki, M., and Heidmann, L. (2020). Introducing MLOps. O’Reilly Media. | spa |
dc.relation.references | Yasar, H. (2020). Leveraging devops and devsecops to accelerate ai development and deploy- ment. CARNEGIEMELLON UNIV PITTSBURGH PA PITTSBURGH United States. | spa |
dc.relation.references | Zhang, X. and Jaskolka, J. (2022). Conceptualizing the secure machine learning operations (secmlops) paradigm. In 2022 IEEE 22nd International Conference on Software Quality, Reliability and Security (QRS), pages 127–138. IEEE. | spa |
dc.relation.references | Zhou, Y., Yu, Y., and Ding, B. (2020). Towards mlops: A case study of ml pipeline platform. pages 494–500. | spa |
dc.rights.accessrights | info:eu-repo/semantics/openAccess | spa |
dc.rights.license | Atribución-NoComercial 4.0 Internacional | spa |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | spa |
dc.subject.ddc | 620 - Ingeniería y operaciones afines::629 - Otras ramas de la ingeniería | spa |
dc.subject.ddc | 000 - Ciencias de la computación, información y obras generales::005 - Programación, programas, datos de computación | spa |
dc.subject.proposal | Machine Learning | spa |
dc.subject.proposal | MLOps | spa |
dc.subject.proposal | DevSecOps | spa |
dc.subject.proposal | Gestión de datos | spa |
dc.subject.proposal | Innovación tecnológica | spa |
dc.subject.proposal | Desarrollo de software | spa |
dc.subject.proposal | Machine Learning | eng |
dc.subject.proposal | MLOps | eng |
dc.subject.proposal | DevSecOps | eng |
dc.subject.proposal | Security | eng |
dc.subject.proposal | Data management | eng |
dc.subject.proposal | Technological innovation | eng |
dc.subject.proposal | Software development | eng |
dc.subject.wikidata | Aprendizaje automático | spa |
dc.subject.wikidata | machine learning | eng |
dc.subject.wikidata | Integridad de datos | spa |
dc.subject.wikidata | data integrity | eng |
dc.subject.wikidata | Gestión de datos | spa |
dc.subject.wikidata | data management | eng |
dc.title | Machine Learning Operations aplicado al proceso de desarrollo y aprovisionamiento de modelos | spa |
dc.title.translated | Machine Learning Operations applied to the process of model development and provisioning | eng |
dc.type | Trabajo de grado - Maestría | spa |
dc.type.coar | http://purl.org/coar/resource_type/c_bdcc | spa |
dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | spa |
dc.type.content | Text | spa |
dc.type.driver | info:eu-repo/semantics/masterThesis | spa |
dc.type.redcol | http://purl.org/redcol/resource_type/TM | spa |
dc.type.version | info:eu-repo/semantics/acceptedVersion | spa |
dcterms.audience.professionaldevelopment | Público general | spa |
oaire.accessrights | http://purl.org/coar/access_right/c_abf2 | spa |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- 80121552.2024.pdf
- Tamaño:
- 6.68 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Tesis de Maestría en Ingeniería - Ingeniería de Sistemas y Computación
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 5.74 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción: