Solución basada en software para la reconciliación transaccional enfocada en la gestión de datos financieros de Clip MX utilizando Databricks
| dc.contributor.advisor | Vergara Vargas, Jeisson Andrés | |
| dc.contributor.author | Neira Embus, Manuel Fernando | |
| dc.contributor.researchgroup | Colectivo de Investigación en Ingeniería de Software Colswe | |
| dc.date.accessioned | 2026-02-17T13:45:46Z | |
| dc.date.available | 2026-02-17T13:45:46Z | |
| dc.date.issued | 2025 | |
| dc.description | ilustraciones a color, diagramas | spa |
| dc.description.abstract | En el contexto empresarial actual, la eficiencia en la gestión financiera se ha convertido en un aspecto determinante para el éxito estratégico, operativo y normativo de las organizaciones. Las compañías del sector financiero enfrentan continuamente desafíos derivados del incremento en la cantidad, variedad y velocidad con la que los datos transaccionales son generados y procesados. Clip, reconocida fintech mexicana especializada en soluciones integradas de pago electrónico, no es ajena a esta realidad, ya que la empresa ha experimentado un crecimiento sostenido en el volumen de transacciones electrónicas que procesa diariamente. Esta situación le ha generado retos significativos en términos de gestión y análisis de datos financieros, particularmente en el área de reconciliación transaccional. Históricamente, Clip ha enfrentado desafíos operativos en la reconciliación financiera, relacionados principalmente con procesos manuales y semi-automatizados que generaban altos costos operativos, limitaciones en términos de dependencia de plataformas tecnológicas específicas como Snowflake. Estas limitantes se traducían en demoras operativas, alto riesgo de errores y una flexibilidad reducida ante cambios normativos o incrementos en el volumen transaccional. En respuesta a estas necesidades, la presente investigación desarrolla una solución basada en software para optimizar el proceso de reconciliación financiera transaccional en Clip MX mediante la plataforma Databricks. La propuesta se fundamenta en un análisis detallado de los procesos operativos existentes en la compañía, identificando tanto las necesidades funcionales (capacidad de procesamiento, precisión en la reconciliación, reportabilidad) como las no funcionales (escalabilidad, portabilidad, seguridad de la información). La solución integra tecnologías avanzadas como procesamiento distribuido mediante Apache Spark, almacenamiento eficiente y versionado utilizando Delta Lake y la automatización de flujos de trabajo y orquestación de procesos mediante Airflow. Además, se establecen mecanismos robustos de monitoreo en tiempo real mediante integración con plataformas como Slack, que permiten identificar y responder de manera oportuna ante cualquier anomalía. De esa manera, la evaluación y validación del sistema se realizan mediante métricas específicas relacionadas con efectividad operativa, precisión de resultados, eficiencia temporal y reducción de costos operativos, de manera que se asegura que la solución propuesta contribuya efectivamente al fortalecimiento del proceso de gestión financiera de Clip. (Texto tomado de la fuente) | spa |
| dc.description.abstract | In the current business context, efficiency in financial management has become a critical factor for organizations’ strategic, operational, and regulatory success. Companies in the financial sector continuously face challenges, which result from the increasing volume, variety, and velocity of generating and processing transactional data. Clip, a renowned Mexican fintech specialized in integrated electronic payment solutions, is no exception to this reality. The company has experienced sustained growth in the volume of electronic transactions processed daily, posing significant challenges in financial data management and analysis, particularly in transactional reconciliation. Historically, Clip has encountered operational difficulties in financial reconciliation, primarily associated with manual and semiautomated processes that resulted in high operational costs and dependence on specific technological platforms such as Snowflake. These limitations are translated into operational delays, a high risk of errors, and reduced flexibility when responding to regulatory changes or increases in transaction volume. In response to these requirements, this study develops a software-based solution to optimize the transactional financial reconciliation process at Clip MX using the Databricks platform. The proposed solution is grounded in a comprehensive analysis of the company’s existing operational processes, identifying both functional requirements (processing capacity, reconciliation accuracy, and reportability) and non-functional requirements (scalability, portability, and information security). The solution incorporates advanced technologies such as distributed processing using Apache Spark, efficient and versioned storage with Delta Lake, workflow automation, and process orchestration through Airflow. Furthermore, robust real-time monitoring mechanisms have been established through integration with platforms like Slack, enabling timely identification and response to any anomalies. The evaluation and validation of the system are conducted using specific metrics related to operational effectiveness, accuracy of results, time efficiency, and operational cost reduction, thus ensuring that the proposed solution effectively strengthens Clip’s financial management processes. | eng |
| dc.description.degreelevel | Maestría | |
| dc.description.degreename | Magíster en Ingeniería de Sistemas y Computación | |
| dc.description.researcharea | Ingeniería de Software → Arquitectura de Software | |
| dc.description.technicalinfo | Databricks, SQL, Python | spa |
| dc.format.extent | xv, 53 páginas | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.instname | Universidad Nacional de Colombia | spa |
| dc.identifier.reponame | Repositorio Institucional Universidad Nacional de Colombia | spa |
| dc.identifier.repourl | https://repositorio.unal.edu.co/ | spa |
| dc.identifier.uri | https://repositorio.unal.edu.co/handle/unal/89576 | |
| dc.language.iso | spa | |
| dc.publisher | Universidad Nacional de Colombia | |
| dc.publisher.branch | Universidad Nacional de Colombia - Sede Bogotá | |
| dc.publisher.faculty | Facultad de Ingeniería | |
| dc.publisher.place | Bogotá, Colombia | |
| dc.publisher.program | Bogotá - Ingeniería - Maestría en Ingeniería - Ingeniería de Sistemas y Computación | |
| dc.relation.references | Security Standards Council, “Payment Card Industry Data Security Standard,” 2024. | |
| dc.relation.references | ISO IEC, “ISO IEC 27001 Information technology — Security techniques — Information security management systems — Requirements,” 2013. | |
| dc.relation.references | National Institute of Standards and Technology, “The NIST Cybersecurity Framework (CSF) 2.0,” Feb. 2024. [Online]. Available: https://nvlpubs.nist.gov/nistpubs/CSWP/NIST.CSWP.29.pdf | |
| dc.relation.references | Project Management Institute, Ed., The Agile practice guide. Newtown Square, Pennsylvania: The Project Mana- gement Institute, 2017. | |
| dc.relation.references | Databricks, “What is a Medallion Architecture?” Mar. 2022. [Online]. Available: https://www.databricks.com/ glossary/medallion-architecture | |
| dc.relation.references | R. Zagni, Data engineering with dbt: a practical guide to building a cloud-based, pragmatic, and dependable data platform with SQL, 1st ed. Birmingham, UK: Packt Publishing Ltd., 2023, oCLC: 1389877060. | |
| dc.relation.references | R. Ilijason, Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud. Berkeley, CA: Apress, 2020. [Online]. Available: http://link.springer.com/10.1007/978-1-4842-5781-4 | |
| dc.relation.references | Banorte, “Automatic Report Specification,” Nov. 2018. | |
| dc.relation.references | Stockgro, “Payment Aggregators: Role and Benefits in payments Explained,” Oct. 2024. [Online]. Available: https://www.stockgro.club/blogs/personal-finance/what-is-payment-aggregator/ | |
| dc.relation.references | PayClip, “Página principal PayClip.” [Online]. Available: https://www.clip.mx/ | |
| dc.relation.references | Lucchesi Cristiane and Barrera Cyntia, “Nace un nuevo ‘unicornio’ en México: Clip, fintech de pagos dirigida a PyMes,” Jun. 2021. [Online]. Available: https://www.elfinanciero.com.mx/tech/2021/06/10/ nace-un-nuevo-unicornio-en-mexico-clip-fintech-de-pagos-en-dirigida-a-pymes/ | |
| dc.relation.references | Gomez Santiago and Casas Alejandro, “Simetrik.” [Online]. Available: https://www.simetrik.com/ | |
| dc.relation.references | BlackLine, “BlackLine: The Unified Cloud for Finance and Accounting Automation.” [Online]. Available: https://www.blackline.com/ | |
| dc.relation.references | ReconArt, “ReconArt World Class Reconciliation Software.” [Online]. Available: https://www.reconart.com/ | |
| dc.relation.references | Snowflake, “The Snowflake Data Cloud - Mobilize Data, Apps, and AI.” [Online]. Available: https: //www.snowflake.com/content/snowflake-site/global/en | |
| dc.relation.references | R. Lovas, E. Nagy, and J. Kovács, “Cloud agnostic Big Data platform focusing on scalability and cost-efficiency,” Advances in Engineering Software, vol. 125, pp. 167–177, Nov. 2018. [Online]. Available: https://linkinghub.elsevier.com/retrieve/pii/S096599781730889X | |
| dc.relation.references | X. Xing, “Financial Big Data Reconciliation Method,” in Proceedings - 2021 International Symposium on Advances in Informatics, Electronics and Education, ISAIEE 2021. Institute of Electrical and Electronics Engineers Inc., 2021, pp. 260–263. | |
| dc.relation.references | Alexander S. Gillis, “What is Secure File Transfer Protocol (SFTP)? A Definition from Tech- Target.com,” 2022. [Online]. Available: https://www.techtarget.com/searchcontentmanagement/definition/ Secure-File-Transfer-Protocol-SSH-File-Transfer-Protocol | |
| dc.relation.references | Amazon Web Services INC, “¿Qué es Amazon S3? - Amazon Simple Storage Service,” 2025. [Online]. Available: https://docs.aws.amazon.com/es_es/AmazonS3/latest/userguide/Welcome.html | |
| dc.relation.references | V. N. Chukwuani, “The Transformational Impact of Automation and Artificial Intelligence on the Accounting Profession,” International Journal of Accounting and Financial Risk Management, Oct. 2024, publisher: AIR JOURNALS. [Online]. Available: https://zenodo.org/doi/10.5281/zenodo.14546797 | |
| dc.relation.references | Y. Alfiana and D. Wandira, “The Impact Of Digital Transactions In The Reconciliation Process And Preparation Of Financial Reports Of Culinary Msmes In Palembang City,” EKOMBIS REVIEW: Jurnal Ilmiah Ekonomi Dan Bisnis, vol. 12, no. 1, p. 12, 2024. [Online]. Available: https://doi.org/10.37676/ekombis.v12i1 | |
| dc.relation.references | A. R. Malipeddi, “Cloud-Driven Financial Reconciliation for Insurers: Overcoming Data Complexity,” Journal of Computer Science and Technology Studies, May 2025. | |
| dc.relation.references | Nicanor Medina, “How to Migrate Databricks from GCP to Azure or AWS,” Oct. 2024. [Online]. Available: https://www.sunnydata.ai/blog/how-to-migrate-databricks-from-gcp-to-azure-or-aws | |
| dc.relation.references | Databricks, “Data Lakehouse Architecture,” Dec. 2022. [Online]. Available: https://www.databricks.com/ product/data-lakehouse | |
| dc.relation.references | Databricks, “Best practices for performance efficiency | Databricks Documentation,” Jun. 2025. [Online]. Available: https://docs.databricks.com/aws/en/lakehouse-architecture/performance-efficiency/best-practices | |
| dc.relation.references | Naseer Ahmed, Igor Alekseev, “How to Orchestrate Databricks Workloads on AWS With Managed Workflows for Apache Airflow,” Jul. 2022. [Online]. Available: https://www.databricks.com/blog/2022/01/27/ orchestrating-databricks-workloads-on-aws-with-managed-workflows-for-apache-airflow.html | |
| dc.relation.references | Tracy Rericha, “Webhook client | Python Slack SDK,” Jun. 2025. [Online]. Available: https://tools.slack.dev/ python-slack-sdk/webhook/ | |
| dc.relation.references | IBM, “Listener Java API Terminology,” Jan. 2025. [Online]. Available: https://www.ibm.com/docs/en/ filenet-p8-platform/5.6.0?topic=api-system-manager-listener-java-terminology | |
| dc.relation.references | M. Armbrust, T. Das, L. Sun, B. Yavuz, S. Zhu, M. Murthy, J. Torres, H. van Hovell, A. Ionescu, A. Łuszczak, M. Świtakowski, M. Szafrański, X. Li, T. Ueshin, M. Mokhtar, P. Boncz, A. Ghodsi, S. Paranjpye, P. Senster, R. Xin, and M. Zaharia, “Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores,” Proceedings of the VLDB Endowment, vol. 13, no. 12, pp. 3411–3424, 2020, publisher: VLDB Endowment. | |
| dc.relation.references | SentinelOne, “What is Hashing?” Mar. 2025. [Online]. Available: https://www.sentinelone.com/ cybersecurity-101/cybersecurity/hashing/ | |
| dc.relation.references | Databricks, “What is Photon? | Databricks Documentation,” Jan. 2025. [Online]. Available: https: //docs.databricks.com/aws/en/compute/photon | |
| dc.relation.references | Databricks, “Pricing Calculator,” Oct. 2022. [Online]. Available: https://www.databricks.com/product/pricing/ product-pricing/instance-types | |
| dc.rights.accessrights | info:eu-repo/semantics/openAccess | |
| dc.rights.license | Atribución-NoComercial 4.0 Internacional | |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | |
| dc.subject.ddc | 000 - Ciencias de la computación, información y obras generales::006 - Métodos especiales de computación | |
| dc.subject.ddc | 000 - Ciencias de la computación, información y obras generales::004 - Procesamiento de datos Ciencia de los computadores | |
| dc.subject.ddc | 350 - Administración pública y ciencia militar::352 - Consideraciones generales y administración pública | |
| dc.subject.lemb | GESTION FINANCIERA | spa |
| dc.subject.lemb | Financial management | eng |
| dc.subject.lemb | PLANIFICACION FINANCIERA | spa |
| dc.subject.lemb | Financial planning | eng |
| dc.subject.lemb | PLANIFICACION ESTRATEGICA | spa |
| dc.subject.lemb | Strategic Planning-LC | eng |
| dc.subject.lemb | ANALISIS DE INFORMACION | spa |
| dc.subject.lemb | Information analysis | eng |
| dc.subject.lemb | BANCOS-PROCESAMIENTO DE DATOS | spa |
| dc.subject.lemb | Banks and banking - data processing | eng |
| dc.subject.lemb | PROCESAMIENTO DE DATOS EN TIEMPO REAL | spa |
| dc.subject.lemb | Real-time data processing | eng |
| dc.subject.lemb | INTELIGENCIA ARTIFICIAL-PROCESAMIENTO DE DATOS | spa |
| dc.subject.lemb | Artificial intelligen - data processing | eng |
| dc.subject.lemb | SERVICIOS FINANCIEROS | spa |
| dc.subject.lemb | Financial services | eng |
| dc.subject.proposal | Arquitectura de Software | spa |
| dc.subject.proposal | Reconciliación Financiera | spa |
| dc.subject.proposal | Financial Reconciliation | eng |
| dc.subject.proposal | Finctech | eng |
| dc.subject.proposal | Gestión de Datos | spa |
| dc.subject.proposal | Databricks | eng |
| dc.subject.proposal | Data Management | eng |
| dc.subject.proposal | Software Architecture | eng |
| dc.title | Solución basada en software para la reconciliación transaccional enfocada en la gestión de datos financieros de Clip MX utilizando Databricks | spa |
| dc.title.translated | A software-based solution for transactional reconciliation focused on Clip MX financial data management using Databricks | eng |
| dc.type | Trabajo de grado - Maestría | |
| dc.type.coar | http://purl.org/coar/resource_type/c_bdcc | |
| dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | |
| dc.type.content | Text | |
| dc.type.driver | info:eu-repo/semantics/masterThesis | |
| dc.type.redcol | http://purl.org/redcol/resource_type/TM | |
| dc.type.version | info:eu-repo/semantics/acceptedVersion | |
| dcterms.audience.professionaldevelopment | Estudiantes | |
| dcterms.audience.professionaldevelopment | Investigadores | |
| dcterms.audience.professionaldevelopment | Maestros | |
| dcterms.audience.professionaldevelopment | Público general | |
| oaire.accessrights | http://purl.org/coar/access_right/c_abf2 |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- TF_de_Maestría___Manuel_Fernando_Neira_Embus_2026-02-16.pdf
- Tamaño:
- 3.73 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
- Tesis de Maestría en Ingeniería de Sistemas y Computación
Bloque de licencias
1 - 1 de 1
Cargando...
- Nombre:
- license.txt
- Tamaño:
- 5.74 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción:

