Identificación de eventos delictivos a partir de señales de audio utilizando modos de correlación wavelet

Álvarez Osorio, Carlos Andres

Identificación de eventos delictivos a partir de señales de audio utilizando modos de correlación wavelet

dc.contributor.advisor	Bolaños Martinez, Freddy
dc.contributor.advisor	Fletscher Bocanegra, Luis Alejandro
dc.contributor.author	Álvarez Osorio, Carlos Andres
dc.coverage.city	Medellín (Antioquia, Colombia)
dc.date.accessioned	2024-07-23T20:23:58Z
dc.date.available	2024-07-23T20:23:58Z
dc.date.issued	2024
dc.description	Ilustraciones, gráficos	spa
dc.description.abstract	Durante muchos años, la seguridad en los entornos urbanos ha sido una preocupación para los habitantes de todas las ciudades del mundo. A causa de la inseguridad que existe en mayor o menor medida, los gobiernos en diferentes regiones del mundo buscan continuamente mecanismos para prevenir actos criminales. Este estudio presenta un método que busca prevenir delitos utilizando señales de audio que pueden capturarse en un entorno urbano. Por ello, se utiliza una técnica emergente, denominada modos de correlación wavelet, con el fin de representar señales de audio de forma compacta, tras lo cual se implementa un método basado en aprendizaje automático para clasificar las señales de audio en dos categorías: aquellas asociadas con un evento o crimen violento o aquellos asociados con un evento común (no violento o no criminal). En el estudio realizado, fue posible concluir que los modos de correlación de wavelet permiten realizar la clasificación de este tipo de señales con una precisión mayor al 80 % y con tiempos de ejecución menores a 1 segundo, utilizando un máximo de 4 características para el entrenamiento de los modelos. Las pruebas se realizaron en un computador portátil con sistema operativo de 64 bits, procesador x64, Windows 11, procesador AMD Ryzen 5 5500U, 16 Gb de RAM. (Tomado de la fuente)	spa
dc.description.abstract	For many years, safety in urban environments has been a concern for inhabitants of all cities worldwide. Because of the insecurity that exists to a greater or lesser extent, governments in different regions of the world continually seeks mechanisms to prevent criminal acts. This study introduces a method to prevent crime events using audio signals that can be captured in an urban environment. For this, an emerging technique, called wavelet correlation modes, is used to represent audio signals in a compact manner, following which a machine learning based method is implemented to classify the audio signals into two categories: those associated with a violent event or crime or those associated with a common event (nonviolent or noncriminal). In the study carried out, it was possible to conclude that wavelet compression modes allow the classification of this type of signals with a precision greater than 80 % and with execution times less than 1 second, using a maximum of 4 characteristics for the model training. The tests were carried out on a laptop with a 64-bit operating system, x64 processor, Windows 11, AMD Ryzen 5 5500U processor, 16 Gb of RAM.	eng
dc.description.curriculararea	Ingeniería Eléctrica E Ingeniería De Control.Sede Medellín	spa
dc.description.degreelevel	Maestría	spa
dc.description.researcharea	Procesamiento Digital de Señales e Inteligencia Artificial	spa
dc.description.sponsorship	Este trabajo fue apoyado por el Fondo de Ciencia, Tecnología e Innovación (FCTeI) del Sistema General de Regalías (SGR) bajo el proyecto identificado con el código BPIN 2020000100044	spa
dc.format.extent	54 páginas	spa
dc.format.mimetype	application/pdf	spa
dc.identifier.instname	Universidad Nacional de Colombia	spa
dc.identifier.reponame	Repositorio Institucional Universidad Nacional de Colombia	spa
dc.identifier.repourl	https://repositorio.unal.edu.co/	spa
dc.identifier.uri	https://repositorio.unal.edu.co/handle/unal/86601
dc.language.iso	spa	spa
dc.publisher	Universidad Nacional de Colombia	spa
dc.publisher.branch	Universidad Nacional de Colombia - Sede Medellín	spa
dc.publisher.faculty	Facultad de Minas	spa
dc.publisher.place	Medellín, Colombia	spa
dc.publisher.program	Medellín - Minas - Maestría en Ingeniería - Automatización Industrial	spa
dc.relation.indexed	LaReferencia	spa
dc.relation.references	A. Gholamy, V. Kreinovich, and O. Kosheleva, ‘‘Why 70/30 or 80/20 relation between training and testing sets: A pedagogical explanation,’’ 2018.	spa
dc.relation.references	G. T. S. R. R. R. Axton Pitt, Digl Dixon, ‘‘Litmaps,’’ 2023 Litmaps Ltd., 2023	spa
dc.relation.references	S. Waldekar and G. Saha, ‘‘Analysis and classification of acoustic scenes with wavelet transform-based mel-scaled features,’’ Multimedia Tools and Applications, vol. 79, pp. 7911--7926, 3 2020.	spa
dc.relation.references	G. Shen and B. Liu, ‘‘The visions, technologies, applications and security issues of internet of things,’’ pp. 1--4, IEEE, 5 2011	spa
dc.relation.references	Markets and Markets, ‘‘Research and markets, internet of things (iot) market global forecast to 2021,’’ 2022.	spa
dc.relation.references	J. . T. J. . T. H. . W. U. H. R. Bowerman, B. Braverman, ‘‘The vision of a smart city. 2nd int. life ..,’’ 2000	spa
dc.relation.references	A. Gobernación, ‘‘Plan de desarrollo unidos por la vida,’’ 2020	spa
dc.relation.references	CEJ, ‘‘En 2021 aumentó el hurto a personas y otros delitos, advierte el reloj de la criminalidad de la cej,’’ 2021	spa
dc.relation.references	M. cómo vamos, ‘‘Informe de calidad de vida de medellÍn, 2020. seguridad ciudadana y convivencia,’’ 2020.	spa
dc.relation.references	J. D. Rodriguez-Ortega, Y. A. A. Duarte-VelÃ!‘squez, C. GÃ-Toro, and J. A. Cadavid-Carmona, ‘‘Seguridad ciudadana, violencia y criminalidad: una visiÃholÃstica y criminolÃde las cifras estadÃsticas del 2018,’’ Revista Criminalidad, vol. 61, pp. 9 -- 58, 12 2019	spa
dc.relation.references	F. L. A. Tamayo-Arboleda and E. Norza, ‘‘Midiendo el crimen: cifras de criminalidad y operatividad policial en Colombia, aÃ2017,’’ Revista Criminalidad, vol. 60, pp. 49 -- 71, 12 2018.	spa
dc.relation.references	S. Mondal and A. Das Barman, ‘‘Deep learning technique based real-time audio event detection experiment in a distributed system architecture,’’ Computers and Electrical Engineering, vol. 102, p. 108252, 2022.	spa
dc.relation.references	Y. Yamamoto, J. Nam, H. Terasawa, and Y. Hiraga, ‘‘Investigating time-frequency representations for audio feature extraction in singing technique classification,’’ in 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), pp. 890--896, 2021	spa
dc.relation.references	M. Esmaeilpour, P. Cardinal, and A. L. Koerich, ‘‘Unsupervised feature learning for environmental sound classification using weighted cycle-consistent generative adversarial network,’’ Applied Soft Computing, vol. 86, p. 105912, 1 2020.	spa
dc.relation.references	I. D. Ilyashenko, R. S. Nasretdinov, Y. A. Filin, and A. A. Lependin, ‘‘Trainable wavelet-like transform for feature extraction to audio classification,’’ Journal of Physics: Conference Series, vol. 1333, p. 32029, 10 2019.	spa
dc.relation.references	D. Guillén, H. Esponda, E. Vázquez, and G. Idárraga-Ospina, ‘‘Algorithm for transformer differential protection based on wavelet correlation modes,’’ IET Generation, Transmission & Distribution, vol. 10, no. 12, pp. 2871--2879, 2016.	spa
dc.relation.references	A. Rabaoui, M. Davy, S. Rossignol, and N. Ellouze, ‘‘Using one-class svms and wavelets for audio surveillance,’’ IEEE Transactions on Information Forensics and Security, vol. 3, pp. 763--775, 12 2008	spa
dc.relation.references	A. I. Middya, B. Nag, and S. Roy, ‘‘Deep learning based multimodal emotion recognition using model-level fusion of audio–visual modalities,’’ Knowledge-Based Systems, vol. 244, p. 108580, 5 2022	spa
dc.relation.references	Y. Xu, J. Yang, and K. Mao, ‘‘Semantic-filtered soft-split-aware video captioning with audio-augmented feature,’’ Neurocomputing, vol. 357, pp. 24--35, 9 2019	spa
dc.relation.references	S. Durai, ‘‘Wavelet based feature vector formation for audio signal classification,’’ 10 2007	spa
dc.relation.references	T. Düzenli and N. Ozkurt, ‘‘Comparison of wavelet based feature extraction methods for speech/music discrimination,’’ Istanbul University - Journal of Electrical and Electronics Engineering, vol. 11, pp. 617- -621, 10 2011.	spa
dc.relation.references	G. Tzanetakis, G. Essl, and P. Cook, ‘‘Audio analysis using the discrete wavelet transform,’’ pp. 318--323, 10 2001	spa
dc.relation.references	C.-C. Lin, S.-H. Chen, T.-K. Truong, and Y. Chang, ‘‘Audio classification and categorization based on wavelets and support vector machine,’’ IEEE Transactions on Speech and Audio Processing, vol. 13, pp. 644--651, 2005.	spa
dc.relation.references	K. Kim, D. H. Youn, and C. Lee, ‘‘Evaluation of wavelet filters for speech recognition,’’ SMC 2000 Confe- rence Proceedings. 2000 IEEE International Conference on Systems, Man and Cybernetics. ’Cybernetics Evolving to Systems, Humans, Organizations, and their Complex Interactions’ (Cat. No.00CH37166), pp. 2891--2894, 2000.	spa
dc.relation.references	J. Gowdy and Z. Tufekci, ‘‘Mel-scaled discrete wavelet coefficients for speech recognition,’’ pp. 1351--1354, IEEE, 2000.	spa
dc.relation.references	S. K. Kopparapu and M. Laxminarayana, ‘‘Choice of mel filter bank in computing mfcc of a resampled speech,’’ pp. 121--124, IEEE, 5 2010.	spa
dc.relation.references	K. V. K. Kishore and P. K. Satish, ‘‘Emotion recognition in speech using mfcc and wavelet features,’’ pp. 842--847, 2013.	spa
dc.relation.references	J. Salamon, C. Jacoby, and J. P. Bello, ‘‘A dataset and taxonomy for urban sound research,’’ pp. 1041-- 1044, ACM, 11 2014.	spa
dc.relation.references	A. Rakotomamonjy and G. Gasso, ‘‘Histogram of gradients of time-frequency representations for audio scene detection,’’ IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 1--1, 2014.	spa
dc.relation.references	S. R. Kadiri and P. Alku, ‘‘Subjective evaluation of basic emotions from audio–visual data,’’ Sensors, vol. 22, p. 4931, 6 2022.	spa
dc.relation.references	K. Umapathy, S. Krishnan, and R. K. Rao, ‘‘Audio signal feature extraction and classification using local discriminant bases,’’ IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, pp. 1236--1246, 2007	spa
dc.relation.references	F. Weninger, F. Eyben, B. W. Schuller, M. Mortillaro, and K. R. Scherer, ‘‘On the acoustics of emotion in audio: What speech, music, and sound have in common,’’ Frontiers in Psychology, vol. 4, 2013	spa
dc.relation.references	P. Wu, J. Liu, Y. Shi, Y. Sun, F. Shao, Z. Wu, and Z. Yang, ‘‘Not only look, but also listen: Learning multimodal violence detection under weak supervision,’’ pp. 322--339, 10 2020	spa
dc.relation.references	J. Liu, Y. Zhang, D. Lv, J. Lu, H. Xu, S. Xie, and Y. Xiong, ‘‘Classification method of birdsong based on gaborwt feature image and convolutional neural network,’’ pp. 134--140, 2021	spa
dc.relation.references	W. Dai, C. Dai, S. Qu, J. Li, and S. Das, ‘‘Very deep convolutional neural networks for raw waveforms,’’ pp. 421--425, IEEE, 3 2017.	spa
dc.relation.references	L. Deng, G. Hinton, and B. Kingsbury, ‘‘New types of deep neural network learning for speech recognition and related applications: an overview,’’ pp. 8599--8603, IEEE, 5 2013	spa
dc.relation.references	J. Sharma, O.-C. Granmo, and M. Goodwin, ‘‘Environment sound classification using multiple feature channels and deep convolutional neural networks,’’ 10 2019	spa
dc.relation.references	S. Lee and H.-S. Pang, ‘‘Feature extraction based on the non-negative matrix factorization of convolu- tional neural networks for monitoring domestic activity with acoustic signals,’’ IEEE Access, vol. 8, pp. 122384--122395, 2020	spa
dc.relation.references	J. Salamon and J. P. Bello, ‘‘Deep convolutional neural networks and data augmentation for environ- mental sound classification,’’ IEEE Signal Processing Letters, vol. 24, pp. 279--283, 3 2017.	spa
dc.relation.references	T. Lv, H. yong Zhang, and C. hui Yan, ‘‘Double mode surveillance system based on remote audio/video signals acquisition,’’ Applied Acoustics, vol. 129, pp. 316--321, 1 2018.	spa
dc.relation.references	G. Parascandolo, H. Huttunen, and T. Virtanen, ‘‘Recurrent neural networks for polyphonic sound event detection in real life recordings,’’ pp. 6440--6444, IEEE, 3 2016	spa
dc.relation.references	Y. Tokozume and T. Harada, ‘‘Learning environmental sounds with end-to-end convolutional neural network,’’ pp. 2721--2725, IEEE, 3 2017.	spa
dc.relation.references	D. Burgund, S. Nikolovski, D. Galić, and N. Maravić, ‘‘Pearson correlation in determination of quality of current transformers,’’ Sensors, vol. 23, no. 5, 2023	spa
dc.relation.references	G. S. Shuhe Han, ‘‘Protection algorithm based on deep convolution neural network algorithm,’’ Compu- tational Intelligence and Neuroscience, 2022.	spa
dc.relation.references	A. Roy, D. Singh, R. K. Misra, and A. Singh, ‘‘Differential protection scheme for power transformers using matched wavelets,’’ IET Generation, Transmission &amp Distribution, vol. 13, pp. 2423--2437, May 2019.	spa
dc.relation.references	S. Jena and B. R. Bhalja, ‘‘Initial travelling wavefront-based bus zone protection scheme,’’ IET Generation, Transmission &amp Distribution, vol. 13, pp. 3216--3229, July 2019.	spa
dc.relation.references	R. C. Hendriks and T. Gerkmann, ‘‘Noise correlation matrix estimation for multi-microphone speech enhancement,’’ IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 223- -233, 2012.	spa
dc.relation.references	H. Hu, Z. He, Y. Zhang, and S. Gao, ‘‘Modal frequency sensitivity analysis and application using complex nodal matrix,’’ IEEE Transactions on Power Delivery, vol. 29, no. 2, pp. 969--971, 2014	spa
dc.relation.references	P. S. Addison, The Illustrated Wavelet Transform Handbook. CRC Press, Jan. 2017	spa
dc.relation.references	E. Johansson, ‘‘Wavelet theory and some of its applications,’’ Luleå University of Technology Department of Mathematics, vol. 48, 10 2005.	spa
dc.relation.references	R. R. Merry, ‘‘Wavelet theory and applications : a literature study,’’ 2005.	spa
dc.relation.references	A. Hadd and J. L. Rodgers, Understanding correlation matrices, vol. 186. SAGE Publications, Inc, 1 2021.	spa
dc.relation.references	B. Kröse, B. Krose, P. van der Smagt, and P. Smagt, ‘‘An introduction to neural networks,’’ J Comput Sci, vol. 48, 01 1996.	spa
dc.relation.references	M. Awad and R. Khanna, ‘‘Support vector machines for classification,’’ pp. 39--66, 04 2015	spa
dc.relation.references	N. Shreyas, M. Venkatraman, S. Malini, and S. Chandrakala, ‘‘Trends of sound event recognition in audio surveillance: A recent review and study,’’ The Cognitive Approach in Cloud Computing and Internet of Things Technologies for Surveillance Tracking Systems, pp. 95--106, 2020	spa
dc.relation.references	M. A. E. M. Mohamed Elesawy, Mohamad Hussein, ‘‘Real life violence situations dataset,’’ Kaggle, 2019.	spa
dc.relation.references	T. S. (2019), ‘‘Sound events for surveillance applications (1.0.0) [data set],’’ 2019.	spa
dc.relation.references	S. R. Livingstone and F. A. Russo, ‘‘Ravdess emotional speech audio,’’ 2019.	spa
dc.relation.references	B. McFee, M. McVicar, D. Faronbi, I. Roman, M. Gover, S. Balke, S. Seyfarth, A. Malek, C. Raffel, V. Lostanlen, B. van Niekirk, D. Lee, F. Cwitkowitz, F. Zalkow, O. Nieto, D. Ellis, J. Mason, K. Lee, B. Steers, E. Halvachs, C. Thomé, F. Robert-Stöter, R. Bittner, Z. Wei, A. Weiss, E. Battenberg, K. Choi, R. Yamamoto, C. Carr, A. Metsai, S. Sullivan, P. Friesch, A. Krishnakumar, S. Hidaka, S. Kowalik, F. Keller, D. Mazur, A. Chabot-Leclerc, C. Hawthorne, C. Ramaprasad, M. Keum, J. Gomez, W. Monroe, V. A. Morozov, K. Eliasi, nullmightybofo, P. Biberstein, N. D. Sergin, R. Hennequin, R. Naktinis, beantowel, T. Kim, J. P. Åsen, J. Lim, A. Malins, D. Hereñú, S. van der Struijk, L. Nickel, J. Wu, Z. Wang, T. Gates, M. Vollrath, A. Sarroff, Xiao-Ming, A. Porter, S. Kranzler, Voodoohop, M. D. Gangi, H. Jinoz, C. Guerrero, A. Mazhar, toddrme2178, Z. Baratz, A. Kostin, X. Zhuang, C. T. Lo, P. Campr, E. Semeniuc, M. Biswal, S. Moura, P. Brossier, H. Lee, and W. Pimenta, ‘‘librosa/librosa: 0.10.1,’’ Aug. 2023.	spa
dc.relation.references	G. R. Lee, R. Gommers, F. Waselewski, K. Wohlfahrt, and A. O8217;Leary, ‘‘Pywavelets: A python package for wavelet analysis,’’ Journal of Open Source Software, vol. 4, no. 36, p. 1237, 2019	spa
dc.relation.references	F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenho- fer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay, ‘‘Scikit-learn: Machine learning in Python,’’ Journal of Machine Learning Research, vol. 12, pp. 2825--2830, 2011.	spa
dc.rights.accessrights	info:eu-repo/semantics/openAccess	spa
dc.rights.license	Atribución-NoComercial-SinDerivadas 4.0 Internacional	spa
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	spa
dc.subject.ddc	620 - Ingeniería y operaciones afines	spa
dc.subject.lemb	Seguridad ciudadana - Medellín (Antioquia, Colombia)
dc.subject.lemb	Prevención del delito - Medellín (Antioquia, Colombia)
dc.subject.lemb	Innovaciones tecnológicas - Medellín (Antioquia, Colombia)
dc.subject.proposal	Seguridad Ciudadana	spa
dc.subject.proposal	Transformada Wavelet	spa
dc.subject.proposal	Señal de Audio	spa
dc.subject.proposal	Modos de Correlación Wavelet	spa
dc.subject.proposal	Aprendizaje de Máquina	spa
dc.subject.proposal	Citizen security	eng
dc.subject.proposal	Wavelet Transform	eng
dc.subject.proposal	Audio signal	eng
dc.subject.proposal	Wavelet Correlation Modes	eng
dc.subject.proposal	Machine Learning	eng
dc.title	Identificación de eventos delictivos a partir de señales de audio utilizando modos de correlación wavelet	spa
dc.title.translated	Identification of crime events from audio signals using wavelet correlation modes	eng
dc.type	Trabajo de grado - Maestría	spa
dc.type.coar	http://purl.org/coar/resource_type/c_bdcc	spa
dc.type.coarversion	http://purl.org/coar/version/c_ab4af688f83e57aa	spa
dc.type.content	Text	spa
dc.type.driver	info:eu-repo/semantics/masterThesis	spa
dc.type.redcol	http://purl.org/redcol/resource_type/TM	spa
dc.type.version	info:eu-repo/semantics/acceptedVersion	spa
dcterms.audience.professionaldevelopment	Estudiantes	spa
dcterms.audience.professionaldevelopment	Investigadores	spa
dcterms.audience.professionaldevelopment	Público general	spa
dcterms.audience.professionaldevelopment	Responsables políticos	spa
oaire.accessrights	http://purl.org/coar/access_right/c_abf2	spa
oaire.awardtitle	Identificación de eventos delictivos a partir de señales de audio utilizando modos de correlación wavelet	spa
oaire.fundername	Fondo de Ciencia, Tecnología e Innovación (FCTeI)	spa

Archivos

Bloque original

Mostrando 1 - 1 de 1

Nombre:: 1152199519.2024.pdf
Tamaño:: 1.31 MB
Formato:: Adobe Portable Document Format
Descripción:: Tesis de Maestría en Ingeniería - Automatización Industrial

Descargar

Bloque de licencias

Mostrando 1 - 1 de 1

Nombre:: license.txt
Tamaño:: 5.74 KB
Formato:: Item-specific license agreed upon to submission
Descripción:

Descargar

Colecciones

Maestría en Ingeniería - Automatización Industrial