Automatic authorship analysis using Deep neural networks

dc.contributorGonzález Osorio, Fabio Augustospa
dc.contributor.authorSierra Loaiza, Sebastian Ernestospa
dc.date.accessioned2020-03-30T06:28:02Zspa
dc.date.available2020-03-30T06:28:02Zspa
dc.date.issued2018-08-31spa
dc.description.abstractAuthorship analysis helps to study the characteristics that distinguish how two different persons write. Writing style can be extracted in several ways, like using bag of words strategies or handcrafted features. However, with the growing of Internet, we have been able to witness an increase in the amount of user generated data in social networks like Facebook or Twitter. There is an increasing need in generating automatic methods capable of analyzing the style of a document for tasks like: determining the age of the author, determining the gender of the author, determining the authorship of the document given a set of possible authors, etc. Previous tasks are better known as author profiling and authorship attribution. Although capturing the style of an author can be a challenging task, in this thesis we explore representation learning strategies, in order to take advantage of the large amount of data generated by social media. In this thesis, we learned proper representations for the text inputs that were able to learn such patterns that are only distinguishable to an author (authorship attribution) or a social group of authors (author profiling). Proposed methods were compared using different publicly available datasets using social media data. Both author profiling and authorship attribution tasks are addressed using representation learning techniques such as convolutional neural networks and gated multimodal units. Our unimodal author profiling approach was submitted to the profiling shared task of the laboratory on digital forensics and stylometry(PAN). For authorship attribution, we proposed a convolutional neural network using character n-grams as input. We found that our approach outperformed standard attribution based methods as well as word based convolutional neural networks. For the author profiling task, we proposed one convolutional neural network for unimodal author profiling and adapted a gated multimodal unit for multimodal author profiling. The multimodal nature of user generated content consists of a scenario where the social group of an author can be determined not only using his/her written texts but using also the images that the user shared across the social networks. Gated multimodal units outperformed standard information fusion strategies: early and late fusion.spa
dc.description.degreelevelMaestríaspa
dc.format.mimetypeapplication/pdfspa
dc.identifier.eprintshttp://bdigital.unal.edu.co/73495/spa
dc.identifier.urihttps://repositorio.unal.edu.co/handle/unal/76747
dc.language.isospaspa
dc.relationhttp://www.ingenieria.unal.edu.co/mindlab/spa
dc.relation.haspart0 Generalidades / Computer science, information and general worksspa
dc.relation.haspart62 Ingeniería y operaciones afines / Engineeringspa
dc.relation.ispartofUniversidad Nacional de Colombia Sede Bogotá Facultad de Ingeniería Departamento de Ingeniería de Sistemas e Industrial Ingeniería de Sistemasspa
dc.relation.ispartofIngeniería de Sistemasspa
dc.relation.referencesSierra Loaiza, Sebastian Ernesto (2018) Automatic authorship analysis using Deep neural networks. Maestría thesis, Universidad Nacional de Colombia - Sede Bogotá.spa
dc.rightsDerechos reservados - Universidad Nacional de Colombiaspa
dc.rights.accessrightsinfo:eu-repo/semantics/openAccessspa
dc.rights.licenseAtribución-NoComercial 4.0 Internacionalspa
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/spa
dc.subject.proposalMachine learningspa
dc.subject.proposalSupervised Learningspa
dc.subject.proposalRepresentation Learningspa
dc.subject.proposalAutomatic Authorship Analysisspa
dc.subject.proposalAuthorship Attributionspa
dc.subject.proposalAuthor Profilingspa
dc.subject.proposalMultimodal Author Profilingspa
dc.titleAutomatic authorship analysis using Deep neural networksspa
dc.typeTrabajo de grado - Maestríaspa
dc.type.coarhttp://purl.org/coar/resource_type/c_bdccspa
dc.type.coarversionhttp://purl.org/coar/version/c_ab4af688f83e57aaspa
dc.type.contentTextspa
dc.type.driverinfo:eu-repo/semantics/masterThesisspa
dc.type.redcolhttp://purl.org/redcol/resource_type/TMspa
dc.type.versioninfo:eu-repo/semantics/acceptedVersionspa
oaire.accessrightshttp://purl.org/coar/access_right/c_abf2spa

Archivos

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
SebastianSierraLoaiza.2019.pdf
Tamaño:
2.37 MB
Formato:
Adobe Portable Document Format