Hierarchical multi-label classification methods for gene function prediction

dc.contributor.advisorRocha, Camilo
dc.contributor.advisorFinke, Jorge
dc.contributor.authorRomero González , Miguel Ángel
dc.date.accessioned2024-06-09T15:32:31Z
dc.date.available2024-06-09T15:32:31Z
dc.date.issued2022
dc.description.abstractengThis dissertation studies the problem of predicting gene functions from a computational approach. The goal of this problem is to predict associations between genes and functions, where genes can be associated to multiple biological functions and functions have a hierarchical organization. Four machine learning methods are developed focusing on different aspects of the problem, which has been modeled as a classification task: (a) considering hierarchical relations between functions to produce consistent predictions; (b) creating new data representations to built predictive models; (c) exploiting paths of functions in the hierarchy to detect missing annotations of genes; and (d) integrating information available for multiple organisms into the classification task. The main contributions of this work include novel methods that (i) overcome the limitations of the combinatorial gene function prediction problem; (ii) can be used to effectively identify associations between genes and functions of different organisms, including those that do not have enough data available to train predictive models; and (iii) help to narrow down the search space for in vivo experiments. These methods have been tested in efforts to predict gene functions in rice and maize, but have been formulated more generally and are applicable to any multi-label classification problem where the classes are organized into a hierarchy.
dc.format.extent132 p.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://vitela.javerianacali.edu.co/handle/11522/2088
dc.language.isoeng
dc.publisherPontificia Universidad Javeriana Cali
dc.rights.accessrightshttp://purl.org/coar/access_right/c_abf2
dc.rights.creativecommonshttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.thesis.disciplineFacultad de Ingeniería y Ciencias. Doctorado en Ingeniería y Ciencias Aplicadas
dc.thesis.grantorPontificia Universidad Javeriana Cali
dc.thesis.levelDoctorado
dc.titleHierarchical multi-label classification methods for gene function predictioneng
dc.type.coarhttp://purl.org/coar/resource_type/c_db06
dc.type.localTesis/Trabajo de grado - Monografía - Doctorado
dc.type.redcolhttps://purl.org/redcol/resource_type/TD
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
MiguelRomero_Tesis.pdf
Size:
1.72 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
Licencia_autorizacion_biblioteca.docx(1).pdf
Size:
116.47 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: