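Implementation of an algorithm for identifying synteny blocks from alignments in NGSEP (original title: Implementación de un algoritmo de identificación de bloques de sintenia a partir de alineamientos en ngsep)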

Authors:
Ángel Villadiego, Ricardo Andrés
Resource type:
Undergraduate thesis
Publication date:
2020
Institution:
Universidad de los Andes
Repository:
Séneca: repositorio Uniandes
Language:
spa
OAI Identifier:
oai:repositorio.uniandes.edu.co:1992/51497
Online access:
http://hdl.handle.net/1992/51497
Keywords:
Graphical user interfaces (Computer systems)
Software engineering
Software architecture
Algorithms (Computers)
Genomics
Computational biology
Genetic code
Engineering
Rights
openAccess
License
http://creativecommons.org/licenses/by-nc-nd/4.0/
Description
Summary: Synteny blocks are sets of similar regions between genomes or within the same genome. Identifying these blocks enables studies of genome evolution. Recent work in comparative genomics shows interest in synteny block identification tools that are easy to use and to integrate into existing workflows. Although several tools for identifying synteny blocks exist, integrating them into workflows is difficult. For this reason, the functionality was implemented in the NGSEP software. Several tools currently used to identify synteny blocks were explored, and the HalSynteny algorithm was selected and implemented in NGSEP. The algorithm constructs a directed acyclic graph from the alignments and then finds the synteny blocks, using as parameters the distance between alignments and the total length of the aligned regions that make up a synteny block. The execution times of the NGSEP implementation and HalSynteny were compared, and a sensitivity and specificity analysis was performed. In these tests, the NGSEP implementation had a much shorter execution time and achieved higher sensitivity than HalSynteny. The two tools show similar specificity when each operates at its maximum sensitivity.
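Illustrative sketch: the graph-based idea described in the summary can be approximated in a few lines of Python. This is not the NGSEP or HalSynteny code. The names `Alignment`, `synteny_blocks`, `max_distance` and `min_block_length` are hypothetical, only same-strand alignments are handled, and the heaviest-chain dynamic programming used to pick blocks is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Alignment:
    """One pairwise alignment (hypothetical type, forward strand only)."""
    q_start: int
    q_end: int
    t_start: int
    t_end: int

    @property
    def length(self) -> int:
        return self.q_end - self.q_start


def synteny_blocks(alignments: List[Alignment],
                   max_distance: int,
                   min_block_length: int) -> List[List[Alignment]]:
    # Sorting by query start gives a topological order: an edge a -> b
    # requires b to start at or after the end of a, so the graph is acyclic.
    nodes = sorted(alignments, key=lambda a: (a.q_start, a.t_start))
    n = len(nodes)
    best = [a.length for a in nodes]  # heaviest chain weight ending at each node
    prev = [-1] * n                   # predecessor on that chain

    for j in range(n):
        for i in range(j):
            a, b = nodes[i], nodes[j]
            q_gap = b.q_start - a.q_end
            t_gap = b.t_start - a.t_end
            # Edge only if b follows a collinearly and within max_distance.
            if 0 <= q_gap <= max_distance and 0 <= t_gap <= max_distance:
                if best[i] + b.length > best[j]:
                    best[j] = best[i] + b.length
                    prev[j] = i

    # Extract chains from the heaviest end node down, using each node once.
    used = [False] * n
    blocks = []
    for end in sorted(range(n), key=lambda k: best[k], reverse=True):
        chain = []
        k = end
        while k != -1 and not used[k]:
            chain.append(k)
            used[k] = True
            k = prev[k]
        chain.reverse()
        total = sum(nodes[k].length for k in chain)
        # Keep the chain only if its total aligned length is large enough.
        if chain and total >= min_block_length:
            blocks.append([nodes[k] for k in chain])
    return blocks
```

In this sketch, a larger `max_distance` lets more distant alignments join the same block, and a larger `min_block_length` discards short, isolated chains. These two values play the roles of the distance and total-length parameters mentioned in the summary.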