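Implementation of an algorithm for identifying synteny blocks from alignments in NGSEP (original title: Implementación de un algoritmo de identificación de bloques de sintenia a partir de alineamientos en ngsep)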
- Authors:
- Ángel Villadiego, Ricardo Andrés
- Resource type:
- Undergraduate thesis
- Publication date:
- 2020
- Institution:
- Universidad de los Andes
- Repository:
- Séneca: repositorio Uniandes
- Language:
- spa
- OAI Identifier:
- oai:repositorio.uniandes.edu.co:1992/51497
- Online access:
- http://hdl.handle.net/1992/51497
- Keywords:
- Graphical user interfaces (Computer systems)
Software engineering
Software architecture
Algorithms (Computers)
Genomics
Computational biology
Genetic code
Engineering
- Rights
- openAccess
- License
- http://creativecommons.org/licenses/by-nc-nd/4.0/
Summary: Synteny blocks are sets of similar regions between genomes or within the same genome. Identifying these blocks enables studies of genome evolution. Recent work in comparative genomics shows interest in synteny block identification tools that are easy to use and to integrate into existing workflows. Although several tools for identifying synteny blocks exist, integrating them into workflows is difficult. For this reason, this functionality was implemented in the NGSEP software. Different tools currently used to identify synteny blocks were explored, and the HalSynteny algorithm was selected and implemented in NGSEP. This algorithm constructs a directed acyclic graph from the alignments and, from it, finds the synteny blocks, using as parameters the distance between alignments and the total length of the aligned regions that make up a synteny block. The execution times of the NGSEP implementation and HalSynteny were compared, and a sensitivity and specificity analysis was performed. In the tests, the NGSEP implementation had a much shorter execution time and achieved higher sensitivity than HalSynteny. Both tools show similar specificity when run at their maximum sensitivity.
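The graph-based idea described in the summary can be illustrated with a minimal Python sketch. It is not the code of NGSEP or HalSynteny. It assumes same-strand alignments with positive length between one pair of sequences, and it assumes blocks are extracted by repeatedly taking the heaviest path in the graph. The names `Alignment`, `can_follow`, `best_chain`, `find_synteny_blocks`, `max_distance`, and `min_block_length` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Hypothetical record for one same-strand local alignment."""
    q_start: int  # start in the first genome (inclusive)
    q_end: int    # end in the first genome (exclusive)
    t_start: int  # start in the second genome (inclusive)
    t_end: int    # end in the second genome (exclusive)

    @property
    def length(self) -> int:
        return self.q_end - self.q_start


def can_follow(a, b, max_distance):
    """Edge a -> b: b lies downstream of a in both genomes, within max_distance."""
    q_gap = b.q_start - a.q_end
    t_gap = b.t_start - a.t_end
    return 0 <= q_gap <= max_distance and 0 <= t_gap <= max_distance


def best_chain(alignments, max_distance):
    """Heaviest path in the implicit DAG, weighted by aligned length.

    Sorting by start position gives a topological order, because every edge
    points to an alignment that starts after its predecessor ends.
    """
    order = sorted(alignments, key=lambda a: (a.q_start, a.q_end))
    score = [a.length for a in order]
    prev = [None] * len(order)
    for j, b in enumerate(order):
        for i in range(j):
            if can_follow(order[i], b, max_distance) and score[i] + b.length > score[j]:
                score[j] = score[i] + b.length
                prev[j] = i
    end = max(range(len(order)), key=score.__getitem__)
    chain, k = [], end
    while k is not None:  # walk the predecessor links back to the chain start
        chain.append(order[k])
        k = prev[k]
    chain.reverse()
    return chain, score[end]


def find_synteny_blocks(alignments, max_distance, min_block_length):
    """Repeatedly extract the heaviest chain until none reaches min_block_length."""
    remaining = list(alignments)
    blocks = []
    while remaining:
        chain, total = best_chain(remaining, max_distance)
        if total < min_block_length:
            break  # the heaviest chain is too short, so every other chain is too
        blocks.append(chain)
        used = set(chain)
        remaining = [a for a in remaining if a not in used]
    return blocks
```

Under these assumptions, calling `find_synteny_blocks(alignments, max_distance=100, min_block_length=200)` returns a list of blocks, each an ordered list of alignments whose neighbouring gaps do not exceed `max_distance` and whose total aligned length is at least `min_block_length`. The quadratic edge scan keeps the sketch short; a production implementation would likely index alignments by position instead.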