Efficient Storage of Genomic Sequences in High Performance Computing Systems

ABSTRACT: In this dissertation, we address the challenges of genomic data storage in high performance computing systems. In particular, we focus on developing a referential compression approach for Next Generation Sequence data stored in FASTQ format files. The amount of genomic data available for r...

Full description

Autores:
Guerra Soler, Aníbal José
Tipo de recurso:
Doctoral thesis
Fecha de publicación:
2019
Institución:
Universidad de Antioquia
Repositorio:
Repositorio UdeA
Idioma:
spa
OAI Identifier:
oai:bibliotecadigital.udea.edu.co:10495/12525
Acceso en línea:
http://hdl.handle.net/10495/12525
Palabra clave:
Performance - evaluation
Genomic sequences
Parallel computing
Reads alignment
Reads compression
Referential compression
SIMD programming
http://id.loc.gov/authorities/subjects/sh2010105499
Rights
openAccess
License
Atribución-NoComercial-SinDerivadas 2.5 Colombia (CC BY-NC-ND 2.5 CO)