Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants

Given the importance of the production of ruminants, it is necessary to investigate the genetic variants associated with the traits of economic interest in these animals, as well as the biology underlying the genotype-phenotype associations. To conduct these associations, a widely used strategy is t...

Full description

Autores:
Sepúlveda Molina, Boris Julián
Tipo de recurso:
Fecha de publicación:
2019
Institución:
Universidad Nacional de Colombia
Repositorio:
Universidad Nacional de Colombia
Idioma:
spa
OAI Identifier:
oai:repositorio.unal.edu.co:unal/76444
Acceso en línea:
https://repositorio.unal.edu.co/handle/unal/76444
http://bdigital.unal.edu.co/72840/
Palabra clave:
Genome-wide association studies
Single nucleotide polymorphism
Annotation
Ruminants
Estudios de asociación del genoma completo
Polimorfismo de nucleótido simple
Anotación
Rumiantes
Rights
openAccess
License
Atribución-NoComercial 4.0 Internacional
id UNACIONAL2_0ee4c868c4fc04c6dccf0b07f2799b93
oai_identifier_str oai:repositorio.unal.edu.co:unal/76444
network_acronym_str UNACIONAL2
network_name_str Universidad Nacional de Colombia
repository_id_str
dc.title.spa.fl_str_mv Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
title Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
spellingShingle Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
Genome-wide association studies
Single nucleotide polymorphism
Annotation
Ruminants
Estudios de asociación del genoma completo
Polimorfismo de nucleótido simple
Anotación
Rumiantes
title_short Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
title_full Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
title_fullStr Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
title_full_unstemmed Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
title_sort Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants
dc.creator.fl_str_mv Sepúlveda Molina, Boris Julián
dc.contributor.author.spa.fl_str_mv Sepúlveda Molina, Boris Julián
dc.contributor.spa.fl_str_mv López Kleine, Liliana
Ariza Botero, Manuel Fernando
Castro, Susan
Ortiz, Turany Teresa
Amaya, Angela Marlen
Rincon Soledad, Edicson Mauricio
Romero, Liza
Gómez Parrado, Yenny Milena del Pilar
dc.subject.proposal.spa.fl_str_mv Genome-wide association studies
Single nucleotide polymorphism
Annotation
Ruminants
Estudios de asociación del genoma completo
Polimorfismo de nucleótido simple
Anotación
Rumiantes
topic Genome-wide association studies
Single nucleotide polymorphism
Annotation
Ruminants
Estudios de asociación del genoma completo
Polimorfismo de nucleótido simple
Anotación
Rumiantes
description Given the importance of the production of ruminants, it is necessary to investigate the genetic variants associated with the traits of economic interest in these animals, as well as the biology underlying the genotype-phenotype associations. To conduct these associations, a widely used strategy is to perform genome-wide association studies (GWAS). The GWAS must have the support of adequate quality control (QC), to then identify the associations between genetic markers type SNP and phenotypes. Additionally, the biological contextualization of these associations starts from the annotation of the genes close to the associated markers. Currently, there are several tools, including R libraries, to perform these analyses. However, it is necessary to develop a tool that allows unifying the three main steps (QC, GWAS, and annotation) for species other than human. For the above, the present work developed a methodology that unified the three mentioned steps in the R environment. The generated code was submitted for publication and is freely available in the repository https://github.com/bojusemo/Diploid-GWAS. The code was tested in two populations of ruminants, the Colombian Creole Hair Sheep and Simmental cattle. In these populations, the SNPs with low quality were removed, there was no detected population stratification, and no samples were removed for low quality. The SNP OAR26_10469468.1 was associated with the meat tenderness of Colombian Creole hair sheep. This SNP is in the gene TENM3. TENM3 protein has two domains with functions associated with meat tenderness in cattle and pigs. The SNP BovineHD4100012055 was associated with birth weight in Simmental. The closest gene to this SNP is the olfactory receptor 52E8-like, which is a member of the protein family G protein-coupled receptor (GPCR). GPCR has associated with birth weight in humans. Six markers were associated with 305-day milk yield in Simmental. Neither the closest genes of these markers nor their protein domains have been reported as associated with milk production.
publishDate 2019
dc.date.issued.spa.fl_str_mv 2019-06-19
dc.date.accessioned.spa.fl_str_mv 2020-03-30T06:20:03Z
dc.date.available.spa.fl_str_mv 2020-03-30T06:20:03Z
dc.type.spa.fl_str_mv Trabajo de grado - Maestría
dc.type.driver.spa.fl_str_mv info:eu-repo/semantics/masterThesis
dc.type.version.spa.fl_str_mv info:eu-repo/semantics/acceptedVersion
dc.type.content.spa.fl_str_mv Text
dc.type.redcol.spa.fl_str_mv http://purl.org/redcol/resource_type/TM
status_str acceptedVersion
dc.identifier.uri.none.fl_str_mv https://repositorio.unal.edu.co/handle/unal/76444
dc.identifier.eprints.spa.fl_str_mv http://bdigital.unal.edu.co/72840/
url https://repositorio.unal.edu.co/handle/unal/76444
http://bdigital.unal.edu.co/72840/
dc.language.iso.spa.fl_str_mv spa
language spa
dc.relation.ispartof.spa.fl_str_mv Universidad Nacional de Colombia Sede Bogotá Facultad de Ingeniería Departamento de Ingeniería de Sistemas e Industrial
Departamento de Ingeniería de Sistemas e Industrial
dc.relation.haspart.spa.fl_str_mv 02 Bibliotecología y ciencias de la información / Library and information sciences
57 Ciencias de la vida; Biología / Life sciences; biology
59 Animales / Animals
dc.relation.references.spa.fl_str_mv Sepúlveda Molina, Boris Julián (2019) Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants. Maestría thesis, Universidad Nacional de Colombia - Sede Bogotá.
dc.rights.spa.fl_str_mv Derechos reservados - Universidad Nacional de Colombia
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.rights.license.spa.fl_str_mv Atribución-NoComercial 4.0 Internacional
dc.rights.uri.spa.fl_str_mv http://creativecommons.org/licenses/by-nc/4.0/
dc.rights.accessrights.spa.fl_str_mv info:eu-repo/semantics/openAccess
rights_invalid_str_mv Atribución-NoComercial 4.0 Internacional
Derechos reservados - Universidad Nacional de Colombia
http://creativecommons.org/licenses/by-nc/4.0/
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.mimetype.spa.fl_str_mv application/pdf
institution Universidad Nacional de Colombia
bitstream.url.fl_str_mv https://repositorio.unal.edu.co/bitstream/unal/76444/1/Documento%20de%20tesis.pdf
https://repositorio.unal.edu.co/bitstream/unal/76444/2/Documento%20de%20tesis.pdf.jpg
bitstream.checksum.fl_str_mv 1da5696d46f17f7b6553a31b798a347b
00a8a172e0537a03120d55caf9ffdb2f
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositorio Institucional Universidad Nacional de Colombia
repository.mail.fl_str_mv repositorio_nal@unal.edu.co
_version_ 1814089449609363456
spelling Atribución-NoComercial 4.0 InternacionalDerechos reservados - Universidad Nacional de Colombiahttp://creativecommons.org/licenses/by-nc/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2López Kleine, LilianaAriza Botero, Manuel FernandoCastro, SusanOrtiz, Turany TeresaAmaya, Angela MarlenRincon Soledad, Edicson MauricioRomero, LizaGómez Parrado, Yenny Milena del PilarSepúlveda Molina, Boris Juliána057e466-92a1-4c71-96be-fa9ed25802b23002020-03-30T06:20:03Z2020-03-30T06:20:03Z2019-06-19https://repositorio.unal.edu.co/handle/unal/76444http://bdigital.unal.edu.co/72840/Given the importance of the production of ruminants, it is necessary to investigate the genetic variants associated with the traits of economic interest in these animals, as well as the biology underlying the genotype-phenotype associations. To conduct these associations, a widely used strategy is to perform genome-wide association studies (GWAS). The GWAS must have the support of adequate quality control (QC), to then identify the associations between genetic markers type SNP and phenotypes. Additionally, the biological contextualization of these associations starts from the annotation of the genes close to the associated markers. Currently, there are several tools, including R libraries, to perform these analyses. However, it is necessary to develop a tool that allows unifying the three main steps (QC, GWAS, and annotation) for species other than human. For the above, the present work developed a methodology that unified the three mentioned steps in the R environment. The generated code was submitted for publication and is freely available in the repository https://github.com/bojusemo/Diploid-GWAS. The code was tested in two populations of ruminants, the Colombian Creole Hair Sheep and Simmental cattle. In these populations, the SNPs with low quality were removed, there was no detected population stratification, and no samples were removed for low quality. The SNP OAR26_10469468.1 was associated with the meat tenderness of Colombian Creole hair sheep. This SNP is in the gene TENM3. TENM3 protein has two domains with functions associated with meat tenderness in cattle and pigs. The SNP BovineHD4100012055 was associated with birth weight in Simmental. The closest gene to this SNP is the olfactory receptor 52E8-like, which is a member of the protein family G protein-coupled receptor (GPCR). GPCR has associated with birth weight in humans. Six markers were associated with 305-day milk yield in Simmental. Neither the closest genes of these markers nor their protein domains have been reported as associated with milk production.Resumen: Dada la importancia que tiene la producción de rumiantes, es necesario investigar las variantes genéticas asociadas a las características de interés comercial de dichos animales, así como la biología subyacente a esas asociaciones genotipo-fenotipo. Para hacer dichas asociaciones, una estrategia ampliamente utilizada es realizar estudios de asociación del genoma completo (GWAS). Los GWAS deben partir de un filtro adecuado de la información de las variables y de los individuos, denominado control de calidad (QC), para luego identificar las asociaciones entre marcadores genéticos tipo SNP y los fenotipos. Por su parte, la contextualización biológica de estas asociaciones parte de la anotación de los genes cercanos a los marcadores asociados. Para realizar estos análisis, actualmente hay varias herramientas, incluidas librerías de R. Sin embargo, falta desarrollar una herramienta que permita unificar los tres principales pasos (QC, GWAS y anotación) para datos de especies distintas al humano en R. Por lo anterior, el presente trabajo desarrolló una metodología que unificó en el entorno de R los tres pasos mencionados. El código generado se sometió a publicación y se encuentran disponibles de manera libre en el repositorio https://github.com/bojusemo/Diploid-GWAS. El código fue probado en dos poblaciones de rumiantes, el Ovino de Pelo Criollo Colombiano y los bovinos Simmental. En estas poblaciones, se eliminaron los SNPs con una baja calidad, no se detectó estratificación poblacional y no se eliminaron muestras por baja calidad. El SNP OAR26_10469468.1 estuvo asociado con la terneza de la carne del Ovino de Pelo Criollo Colombiano. Éste SNP está en el gen TENM3. La proteína TENM3 tiene dos dominios con funciones asociadas con la terneza de la carne en bovinos y porcinos. El SNP BovineHD4100012055 estuvo asociado con el peso al nacimiento de Simmental. El gen más cercano a este SNP es el olfactory receptor 52E8-like, que pertenece a la familia de proteínas G protein-coupled receptor (GPCR). Se ha reportado asociación entre GPCR y el peso al nacimiento en humanos. Seis marcadores estuvieron asociados a la producción de leche a los 305 días en Simmental. Ni los genes más cercanos a los marcadores, ni los dominios de las proteínas han sido reportados como asociados con la producción de leche.Maestríaapplication/pdfspaUniversidad Nacional de Colombia Sede Bogotá Facultad de Ingeniería Departamento de Ingeniería de Sistemas e IndustrialDepartamento de Ingeniería de Sistemas e Industrial02 Bibliotecología y ciencias de la información / Library and information sciences57 Ciencias de la vida; Biología / Life sciences; biology59 Animales / AnimalsSepúlveda Molina, Boris Julián (2019) Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminants. Maestría thesis, Universidad Nacional de Colombia - Sede Bogotá.Standardization of a methodology for identification and annotation of associations between single nucleotide polymorphisms and highly polygenic traits in ruminantsTrabajo de grado - Maestríainfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/acceptedVersionTexthttp://purl.org/redcol/resource_type/TMGenome-wide association studiesSingle nucleotide polymorphismAnnotationRuminantsEstudios de asociación del genoma completoPolimorfismo de nucleótido simpleAnotaciónRumiantesORIGINALDocumento de tesis.pdfapplication/pdf2838815https://repositorio.unal.edu.co/bitstream/unal/76444/1/Documento%20de%20tesis.pdf1da5696d46f17f7b6553a31b798a347bMD51THUMBNAILDocumento de tesis.pdf.jpgDocumento de tesis.pdf.jpgGenerated Thumbnailimage/jpeg6232https://repositorio.unal.edu.co/bitstream/unal/76444/2/Documento%20de%20tesis.pdf.jpg00a8a172e0537a03120d55caf9ffdb2fMD52unal/76444oai:repositorio.unal.edu.co:unal/764442023-07-14 23:03:28.496Repositorio Institucional Universidad Nacional de Colombiarepositorio_nal@unal.edu.co