A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain

Abstract The exponential growth in the amount of available data has posed new challenges to re- searchers. Search on such amount of data is a difficult task which turns even harder when the data belongs to a specific domain which has its own terminology and requires some background knowledge. Tradit...

Full description

Autores:
Riveros Cruz, Luis Alejandro
Tipo de recurso:
Fecha de publicación:
2015
Institución:
Universidad Nacional de Colombia
Repositorio:
Universidad Nacional de Colombia
Idioma:
spa
OAI Identifier:
oai:repositorio.unal.edu.co:unal/54814
Acceso en línea:
https://repositorio.unal.edu.co/handle/unal/54814
http://bdigital.unal.edu.co/49994/
Palabra clave:
02 Bibliotecología y ciencias de la información / Library and information sciences
6 Tecnología (ciencias aplicadas) / Technology
62 Ingeniería y operaciones afines / Engineering
Information retrieval
Ontology
Nowledge bases
Búsqueda de información
Ontología
Rights
openAccess
License
Atribución-NoComercial 4.0 Internacional
id UNACIONAL2_d98887bb0345ddefff670b07a923a59d
oai_identifier_str oai:repositorio.unal.edu.co:unal/54814
network_acronym_str UNACIONAL2
network_name_str Universidad Nacional de Colombia
repository_id_str
dc.title.spa.fl_str_mv A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
title A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
spellingShingle A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
02 Bibliotecología y ciencias de la información / Library and information sciences
6 Tecnología (ciencias aplicadas) / Technology
62 Ingeniería y operaciones afines / Engineering
Information retrieval
Ontology
Nowledge bases
Búsqueda de información
Ontología
title_short A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
title_full A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
title_fullStr A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
title_full_unstemmed A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
title_sort A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain
dc.creator.fl_str_mv Riveros Cruz, Luis Alejandro
dc.contributor.author.spa.fl_str_mv Riveros Cruz, Luis Alejandro
dc.contributor.spa.fl_str_mv González Osorio, Fabio Augusto
dc.subject.ddc.spa.fl_str_mv 02 Bibliotecología y ciencias de la información / Library and information sciences
6 Tecnología (ciencias aplicadas) / Technology
62 Ingeniería y operaciones afines / Engineering
topic 02 Bibliotecología y ciencias de la información / Library and information sciences
6 Tecnología (ciencias aplicadas) / Technology
62 Ingeniería y operaciones afines / Engineering
Information retrieval
Ontology
Nowledge bases
Búsqueda de información
Ontología
dc.subject.proposal.spa.fl_str_mv Information retrieval
Ontology
Nowledge bases
Búsqueda de información
Ontología
description Abstract The exponential growth in the amount of available data has posed new challenges to re- searchers. Search on such amount of data is a difficult task which turns even harder when the data belongs to a specific domain which has its own terminology and requires some background knowledge. Traditional information retrieval systems are based on keywords. In this kind of systems the output for a given query is a ranking of the documents that match the keywords. This model works well in scenarios with few documents or if the system achieves a high perfor- mance ensuring that the first results contain the most relevant documents. However, in most cases the collections are huge and the retrieval results are an endless list of documents that must be scanned manually. This work proposes an information retrieval approach which incorporates domain specific knowledge from an ontology within the traditional information retrieval model in order to overcome some of its limitations. The domain knowledge is used to add semantic capabili- ties and to provide the user with an enriched interface which includes metadata about the retrieved results, thus facilitating its exploration and filtering
publishDate 2015
dc.date.issued.spa.fl_str_mv 2015-07-01
dc.date.accessioned.spa.fl_str_mv 2019-06-29T21:40:48Z
dc.date.available.spa.fl_str_mv 2019-06-29T21:40:48Z
dc.type.spa.fl_str_mv Trabajo de grado - Maestría
dc.type.driver.spa.fl_str_mv info:eu-repo/semantics/masterThesis
dc.type.version.spa.fl_str_mv info:eu-repo/semantics/acceptedVersion
dc.type.content.spa.fl_str_mv Text
dc.type.redcol.spa.fl_str_mv http://purl.org/redcol/resource_type/TM
status_str acceptedVersion
dc.identifier.uri.none.fl_str_mv https://repositorio.unal.edu.co/handle/unal/54814
dc.identifier.eprints.spa.fl_str_mv http://bdigital.unal.edu.co/49994/
url https://repositorio.unal.edu.co/handle/unal/54814
http://bdigital.unal.edu.co/49994/
dc.language.iso.spa.fl_str_mv spa
language spa
dc.relation.ispartof.spa.fl_str_mv Universidad Nacional de Colombia Sede Bogotá Facultad de Ingeniería Departamento de Ingeniería de Sistemas e Industrial Ingeniería de Sistemas
Ingeniería de Sistemas
dc.relation.references.spa.fl_str_mv Riveros Cruz, Luis Alejandro (2015) A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain. Maestría thesis, Universidad Nacional de Colombia.
dc.rights.spa.fl_str_mv Derechos reservados - Universidad Nacional de Colombia
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.rights.license.spa.fl_str_mv Atribución-NoComercial 4.0 Internacional
dc.rights.uri.spa.fl_str_mv http://creativecommons.org/licenses/by-nc/4.0/
dc.rights.accessrights.spa.fl_str_mv info:eu-repo/semantics/openAccess
rights_invalid_str_mv Atribución-NoComercial 4.0 Internacional
Derechos reservados - Universidad Nacional de Colombia
http://creativecommons.org/licenses/by-nc/4.0/
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.mimetype.spa.fl_str_mv application/pdf
institution Universidad Nacional de Colombia
bitstream.url.fl_str_mv https://repositorio.unal.edu.co/bitstream/unal/54814/1/80056739.2015.pdf
https://repositorio.unal.edu.co/bitstream/unal/54814/2/80056739.2015.pdf.jpg
bitstream.checksum.fl_str_mv 91ac51f0e6c8b3d7c5defc5cc450442e
6e0aa03544eed764700412037f127114
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositorio Institucional Universidad Nacional de Colombia
repository.mail.fl_str_mv repositorio_nal@unal.edu.co
_version_ 1806885980580347904
spelling Atribución-NoComercial 4.0 InternacionalDerechos reservados - Universidad Nacional de Colombiahttp://creativecommons.org/licenses/by-nc/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2González Osorio, Fabio AugustoRiveros Cruz, Luis Alejandro71a62882-f745-4f62-ab5d-8c5b263769273002019-06-29T21:40:48Z2019-06-29T21:40:48Z2015-07-01https://repositorio.unal.edu.co/handle/unal/54814http://bdigital.unal.edu.co/49994/Abstract The exponential growth in the amount of available data has posed new challenges to re- searchers. Search on such amount of data is a difficult task which turns even harder when the data belongs to a specific domain which has its own terminology and requires some background knowledge. Traditional information retrieval systems are based on keywords. In this kind of systems the output for a given query is a ranking of the documents that match the keywords. This model works well in scenarios with few documents or if the system achieves a high perfor- mance ensuring that the first results contain the most relevant documents. However, in most cases the collections are huge and the retrieval results are an endless list of documents that must be scanned manually. This work proposes an information retrieval approach which incorporates domain specific knowledge from an ontology within the traditional information retrieval model in order to overcome some of its limitations. The domain knowledge is used to add semantic capabili- ties and to provide the user with an enriched interface which includes metadata about the retrieved results, thus facilitating its exploration and filteringEl acelerado crecimiento en la cantidad de datos disponibles ha traído consigo nuevos retos para los investigadores. Buscar información en este gran volumen de datos, es una tarea difícil, que se torna aún más compleja cuando los datos pertenecen a un dominio especifíco el cual tiene su propia terminolgía y requiere un conocimiento previo. Los sistemas de búsqueda tradicionales son basados en palabras clave. En este tipo de siste- mas la respuesta a una consulta es dada en forma de una lista ordenada de documentos que contienen las palabras en la consulta. Este modelo funciona bien en escenarios en los cuales hay pocos documentos o si el sistema puede alcanzar una precisión muy alta garantizando que los primeros resultados contienen los documentos requeridos. Sin embargo, en la mayoría de los casos las colecciones de documentos son muy grandes y los resultados son una lista interminable de documentos los cuales deben ser explorados manualmente. Este trabajo propone una aproximación a la búsqueda de informacón, que incorpora conoci- miento del dominio proveniente de una ontología, dentro del modelo de búsqueda tradicional con el fin de superar algunas de sus limitaciones. El conocimiento del dominio es usado para adicionar capacidades semánticas y para proveer a los usuarios con una interfaz enriquecida la cual incluye meta-datos acerca de los resultados, facilitando su exploración y filtrado.Maestríaapplication/pdfspaUniversidad Nacional de Colombia Sede Bogotá Facultad de Ingeniería Departamento de Ingeniería de Sistemas e Industrial Ingeniería de SistemasIngeniería de SistemasRiveros Cruz, Luis Alejandro (2015) A knowledge-based approach to information retrieval in collections of textual documents of the biomedical domain. Maestría thesis, Universidad Nacional de Colombia.02 Bibliotecología y ciencias de la información / Library and information sciences6 Tecnología (ciencias aplicadas) / Technology62 Ingeniería y operaciones afines / EngineeringInformation retrievalOntologyNowledge basesBúsqueda de informaciónOntologíaA knowledge-based approach to information retrieval in collections of textual documents of the biomedical domainTrabajo de grado - Maestríainfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/acceptedVersionTexthttp://purl.org/redcol/resource_type/TMORIGINAL80056739.2015.pdfapplication/pdf1382424https://repositorio.unal.edu.co/bitstream/unal/54814/1/80056739.2015.pdf91ac51f0e6c8b3d7c5defc5cc450442eMD51THUMBNAIL80056739.2015.pdf.jpg80056739.2015.pdf.jpgGenerated Thumbnailimage/jpeg4554https://repositorio.unal.edu.co/bitstream/unal/54814/2/80056739.2015.pdf.jpg6e0aa03544eed764700412037f127114MD52unal/54814oai:repositorio.unal.edu.co:unal/548142024-03-14 23:08:11.143Repositorio Institucional Universidad Nacional de Colombiarepositorio_nal@unal.edu.co