Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público

In recent years, the use of Machine Learning techniques has been increasing in almost any technological environment due to the great utility that they can offer. One of these techniques is called Reinforment Learning or Reinforcement Learning (AR), which is used in different environments such as vid...

Full description

Autores:: Salcedo Rodríguez, Mateo

Tipo de recurso:: Trabajo de grado de pregrado

Fecha de publicación:: 2020

Institución:: Universidad de los Andes

Repositorio:: Séneca: repositorio Uniandes

Idioma:: spa

id	UNIANDES2_5e450efd64ce05f52e76204bc2d903e7
oai_identifier_str	oai:repositorio.uniandes.edu.co:1992/51469
network_acronym_str	UNIANDES2
network_name_str	Séneca: repositorio Uniandes
repository_id_str
spelling	Al consultar y hacer uso de este recurso, está aceptando las condiciones de uso establecidas por los autores.http://creativecommons.org/licenses/by-nc-nd/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Cardozo Álvarez, Nicolásvirtual::3376-1Salcedo Rodríguez, Mateof1e16f57-b5de-4261-8fea-20dadeed7b805002021-08-10T18:26:32Z2021-08-10T18:26:32Z2020http://hdl.handle.net/1992/5146922778.pdfinstname:Universidad de los Andesreponame:Repositorio Institucional Sénecarepourl:https://repositorio.uniandes.edu.co/In recent years, the use of Machine Learning techniques has been increasing in almost any technological environment due to the great utility that they can offer. One of these techniques is called Reinforment Learning or Reinforcement Learning (AR), which is used in different environments such as video games or control problems. One of the most interesting uses of this algorithm can be presented in a system such as public transport, where the main objective is to reduce the travel time of users. Taking into account that the space of possible states of the system is quite large, knowing exactly which is the correct action that optimizes the objective of the system for each of the states is a complex task and in many cases impossible, taking into account that the set of states does not have to be known in advance.En los últimos años el uso de técnicas de Machine Learning ha ido en aumento en casi cualquier ambiente tecnológico por la gran utilidad que estas pueden llegar a ofrecer. Una de estas técnicas es llamada Reinforment Learning o Aprendizaje por Refuerzo (AR), la cual es usada en diferentes ambientes como los video juegos o los problemas de control. Uno de los usos más interesantes de este algoritmo se puede presentar en un sistema como el transporte público, donde el objetivo principal es reducir el tiempo de viaje de los usuarios. Teniendo en cuenta que el espacio de estados posibles del sistema es bastante grande, conocer con exactitud cuál es la acción correcta que optimiza el objetivo del sistema para cada uno de los estados es una tarea compleja y en muchos casos imposible, teniendo en cuenta que el conjunto de estados no tiene que ser conocido de antemano.Ingeniero de Sistemas y ComputaciónPregrado18 hojasapplication/pdfspaUniversidad de los AndesIngeniería de Sistemas y ComputaciónFacultad de IngenieríaDepartamento de Ingeniería de Sistemas y ComputaciónAlgoritmo de aprendizaje por refuerzo para el control de un sistema de transporte públicoTrabajo de grado - Pregradoinfo:eu-repo/semantics/bachelorThesishttp://purl.org/coar/resource_type/c_7a1fhttp://purl.org/coar/version/c_970fb48d4fbd8a85Texthttp://purl.org/redcol/resource_type/TPTransporte públicoAlgoritmos (Computadores)Aprendizaje por refuerzo (Aprendizaje automático)Tiempo de viaje (Ingeniería del tránsito)Ingeniería201720208Publicationhttps://scholar.google.es/citations?user=3iTzjQsAAAAJvirtual::3376-10000-0002-1094-9952virtual::3376-1a77ff528-fc33-44d6-9022-814f81ef407avirtual::3376-1a77ff528-fc33-44d6-9022-814f81ef407avirtual::3376-1THUMBNAIL22778.pdf.jpg22778.pdf.jpgIM Thumbnailimage/jpeg4690https://repositorio.uniandes.edu.co/bitstreams/b37caa3c-1a1d-438a-96ee-68d399fe2115/download359d9c5443fbd2f86cd660061cdf2c74MD55ORIGINAL22778.pdfapplication/pdf877747https://repositorio.uniandes.edu.co/bitstreams/aa69a9e3-110e-41c2-9e62-c61a5e2b79d3/download78ee7824560b53c546363c5c3ffc7a9eMD51TEXT22778.pdf.txt22778.pdf.txtExtracted texttext/plain22053https://repositorio.uniandes.edu.co/bitstreams/f659b2d8-3b14-48f3-80fd-a7bd6bcff71a/download8e658724124f5546a7ca720dd4f7478cMD541992/51469oai:repositorio.uniandes.edu.co:1992/514692024-03-13 12:25:21.898http://creativecommons.org/licenses/by-nc-nd/4.0/open.accesshttps://repositorio.uniandes.edu.coRepositorio institucional Sénecaadminrepositorio@uniandes.edu.co
dc.title.spa.fl_str_mv	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público
title	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público
spellingShingle	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público Transporte público Algoritmos (Computadores) Aprendizaje por refuerzo (Aprendizaje automático) Tiempo de viaje (Ingeniería del tránsito) Ingeniería
title_short	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público
title_full	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público
title_fullStr	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público
title_full_unstemmed	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público
title_sort	Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público
dc.creator.fl_str_mv	Salcedo Rodríguez, Mateo
dc.contributor.advisor.none.fl_str_mv	Cardozo Álvarez, Nicolás
dc.contributor.author.none.fl_str_mv	Salcedo Rodríguez, Mateo
dc.subject.armarc.none.fl_str_mv	Transporte público Algoritmos (Computadores) Aprendizaje por refuerzo (Aprendizaje automático) Tiempo de viaje (Ingeniería del tránsito)
topic	Transporte público Algoritmos (Computadores) Aprendizaje por refuerzo (Aprendizaje automático) Tiempo de viaje (Ingeniería del tránsito) Ingeniería
dc.subject.themes.none.fl_str_mv	Ingeniería
description	In recent years, the use of Machine Learning techniques has been increasing in almost any technological environment due to the great utility that they can offer. One of these techniques is called Reinforment Learning or Reinforcement Learning (AR), which is used in different environments such as video games or control problems. One of the most interesting uses of this algorithm can be presented in a system such as public transport, where the main objective is to reduce the travel time of users. Taking into account that the space of possible states of the system is quite large, knowing exactly which is the correct action that optimizes the objective of the system for each of the states is a complex task and in many cases impossible, taking into account that the set of states does not have to be known in advance.
publishDate	2020
dc.date.issued.none.fl_str_mv	2020
dc.date.accessioned.none.fl_str_mv	2021-08-10T18:26:32Z
dc.date.available.none.fl_str_mv	2021-08-10T18:26:32Z
dc.type.spa.fl_str_mv	Trabajo de grado - Pregrado
dc.type.coarversion.fl_str_mv	http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.driver.spa.fl_str_mv	info:eu-repo/semantics/bachelorThesis
dc.type.coar.spa.fl_str_mv	http://purl.org/coar/resource_type/c_7a1f
dc.type.content.spa.fl_str_mv	Text
dc.type.redcol.spa.fl_str_mv	http://purl.org/redcol/resource_type/TP
format	http://purl.org/coar/resource_type/c_7a1f
dc.identifier.uri.none.fl_str_mv	http://hdl.handle.net/1992/51469
dc.identifier.pdf.none.fl_str_mv	22778.pdf
dc.identifier.instname.spa.fl_str_mv	instname:Universidad de los Andes
dc.identifier.reponame.spa.fl_str_mv	reponame:Repositorio Institucional Séneca
dc.identifier.repourl.spa.fl_str_mv	repourl:https://repositorio.uniandes.edu.co/
url	http://hdl.handle.net/1992/51469
identifier_str_mv	22778.pdf instname:Universidad de los Andes reponame:Repositorio Institucional Séneca repourl:https://repositorio.uniandes.edu.co/
dc.language.iso.none.fl_str_mv	spa
language	spa
dc.rights.uri.*.fl_str_mv	http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.accessrights.spa.fl_str_mv	info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv	http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv	http://creativecommons.org/licenses/by-nc-nd/4.0/ http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv	openAccess
dc.format.extent.none.fl_str_mv	18 hojas
dc.format.mimetype.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidad de los Andes
dc.publisher.program.none.fl_str_mv	Ingeniería de Sistemas y Computación
dc.publisher.faculty.none.fl_str_mv	Facultad de Ingeniería
dc.publisher.department.none.fl_str_mv	Departamento de Ingeniería de Sistemas y Computación
publisher.none.fl_str_mv	Universidad de los Andes
institution	Universidad de los Andes
bitstream.url.fl_str_mv	https://repositorio.uniandes.edu.co/bitstreams/b37caa3c-1a1d-438a-96ee-68d399fe2115/download https://repositorio.uniandes.edu.co/bitstreams/aa69a9e3-110e-41c2-9e62-c61a5e2b79d3/download https://repositorio.uniandes.edu.co/bitstreams/f659b2d8-3b14-48f3-80fd-a7bd6bcff71a/download
bitstream.checksum.fl_str_mv	359d9c5443fbd2f86cd660061cdf2c74 78ee7824560b53c546363c5c3ffc7a9e 8e658724124f5546a7ca720dd4f7478c
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositorio institucional Séneca
repository.mail.fl_str_mv	adminrepositorio@uniandes.edu.co
_version_	1837004976663035904

Algoritmo de aprendizaje por refuerzo para el control de un sistema de transporte público

Publicaciones similares