Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.

Trajectory planning in autonomous mobile robots is an open problem because, when working in dynamic environments, it is very expensive to program the entire navigation system for a particular application or, failing that, it would be very difficult for the programmer to correctly predict the changes...

Full description

Autores:: Carreño Puentes, Sergio Manuel

Tipo de recurso:: Trabajo de grado de pregrado

Fecha de publicación:: 2022

Institución:: Universidad Antonio Nariño

Repositorio:: Repositorio UAN

Idioma:: spa

id	UAntonioN2_6350e6a010caef80689f532b81bc88a6
oai_identifier_str	oai:repositorio.uan.edu.co:123456789/7242
network_acronym_str	UAntonioN2
network_name_str	Repositorio UAN
repository_id_str
dc.title.es_ES.fl_str_mv	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.
title	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.
spellingShingle	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles. Aprendizaje por refuerzo, aprendizaje automático, aprendizaje profundo, neurona artificial, redes neuronales, robots autónomos Reinforcement learning, machine learning, deep learning, artificial neuron, neural networks, autonomous robots.
title_short	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.
title_full	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.
title_fullStr	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.
title_full_unstemmed	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.
title_sort	Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.
dc.creator.fl_str_mv	Carreño Puentes, Sergio Manuel
dc.contributor.advisor.spa.fl_str_mv	Erazo Ordoñez, Christian
dc.contributor.author.spa.fl_str_mv	Carreño Puentes, Sergio Manuel
dc.subject.es_ES.fl_str_mv	Aprendizaje por refuerzo, aprendizaje automático, aprendizaje profundo, neurona artificial, redes neuronales, robots autónomos
topic	Aprendizaje por refuerzo, aprendizaje automático, aprendizaje profundo, neurona artificial, redes neuronales, robots autónomos Reinforcement learning, machine learning, deep learning, artificial neuron, neural networks, autonomous robots.
dc.subject.keyword.es_ES.fl_str_mv	Reinforcement learning, machine learning, deep learning, artificial neuron, neural networks, autonomous robots.
description	Trajectory planning in autonomous mobile robots is an open problem because, when working in dynamic environments, it is very expensive to program the entire navigation system for a particular application or, failing that, it would be very difficult for the programmer to correctly predict the changes in the environment. In order to contribute to this field, this document shows the process of designing an intelligent controller based on reinforced learning and more specifically using the Q-learning algorithm to drive two mobile robots through a simulated environment, and that autonomously manage to learn the trajectory that will take them to a target position without having prior knowledge about the work environment
publishDate	2022
dc.date.accessioned.none.fl_str_mv	2022-11-15T15:11:21Z
dc.date.available.none.fl_str_mv	2022-11-15T15:11:21Z
dc.date.issued.spa.fl_str_mv	2022-06-07
dc.type.spa.fl_str_mv	Trabajo de grado (Pregrado y/o Especialización)
dc.type.coar.spa.fl_str_mv	http://purl.org/coar/resource_type/c_7a1f
dc.type.coarversion.none.fl_str_mv	http://purl.org/coar/version/c_970fb48d4fbd8a85
format	http://purl.org/coar/resource_type/c_7a1f
dc.identifier.uri.none.fl_str_mv	http://repositorio.uan.edu.co/handle/123456789/7242
dc.identifier.bibliographicCitation.spa.fl_str_mv	Ambhore, S. (2020). A Comprehensive Study on Robot Learning from Demonstration. 2nd International Conference on Innovative Mechanisms for Industry Applications, ICIMIA 2020 - Conference Proceedings, Icimia, 291–299. https://doi.org/10.1109/ICIMIA48430.2020.9074946 Arias Rivera, M. (2018). Autonomous Navigation by Reinforcement Learning Artola Moreno, Á. (2019). Clasificación de imágenes usando redes neuronales convolucionales en Python. Universidad de Sevilla. Baker, B., Kanitscheider, I., Mrkov, T., Wu, Y., Powell, G., McGrew, B., & Mordatch, I. (2019). Emergent Tool Use From Multi-Agent Autocurricula. Bañó, A. (2003). Análisis y diseño del control de posición de un robot móvil con tracción diferencial. http://deeea.urv.cat/public/PROPOSTES/pub/pdf/333pub.pdf Bellemare, M., Candido, S., Castro, P. S., Gong, J., Machado, M., Subhodeep, M., Ponda, S., & Eang, Z. (2020). Autonomous navigation of stratospheric balloons using reinforcement learning. Nature 588, 77–82. Calvo, D. (2017a). Clasificación de redes neuronales artificiales. https://www.diegocalvo.es/clasificacion-de-redes-neuronales-artificiales/ Calvo, D. (2017b). Red Neuronal Convolucional CNN. https://www.diegocalvo.es/red-neuronalconvolucional/ Clinic, M. (2021). Célula nerviosa (neurona). https://www.mayoclinic.org/es-es/nerve-cellneuron/img-20007830 Contreras, L. (2003). Estudio e Implementación de algunos comportamientos básicos de un animal en un robot de tipo genérico [Tesis profesional, Universidad de las Américas Puebla]. http://catarina.udlap.mx/u_dl_a/tales/documentos/lis/contreras_o_l
dc.identifier.instname.spa.fl_str_mv	instname:Universidad Antonio Nariño
dc.identifier.reponame.spa.fl_str_mv	reponame:Repositorio Institucional UAN
dc.identifier.repourl.spa.fl_str_mv	repourl:https://repositorio.uan.edu.co/
url	http://repositorio.uan.edu.co/handle/123456789/7242
identifier_str_mv	Ambhore, S. (2020). A Comprehensive Study on Robot Learning from Demonstration. 2nd International Conference on Innovative Mechanisms for Industry Applications, ICIMIA 2020 - Conference Proceedings, Icimia, 291–299. https://doi.org/10.1109/ICIMIA48430.2020.9074946 Arias Rivera, M. (2018). Autonomous Navigation by Reinforcement Learning Artola Moreno, Á. (2019). Clasificación de imágenes usando redes neuronales convolucionales en Python. Universidad de Sevilla. Baker, B., Kanitscheider, I., Mrkov, T., Wu, Y., Powell, G., McGrew, B., & Mordatch, I. (2019). Emergent Tool Use From Multi-Agent Autocurricula. Bañó, A. (2003). Análisis y diseño del control de posición de un robot móvil con tracción diferencial. http://deeea.urv.cat/public/PROPOSTES/pub/pdf/333pub.pdf Bellemare, M., Candido, S., Castro, P. S., Gong, J., Machado, M., Subhodeep, M., Ponda, S., & Eang, Z. (2020). Autonomous navigation of stratospheric balloons using reinforcement learning. Nature 588, 77–82. Calvo, D. (2017a). Clasificación de redes neuronales artificiales. https://www.diegocalvo.es/clasificacion-de-redes-neuronales-artificiales/ Calvo, D. (2017b). Red Neuronal Convolucional CNN. https://www.diegocalvo.es/red-neuronalconvolucional/ Clinic, M. (2021). Célula nerviosa (neurona). https://www.mayoclinic.org/es-es/nerve-cellneuron/img-20007830 Contreras, L. (2003). Estudio e Implementación de algunos comportamientos básicos de un animal en un robot de tipo genérico [Tesis profesional, Universidad de las Américas Puebla]. http://catarina.udlap.mx/u_dl_a/tales/documentos/lis/contreras_o_l instname:Universidad Antonio Nariño reponame:Repositorio Institucional UAN repourl:https://repositorio.uan.edu.co/
dc.language.iso.spa.fl_str_mv	spa
language	spa
dc.rights.none.fl_str_mv	Acceso abierto
dc.rights.license.spa.fl_str_mv	Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rights.uri.spa.fl_str_mv	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.accessrights.spa.fl_str_mv	info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv	http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv	Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) Acceso abierto https://creativecommons.org/licenses/by-nc-nd/4.0/ http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv	openAccess
dc.coverage.spatial.spa.fl_str_mv	Colombia(Bogotá,Dc)
dc.publisher.spa.fl_str_mv	Universidad Antonio Nariño
dc.publisher.program.spa.fl_str_mv	Ingeniería Electrónica
dc.publisher.faculty.spa.fl_str_mv	Facultad de Ingeniería Mecánica, Electrónica y Biomédica
dc.publisher.campus.spa.fl_str_mv	Bogotá - Sur
institution	Universidad Antonio Nariño
bitstream.url.fl_str_mv	https://repositorio.uan.edu.co/bitstreams/e6fca03e-6e1e-4669-ac47-20e7ad4e9445/download https://repositorio.uan.edu.co/bitstreams/21b69bfe-033a-475c-9415-e2c908fcc05e/download https://repositorio.uan.edu.co/bitstreams/b6fc76e5-eea8-4fe9-ae8e-71547769cb7e/download https://repositorio.uan.edu.co/bitstreams/b9e91c80-568f-4b02-9cfe-3a45b5f3d99a/download
bitstream.checksum.fl_str_mv	d9061c714f1773eb12cc8d51669eb984 888d6bdaf5cf7d8f3102749084826e21 1d1fd14dd82e514330bc04ec12c39f06 9868ccc48a14c8d591352b6eaf7f6239
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5 MD5
repository.name.fl_str_mv	Repositorio Institucional UAN
repository.mail.fl_str_mv	alertas.repositorio@uan.edu.co
_version_	1851059441221238784
spelling	Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)Acceso abiertohttps://creativecommons.org/licenses/by-nc-nd/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Erazo Ordoñez, ChristianCarreño Puentes, Sergio Manuel10441711897Colombia(Bogotá,Dc)2022-11-15T15:11:21Z2022-11-15T15:11:21Z2022-06-07http://repositorio.uan.edu.co/handle/123456789/7242Ambhore, S. (2020). A Comprehensive Study on Robot Learning from Demonstration. 2nd International Conference on Innovative Mechanisms for Industry Applications, ICIMIA 2020 - Conference Proceedings, Icimia, 291–299. https://doi.org/10.1109/ICIMIA48430.2020.9074946Arias Rivera, M. (2018). Autonomous Navigation by Reinforcement LearningArtola Moreno, Á. (2019). Clasificación de imágenes usando redes neuronales convolucionales en Python. Universidad de Sevilla.Baker, B., Kanitscheider, I., Mrkov, T., Wu, Y., Powell, G., McGrew, B., & Mordatch, I. (2019). Emergent Tool Use From Multi-Agent Autocurricula.Bañó, A. (2003). Análisis y diseño del control de posición de un robot móvil con tracción diferencial. http://deeea.urv.cat/public/PROPOSTES/pub/pdf/333pub.pdfBellemare, M., Candido, S., Castro, P. S., Gong, J., Machado, M., Subhodeep, M., Ponda, S., & Eang, Z. (2020). Autonomous navigation of stratospheric balloons using reinforcement learning. Nature 588, 77–82.Calvo, D. (2017a). Clasificación de redes neuronales artificiales. https://www.diegocalvo.es/clasificacion-de-redes-neuronales-artificiales/Calvo, D. (2017b). Red Neuronal Convolucional CNN. https://www.diegocalvo.es/red-neuronalconvolucional/Clinic, M. (2021). Célula nerviosa (neurona). https://www.mayoclinic.org/es-es/nerve-cellneuron/img-20007830Contreras, L. (2003). Estudio e Implementación de algunos comportamientos básicos de un animal en un robot de tipo genérico [Tesis profesional, Universidad de las Américas Puebla]. http://catarina.udlap.mx/u_dl_a/tales/documentos/lis/contreras_o_linstname:Universidad Antonio Nariñoreponame:Repositorio Institucional UANrepourl:https://repositorio.uan.edu.co/Trajectory planning in autonomous mobile robots is an open problem because, when working in dynamic environments, it is very expensive to program the entire navigation system for a particular application or, failing that, it would be very difficult for the programmer to correctly predict the changes in the environment. In order to contribute to this field, this document shows the process of designing an intelligent controller based on reinforced learning and more specifically using the Q-learning algorithm to drive two mobile robots through a simulated environment, and that autonomously manage to learn the trajectory that will take them to a target position without having prior knowledge about the work environmentLa planeación de trayectorias en los robots móviles autónomos es un problema abierto debido a que, al trabajar en ambientes dinámicos, resulta muy costoso programar todo el sistema de navegación para una aplicación particular o en su defecto resultaría muy complicado, para el programador, lograr predecir de manera acertada los cambios en el entorno. A fin de contribuir a este campo, en el presente documento se muestra el proceso de diseño de un controlador inteligente basado en aprendizaje reforzado y más concretamente utilizando el algoritmo Q-learning para lograr conducir dos robots móviles a través de un entorno simulado, y que de manera autónoma logren aprender la trayectoria que los llevará a una posición objetivo sin poseer conocimiento previo sobre el ambiente de trabajo.Ingeniero(a) Electrónico(a)PregradoPresencialInvestigaciónspaUniversidad Antonio NariñoIngeniería ElectrónicaFacultad de Ingeniería Mecánica, Electrónica y BiomédicaBogotá - SurAprendizaje por refuerzo,aprendizaje automático,aprendizaje profundo,neurona artificial,redes neuronales,robots autónomosReinforcement learning,machine learning,deep learning,artificial neuron,neural networks,autonomous robots.Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.Trabajo de grado (Pregrado y/o Especialización)http://purl.org/coar/resource_type/c_7a1fhttp://purl.org/coar/version/c_970fb48d4fbd8a85GeneralORIGINAL2022_T.G-SergioManuelCarreñoPuentes.pdf2022_T.G-SergioManuelCarreñoPuentes.pdfTrabajo de gradoapplication/pdf5398806https://repositorio.uan.edu.co/bitstreams/e6fca03e-6e1e-4669-ac47-20e7ad4e9445/downloadd9061c714f1773eb12cc8d51669eb984MD512022_Autorización.SergioManuelCarreñoPuentes_.pdf2022_Autorización.SergioManuelCarreñoPuentes_.pdfAutorización de autoresapplication/pdf956776https://repositorio.uan.edu.co/bitstreams/21b69bfe-033a-475c-9415-e2c908fcc05e/download888d6bdaf5cf7d8f3102749084826e21MD522022_ Actadegrabajode_grado Sergio Carreño.docx.pdf2022_ Actadegrabajode_grado Sergio Carreño.docx.pdfActa de sustentaciónapplication/pdf233643https://repositorio.uan.edu.co/bitstreams/b6fc76e5-eea8-4fe9-ae8e-71547769cb7e/download1d1fd14dd82e514330bc04ec12c39f06MD53CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.uan.edu.co/bitstreams/b9e91c80-568f-4b02-9cfe-3a45b5f3d99a/download9868ccc48a14c8d591352b6eaf7f6239MD54123456789/7242oai:repositorio.uan.edu.co:123456789/72422024-10-09 23:16:10.601https://creativecommons.org/licenses/by-nc-nd/4.0/Acceso abiertoopen.accesshttps://repositorio.uan.edu.coRepositorio Institucional UANalertas.repositorio@uan.edu.co

Diseño y simulación de un controlador inteligente utilizando aprendizaje por refuerzo Q-learning para la navegación autónoma de dos robots móviles.

Publicaciones similares