Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico

El siguiente proyecto busca demostrar, de manera didáctica, el desarrollo de un sistema de inteligencia artificial a partir de un algoritmo de aprendizaje por refuerzo ilustrado en un juego de captura y escape. El propósito de este proyecto es abarcar el funcionamiento de herramientas como los algor...

Full description

Autores:: Perafán Bermúdez, Álvaro Alfonso
Rinaldy Potes, Martha Lucía

Tipo de recurso:

Fecha de publicación:: 2021

Institución:: Universidad del Norte

Repositorio:: Repositorio Uninorte

Idioma:: spa

id	REPOUNORT2_a430f4b05f1123a06030cc8c19ca7862
oai_identifier_str	oai:manglar.uninorte.edu.co:10584/9644
network_acronym_str	REPOUNORT2
network_name_str	Repositorio Uninorte
repository_id_str
dc.title.es_ES.fl_str_mv	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico
title	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico
spellingShingle	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico Robótica móvil Inteligencia artificial Reinforcement learning Q-learning
title_short	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico
title_full	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico
title_fullStr	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico
title_full_unstemmed	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico
title_sort	Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico
dc.creator.fl_str_mv	Perafán Bermúdez, Álvaro Alfonso Rinaldy Potes, Martha Lucía
dc.contributor.advisor.none.fl_str_mv	Caballero, Liesle Percybrooks, Winston
dc.contributor.author.none.fl_str_mv	Perafán Bermúdez, Álvaro Alfonso Rinaldy Potes, Martha Lucía
dc.subject.es_ES.fl_str_mv	Robótica móvil Inteligencia artificial
topic	Robótica móvil Inteligencia artificial Reinforcement learning Q-learning
dc.subject.en_US.fl_str_mv	Reinforcement learning Q-learning
description	El siguiente proyecto busca demostrar, de manera didáctica, el desarrollo de un sistema de inteligencia artificial a partir de un algoritmo de aprendizaje por refuerzo ilustrado en un juego de captura y escape. El propósito de este proyecto es abarcar el funcionamiento de herramientas como los algoritmos de Reinforcement Learning o más específicamente, Q-Learning, programados en Python, con el motivo de instruir la importancia de estas en el futuro de las ingenierías y la industria. Adicionalmente, se aplican estos conceptos a un problema físico, en este caso, el movimiento de dos robots a través de una cuadrícula de 6x6. Esta demostración se hace bajo la estructura del videojuego ‘El Gato & El Ratón’, donde, a los mencionados se les conoce como agentes. El juego tiene dos modos de jugador, el Jugador-Jugador (JcJ) y el Jugador-Máquina (JcM). Para el caso del Jugador-Jugador, se crea una interfaz que puede ser controlada manualmente. Esta se enlaza al entorno real por medio de una conexión de bluetooth aplicada para la manipulación de los robots móviles desde el ordenador. En el caso del modo Jugador-Máquina, el ordenador toma el rol de uno de los agentes (En este caso, el agente entrenado es el ratón) gracias al algoritmo de Q-Learning, que le permite también tomar, de manera autónoma, el control de uno de los robots. Este mismo controla físicamente al robot encargado de simular al agente en cuestión. El juego finaliza una vez el agente ratón haya encontrado al queso o el agente gato haya atrapado al agente ratón. Finalmente, según las simulaciones realizadas, se observó el comportamiento a partir de las recompensas y penalizaciones digitadas para cada decisión realizada por el ratón. Estas simulaciones modelan la creación de una personalidad de este agente las cuales se consideraron con las etiquetas miedoso, hambriento y balanceado.
publishDate	2021
dc.date.accessioned.none.fl_str_mv	2021-06-21T14:22:15Z
dc.date.available.none.fl_str_mv	2021-06-21T14:22:15Z
dc.date.issued.none.fl_str_mv	2021-05-31
dc.type.es_ES.fl_str_mv	article
dc.type.coar.fl_str_mv	http://purl.org/coar/resource_type/c_6501
dc.identifier.uri.none.fl_str_mv	http://hdl.handle.net/10584/9644
url	http://hdl.handle.net/10584/9644
dc.language.iso.es_ES.fl_str_mv	spa
language	spa
dc.rights.es_ES.fl_str_mv	Universidad del Norte
dc.rights.coar.fl_str_mv	http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv	Universidad del Norte http://purl.org/coar/access_right/c_abf2
dc.publisher.es_ES.fl_str_mv	Universidad del Norte, 2021
institution	Universidad del Norte
bitstream.url.fl_str_mv	https://manglar.uninorte.edu.co/bitstream/10584/9644/1/Dise%c3%b1o%20sin%20t%c3%adtulo%20%281%29.png https://manglar.uninorte.edu.co/bitstream/10584/9644/2/Dise%c3%b1o%20sin%20t%c3%adtulo.pdf https://manglar.uninorte.edu.co/bitstream/10584/9644/3/license.txt
bitstream.checksum.fl_str_mv	ad6862cbc33cb0515f05d44afcddad30 67a68fa67edff8fa4e3325b03fd513a2 8a4605be74aa9ea9d79846c1fba20a33
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositorio Digital de la Universidad del Norte
repository.mail.fl_str_mv	mauribe@uninorte.edu.co
_version_	1839636334729232384
spelling	Caballero, LieslePercybrooks, WinstonPerafán Bermúdez, Álvaro AlfonsoRinaldy Potes, Martha Lucía2021-06-21T14:22:15Z2021-06-21T14:22:15Z2021-05-31http://hdl.handle.net/10584/9644El siguiente proyecto busca demostrar, de manera didáctica, el desarrollo de un sistema de inteligencia artificial a partir de un algoritmo de aprendizaje por refuerzo ilustrado en un juego de captura y escape. El propósito de este proyecto es abarcar el funcionamiento de herramientas como los algoritmos de Reinforcement Learning o más específicamente, Q-Learning, programados en Python, con el motivo de instruir la importancia de estas en el futuro de las ingenierías y la industria. Adicionalmente, se aplican estos conceptos a un problema físico, en este caso, el movimiento de dos robots a través de una cuadrícula de 6x6. Esta demostración se hace bajo la estructura del videojuego ‘El Gato & El Ratón’, donde, a los mencionados se les conoce como agentes. El juego tiene dos modos de jugador, el Jugador-Jugador (JcJ) y el Jugador-Máquina (JcM). Para el caso del Jugador-Jugador, se crea una interfaz que puede ser controlada manualmente. Esta se enlaza al entorno real por medio de una conexión de bluetooth aplicada para la manipulación de los robots móviles desde el ordenador. En el caso del modo Jugador-Máquina, el ordenador toma el rol de uno de los agentes (En este caso, el agente entrenado es el ratón) gracias al algoritmo de Q-Learning, que le permite también tomar, de manera autónoma, el control de uno de los robots. Este mismo controla físicamente al robot encargado de simular al agente en cuestión. El juego finaliza una vez el agente ratón haya encontrado al queso o el agente gato haya atrapado al agente ratón. Finalmente, según las simulaciones realizadas, se observó el comportamiento a partir de las recompensas y penalizaciones digitadas para cada decisión realizada por el ratón. Estas simulaciones modelan la creación de una personalidad de este agente las cuales se consideraron con las etiquetas miedoso, hambriento y balanceado.The following project seeks to demonstrate, in a didactic way, the development of an Artificial Intelligence system based on a reinforcement learning algorithm illustrated in a capture and escape game. The purpose of this project is to cover the operation of tools such as Reinforcement Learning algorithms or more specifically, Q-Learning, programmed in Python, in order to teach the importance of these in the future of engineering and industry. Additionally, these concepts are applied to a physical problem, in this case, the movement of two robots through a 6x6 grid. This demonstration is done under the structure of the video game 'The Cat & The Mouse', where the mentioned are known as agents. The game has two player modes, Player-Player (JcJ) and Player-Machine (JcM). In the case of the Player-Player mode, an interface is created that can be controlled manually. This is linked to the real environment by means of a bluetooth connection applied for the manipulation of the mobile robots from the computer. In the case of the Player-Machine mode, the computer takes the role of one of the agents (in this case, the trained agent is the mouse) thanks to the Q-Learning algorithm, which also allows it to autonomously take control of one of the robots. It physically controls the robot in charge of simulating the agent in question. The game ends once the mouse agent has found the cheese or the cat agent has caught the mouse agent. Finally, according to the simulations performed, the behavior was observed from the rewards and penalties typed for each decision made by the mouse. These simulations model the creation of a personality of this agent, which were considered with the labels shyl, hungry and balanced.spaUniversidad del Norte, 2021Universidad del Nortehttp://purl.org/coar/access_right/c_abf2Robótica móvilInteligencia artificialReinforcement learningQ-learningSistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdicoarticlehttp://purl.org/coar/resource_type/c_6501ORIGINALDiseño sin título (1).pngDiseño sin título (1).pngInterfaz gráfica y entorno física del videojuegoimage/png182365https://manglar.uninorte.edu.co/bitstream/10584/9644/1/Dise%c3%b1o%20sin%20t%c3%adtulo%20%281%29.pngad6862cbc33cb0515f05d44afcddad30MD51Diseño sin título.pdfDiseño sin título.pdfapplication/pdf221497https://manglar.uninorte.edu.co/bitstream/10584/9644/2/Dise%c3%b1o%20sin%20t%c3%adtulo.pdf67a68fa67edff8fa4e3325b03fd513a2MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-81748https://manglar.uninorte.edu.co/bitstream/10584/9644/3/license.txt8a4605be74aa9ea9d79846c1fba20a33MD5310584/9644oai:manglar.uninorte.edu.co:10584/96442021-06-21 09:22:15.795Repositorio Digital de la Universidad del Nortemauribe@uninorte.edu.coTk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=

Sistema de demostración de inteligencia artificial para el control de robots móviles en un entorno lúdico

Publicaciones similares