Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents

The thesis develops an ideal inverse-dynamics learning algorithm which can learn the properties of the sensors and actuators under its control. The algorithm converges on an ideal feature space, where the implementation details of the actuators under the algorithm's control are rendered invisib...


Authors: Orozco García, Tomás

Resource type:

Publication date: 2020

Institution: Universidad de los Andes

Repository: Séneca: repositorio Uniandes

Language: eng

id	UNIANDES2_4afb929e73fd20e065cc944137dd9f15
oai_identifier_str	oai:repositorio.uniandes.edu.co:1992/50981
network_acronym_str	UNIANDES2
network_name_str	Séneca: repositorio Uniandes
repository_id_str
dc.title.spa.fl_str_mv	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents
title	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents
spellingShingle	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents; Aprendizaje por refuerzo (Aprendizaje automático); Aprendizaje automático (Inteligencia artificial); Aprendizaje impulsado por la curiosidad; Algoritmos (Computadores); Ingeniería
title_short	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents
title_full	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents
title_fullStr	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents
title_full_unstemmed	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents
title_sort	Building perfectly curious machines: using structural causal modeling to define the ideal feature space at the learning baseline of curiosity-driven agents
dc.creator.fl_str_mv	Orozco García, Tomás
dc.contributor.advisor.none.fl_str_mv	Cardozo Álvarez, Nicolás
dc.contributor.author.none.fl_str_mv	Orozco García, Tomás
dc.contributor.jury.none.fl_str_mv	Mariño Drews, Olga; Dusparic, Ivana
dc.subject.armarc.es_CO.fl_str_mv	Aprendizaje por refuerzo (Aprendizaje automático); Aprendizaje automático (Inteligencia artificial); Aprendizaje impulsado por la curiosidad; Algoritmos (Computadores)
topic	Aprendizaje por refuerzo (Aprendizaje automático); Aprendizaje automático (Inteligencia artificial); Aprendizaje impulsado por la curiosidad; Algoritmos (Computadores); Ingeniería
dc.subject.themes.none.fl_str_mv	Ingeniería
description	The thesis develops an ideal inverse-dynamics learning algorithm that can learn the properties of the sensors and actuators under its control. The algorithm converges on an ideal feature space in which the implementation details of those actuators are rendered invisible to the forward dynamics of a curiosity-driven algorithm that uses the same sensors and actuators and runs on top of that feature space. The curiosity-driven algorithm's reward is strictly determined by the minimization of the error of its prediction of the next state of its environment, given the current state and its action. That is, the ideal feature space lets the learning trajectory of the curiosity-driven algorithm's forward dynamics concentrate on the dynamics of the environment, avoiding distractions that originate in the properties of the sensors and actuators under the algorithm's control.
publishDate	2020
dc.date.issued.none.fl_str_mv	2020
dc.date.accessioned.none.fl_str_mv	2021-08-10T18:05:25Z
dc.date.available.none.fl_str_mv	2021-08-10T18:05:25Z
dc.type.spa.fl_str_mv	Trabajo de grado - Maestría
dc.type.coarversion.fl_str_mv	http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.driver.spa.fl_str_mv	info:eu-repo/semantics/masterThesis
dc.type.content.spa.fl_str_mv	Text
dc.type.redcol.spa.fl_str_mv	http://purl.org/redcol/resource_type/TM
dc.identifier.uri.none.fl_str_mv	http://hdl.handle.net/1992/50981
dc.identifier.pdf.none.fl_str_mv	22826.pdf
dc.identifier.instname.spa.fl_str_mv	instname:Universidad de los Andes
dc.identifier.reponame.spa.fl_str_mv	reponame:Repositorio Institucional Séneca
dc.identifier.repourl.spa.fl_str_mv	repourl:https://repositorio.uniandes.edu.co/
url	http://hdl.handle.net/1992/50981
identifier_str_mv	22826.pdf; instname:Universidad de los Andes; reponame:Repositorio Institucional Séneca; repourl:https://repositorio.uniandes.edu.co/
dc.language.iso.none.fl_str_mv	eng
language	eng
dc.rights.uri.*.fl_str_mv	http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.accessrights.spa.fl_str_mv	info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv	http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv	http://creativecommons.org/licenses/by-nc-nd/4.0/ http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv	openAccess
dc.format.extent.none.fl_str_mv	72 leaves
dc.format.mimetype.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidad de los Andes
dc.publisher.program.none.fl_str_mv	Maestría en Ingeniería de Sistemas y Computación
dc.publisher.faculty.none.fl_str_mv	Facultad de Ingeniería
dc.publisher.department.none.fl_str_mv	Departamento de Ingeniería de Sistemas y Computación
publisher.none.fl_str_mv	Universidad de los Andes
institution	Universidad de los Andes
bitstream.url.fl_str_mv	https://repositorio.uniandes.edu.co/bitstreams/f48e1ed4-77c0-4045-aac9-4d615c7ae65d/download https://repositorio.uniandes.edu.co/bitstreams/e7f8e771-c24a-4e88-98c7-0ceea0a419e1/download https://repositorio.uniandes.edu.co/bitstreams/52baab7f-1186-46ee-ad28-01e40f25077c/download
bitstream.checksum.fl_str_mv	a282f46ed9ca5c0b86aa558f8c1c8394 8d770ec793b4e53dbfcfbfa4f8ea5bc5 fe5cfc4cc1b8ef86d73b64644ce58741
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositorio institucional Séneca
repository.mail.fl_str_mv	adminrepositorio@uniandes.edu.co
_version_	1837005208903745536
spelling	By consulting and using this resource, you accept the terms of use established by the authors.; Cardozo Álvarez, Nicolás; virtual::9085-1; Orozco García, Tomás; 1234cdb4-0a6a-470f-9d64-86d694bc585f; 500; Magíster en Ingeniería de Sistemas y Computación; Maestría; 200923093; Publication; https://scholar.google.es/citations?user=3iTzjQsAAAAJ; 0000-0002-1094-9952; a77ff528-fc33-44d6-9022-814f81ef407a; ORIGINAL 22826.pdf application/pdf 715039 https://repositorio.uniandes.edu.co/bitstreams/f48e1ed4-77c0-4045-aac9-4d615c7ae65d/download a282f46ed9ca5c0b86aa558f8c1c8394 MD5 1; THUMBNAIL 22826.pdf.jpg IM Thumbnail image/jpeg 9731 https://repositorio.uniandes.edu.co/bitstreams/e7f8e771-c24a-4e88-98c7-0ceea0a419e1/download 8d770ec793b4e53dbfcfbfa4f8ea5bc5 MD5 5; TEXT 22826.pdf.txt Extracted text text/plain 158575 https://repositorio.uniandes.edu.co/bitstreams/52baab7f-1186-46ee-ad28-01e40f25077c/download fe5cfc4cc1b8ef86d73b64644ce58741 MD5 4; 2024-03-13 13:50:40.784; open.access
