Deep reinforcement learning and population dynamics for water systems control

"We study the control of water-tank systems with linear and nonlinear coupled dynamics. As a way to alleviate the design difficulties and avoid modeling simplifications, we design centralized and decentralized deep reinforcement learning (RL) strategies to control interconnected linear and nonl...

Full description

Autores:: Ochoa Tamayo, Daniel Esteban

Tipo de recurso:

Fecha de publicación:: 2019

Institución:: Universidad de los Andes

Repositorio:: Séneca: repositorio Uniandes

Idioma:: eng

id	UNIANDES2_0711b4f7a158aece0b498d70400ee47b
oai_identifier_str	oai:repositorio.uniandes.edu.co:1992/44007
network_acronym_str	UNIANDES2
network_name_str	Séneca: repositorio Uniandes
repository_id_str
dc.title.es_CO.fl_str_mv	Deep reinforcement learning and population dynamics for water systems control
title	Deep reinforcement learning and population dynamics for water systems control
spellingShingle	Deep reinforcement learning and population dynamics for water systems control Sistemas de control - Aplicaciones industriales - Investigaciones Canales (Ingeniería hidráulica) - Control - Investigaciones Aprendizaje por refuerzo (Aprendizaje automático) - Aplicaciones - Investigaciones Ingeniería
title_short	Deep reinforcement learning and population dynamics for water systems control
title_full	Deep reinforcement learning and population dynamics for water systems control
title_fullStr	Deep reinforcement learning and population dynamics for water systems control
title_full_unstemmed	Deep reinforcement learning and population dynamics for water systems control
title_sort	Deep reinforcement learning and population dynamics for water systems control
dc.creator.fl_str_mv	Ochoa Tamayo, Daniel Esteban
dc.contributor.advisor.none.fl_str_mv	Quijano Silva, Nicanor
dc.contributor.author.none.fl_str_mv	Ochoa Tamayo, Daniel Esteban
dc.subject.armarc.es_CO.fl_str_mv	Sistemas de control - Aplicaciones industriales - Investigaciones Canales (Ingeniería hidráulica) - Control - Investigaciones Aprendizaje por refuerzo (Aprendizaje automático) - Aplicaciones - Investigaciones
topic	Sistemas de control - Aplicaciones industriales - Investigaciones Canales (Ingeniería hidráulica) - Control - Investigaciones Aprendizaje por refuerzo (Aprendizaje automático) - Aplicaciones - Investigaciones Ingeniería
dc.subject.themes.none.fl_str_mv	Ingeniería
description	"We study the control of water-tank systems with linear and nonlinear coupled dynamics. As a way to alleviate the design difficulties and avoid modeling simplifications, we design centralized and decentralized deep reinforcement learning (RL) strategies to control interconnected linear and nonlinear water-tank systems relevant for industrial process control. For the linear variant, we propose a hierarchical control strategy to solve the optimal drainage problem in open-channel systems by combining an optimization technique known as minimum scaled consensus control (MSCC) with the deep deterministic policy gradient (DDPG) algorithm. On the other case, for the nonlinear dynamics we use actor-critic structures for the DDPG and the proximal policy optimization (PPO) algorithm and propose a variant called the multi-critic architecture, which allows the addition of prior knowledge on dominant input-output couplings of multi-input multi-output systems. The proposed approaches for the linear and nonlinear cases show comparable performance with classical control techniques while being completely model independent. Finally, we study the problem of robust resource allocation with momentum using dynamical systems. We propose a class of time-varying differential equations with momentum that achieve acceleration and preserve most of the asymptotic properties of its time-invariant counterpart. Since time-varying dynamics with momentum in continuous-time usually lack of structural robustness properties, we present a hybrid regularization that induces the property of uniform asymptotic stability in the system. We show this by using the invariance principle for well-posed hybrid dynamical systems, and we establish the existence of strictly positive margins of robustness with respect to arbitrarily small disturbances. We illustrate our results via numerical simulations."--Tomado del Formato de Documento de Grado.
publishDate	2019
dc.date.issued.es_CO.fl_str_mv	2019
dc.date.accessioned.none.fl_str_mv	2020-09-03T14:30:25Z
dc.date.available.none.fl_str_mv	2020-09-03T14:30:25Z
dc.type.spa.fl_str_mv	Trabajo de grado - Maestría
dc.type.coarversion.fl_str_mv	http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.driver.spa.fl_str_mv	info:eu-repo/semantics/masterThesis
dc.type.content.spa.fl_str_mv	Text
dc.type.redcol.spa.fl_str_mv	http://purl.org/redcol/resource_type/TM
dc.identifier.uri.none.fl_str_mv	http://hdl.handle.net/1992/44007
dc.identifier.pdf.none.fl_str_mv	u827241.pdf
dc.identifier.instname.spa.fl_str_mv	instname:Universidad de los Andes
dc.identifier.reponame.spa.fl_str_mv	reponame:Repositorio Institucional Séneca
dc.identifier.repourl.spa.fl_str_mv	repourl:https://repositorio.uniandes.edu.co/
url	http://hdl.handle.net/1992/44007
identifier_str_mv	u827241.pdf instname:Universidad de los Andes reponame:Repositorio Institucional Séneca repourl:https://repositorio.uniandes.edu.co/
dc.language.iso.es_CO.fl_str_mv	eng
language	eng
dc.rights.uri.*.fl_str_mv	https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdf
dc.rights.accessrights.spa.fl_str_mv	info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv	http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv	https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdf http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv	openAccess
dc.format.extent.es_CO.fl_str_mv	62 hojas
dc.format.mimetype.es_CO.fl_str_mv	application/pdf
dc.publisher.es_CO.fl_str_mv	Uniandes
dc.publisher.program.es_CO.fl_str_mv	Maestría en Ingeniería Electrónica y de Computadores
dc.publisher.faculty.es_CO.fl_str_mv	Facultad de Ingeniería
dc.publisher.department.es_CO.fl_str_mv	Departamento de Ingeniería Eléctrica y Electrónica
dc.source.es_CO.fl_str_mv	instname:Universidad de los Andes reponame:Repositorio Institucional Séneca
instname_str	Universidad de los Andes
institution	Universidad de los Andes
reponame_str	Repositorio Institucional Séneca
collection	Repositorio Institucional Séneca
bitstream.url.fl_str_mv	https://repositorio.uniandes.edu.co/bitstreams/3f15e6f4-31b6-467b-a446-04f30a1e105b/download https://repositorio.uniandes.edu.co/bitstreams/bcb1e80b-94e1-4732-a892-73af3749a352/download https://repositorio.uniandes.edu.co/bitstreams/aeaf128e-7f58-496b-94d4-1a8e148c996b/download
bitstream.checksum.fl_str_mv	a301f198fb225bdb2b23d8e51958d6b6 94730cacc8986f9cf5c2b07d35d9c6ec 451f15ab2c5603bb4b6ac914d406beda
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositorio institucional Séneca
repository.mail.fl_str_mv	adminrepositorio@uniandes.edu.co
_version_	1837005085384638464
spelling	Al consultar y hacer uso de este recurso, está aceptando las condiciones de uso establecidas por los autores.https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdfinfo:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Quijano Silva, Nicanorvirtual::5323-1Ochoa Tamayo, Daniel Esteband370f1f9-2cb8-4645-8c3c-1a13ee90bd9a5002020-09-03T14:30:25Z2020-09-03T14:30:25Z2019http://hdl.handle.net/1992/44007u827241.pdfinstname:Universidad de los Andesreponame:Repositorio Institucional Sénecarepourl:https://repositorio.uniandes.edu.co/"We study the control of water-tank systems with linear and nonlinear coupled dynamics. As a way to alleviate the design difficulties and avoid modeling simplifications, we design centralized and decentralized deep reinforcement learning (RL) strategies to control interconnected linear and nonlinear water-tank systems relevant for industrial process control. For the linear variant, we propose a hierarchical control strategy to solve the optimal drainage problem in open-channel systems by combining an optimization technique known as minimum scaled consensus control (MSCC) with the deep deterministic policy gradient (DDPG) algorithm. On the other case, for the nonlinear dynamics we use actor-critic structures for the DDPG and the proximal policy optimization (PPO) algorithm and propose a variant called the multi-critic architecture, which allows the addition of prior knowledge on dominant input-output couplings of multi-input multi-output systems. The proposed approaches for the linear and nonlinear cases show comparable performance with classical control techniques while being completely model independent. Finally, we study the problem of robust resource allocation with momentum using dynamical systems. We propose a class of time-varying differential equations with momentum that achieve acceleration and preserve most of the asymptotic properties of its time-invariant counterpart. Since time-varying dynamics with momentum in continuous-time usually lack of structural robustness properties, we present a hybrid regularization that induces the property of uniform asymptotic stability in the system. We show this by using the invariance principle for well-posed hybrid dynamical systems, and we establish the existence of strictly positive margins of robustness with respect to arbitrarily small disturbances. We illustrate our results via numerical simulations."--Tomado del Formato de Documento de Grado."Estudiamos el control de sistemas de agua con dinámicas lineales y no lineales. Como una forma de aliviar las dificultades de diseño y evitar las simplificaciones en el modelado, diseñamos estrategias centralizadas y descentralizadas de aprendizaje por refuerzo (RL) profundo para el control de de sistemas interconectados de tanques de agua con dinámicas lineales y no lineales, relevantes para el control de procesos industriales. Para la variante lineal, proponemos una estrategia de control jerárquico que resuelve el problema de drenaje óptimo en sistemas de canal abierto combinando una técnica de optimización conocida como minimum scaled consensus control (MSCC) con el algoritmo de deep deterministic policy gradients (DDPG). En el otro caso, para las dinámicas no lineales utilizamos estructuras de actor-crítico con el algoritmo DDPG y el algoritmo de proximal policy optimization (PPO) en adición de proponer una variante con el nombre de arquitectura multi-crítica. Los esquemas propuestos muestran un rendimiento comparable con las técnicas de control clásicas aún sin la inclusión explícita de un modelo del sistema. Finalmente, estudiamos el problema de la asignación robusta de recursos con momentum usando sistemas dinámicos. Proponemos una clase de ecuaciones diferenciales con momentum y variantes en el tiempo que logran aceleración y preservan la mayoría de las propiedades asintóticas de su contraparte invariante en el tiempo. Ya que las dinámicas variantes en el tiempo con momento usualmente no poseen propiedades de robustez estructural, presentamos un mecanismo de regularización híbrida que induce la propiedad de estabilidad asintótica uniforme en el sistema dinámico. Mostramos esto usando el principio de invariancia para sistemas dinámicos híbridos well-posed, y establecemos la existencia de márgenes de robustez estrictamente positivos con respecto a perturbaciones arbitrariamente pequeñas. Ilustramos nuestros resultados mediante simulaciones numéricas."--Tomado del Formato de Documento de Grado.Magíster en Ingeniería Electrónica y de ComputadoresMaestría62 hojasapplication/pdfengUniandesMaestría en Ingeniería Electrónica y de ComputadoresFacultad de IngenieríaDepartamento de Ingeniería Eléctrica y Electrónicainstname:Universidad de los Andesreponame:Repositorio Institucional SénecaDeep reinforcement learning and population dynamics for water systems controlTrabajo de grado - Maestríainfo:eu-repo/semantics/masterThesishttp://purl.org/coar/version/c_970fb48d4fbd8a85Texthttp://purl.org/redcol/resource_type/TMSistemas de control - Aplicaciones industriales - InvestigacionesCanales (Ingeniería hidráulica) - Control - InvestigacionesAprendizaje por refuerzo (Aprendizaje automático) - Aplicaciones - InvestigacionesIngenieríaPublicationhttps://scholar.google.es/citations?user=xu0jdYAAAAAJvirtual::5323-10000-0002-8688-3195virtual::5323-1https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0000849669virtual::5323-1698e35fc-6e9e-4c84-8960-ae30da9bc64avirtual::5323-1698e35fc-6e9e-4c84-8960-ae30da9bc64avirtual::5323-1THUMBNAILu827241.pdf.jpgu827241.pdf.jpgIM Thumbnailimage/jpeg10308https://repositorio.uniandes.edu.co/bitstreams/3f15e6f4-31b6-467b-a446-04f30a1e105b/downloada301f198fb225bdb2b23d8e51958d6b6MD55TEXTu827241.pdf.txtu827241.pdf.txtExtracted texttext/plain98823https://repositorio.uniandes.edu.co/bitstreams/bcb1e80b-94e1-4732-a892-73af3749a352/download94730cacc8986f9cf5c2b07d35d9c6ecMD54ORIGINALu827241.pdfapplication/pdf2961259https://repositorio.uniandes.edu.co/bitstreams/aeaf128e-7f58-496b-94d4-1a8e148c996b/download451f15ab2c5603bb4b6ac914d406bedaMD511992/44007oai:repositorio.uniandes.edu.co:1992/440072024-03-13 12:54:51.343https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdfopen.accesshttps://repositorio.uniandes.edu.coRepositorio institucional Sénecaadminrepositorio@uniandes.edu.co

Deep reinforcement learning and population dynamics for water systems control

Publicaciones similares