Self-Healing Distributed Scheduling Platform

Distributed systems require effective mechanisms to manage the reliable provisioning of computational resources from different and distributed providers. Moreover, the dynamic environment that affects the behaviour of such systems and the complexity of these dynamics demand autonomous capabilities t...

Authors:
Frincu, Marc E.
Rouvoy, Romain
Müller, Hausi A.
Petcu, Dana
Villegas Machado, Norha Milena
Resource type:
Part of book
Publication date:
2011
Institution:
Universidad ICESI
Repository:
Repositorio ICESI
Language:
eng
OAI Identifier:
oai:repository.icesi.edu.co:10906/79548
Online access:
http://dx.doi.org/10.1109/CCGrid.2011.23
http://ieeexplore.ieee.org/articleDetails.jsp?arnumber=5948613
http://hdl.handle.net/10906/79548
Keywords:
Computer programming
Systems and communications engineering
Technology platform
Systems engineering
Rights
openAccess
License
https://creativecommons.org/licenses/by-nc-nd/4.0/
id ICESI2_9ef4e1170cda81912715f1bcefb5a5ff
oai_identifier_str oai:repository.icesi.edu.co:10906/79548
network_acronym_str ICESI2
network_name_str Repositorio ICESI
repository_id_str
spelling THE AUTHOR declares that the work covered by this authorization is original and was produced without infringing or usurping the copyright of third parties, and that the work is therefore of the author's exclusive authorship and the author holds title to it.
PARAGRAPH: in the event of a complaint or action by a third party concerning the copyright in the article, booklet, or book in question, THE AUTHOR will assume full responsibility and will defend the rights authorized here; for all purposes, Universidad Icesi acts as a third party in good faith. This authorization permits Universidad Icesi, indefinitely and under the terms established in Law 23 of 1982, Law 44 of 1993, and the laws and case law in force on the matter, to publish the work for educational purposes. Anyone who consults the work, whether in the library or in electronic form, may copy excerpts of the text provided the source is always cited, that is, the title of the work and the author.
dc.title.eng.fl_str_mv Self-Healing Distributed Scheduling Platform
title Self-Healing Distributed Scheduling Platform
spellingShingle Self-Healing Distributed Scheduling Platform
Computer programming
Systems and communications engineering
Technology platform
Systems engineering
title_short Self-Healing Distributed Scheduling Platform
title_full Self-Healing Distributed Scheduling Platform
title_fullStr Self-Healing Distributed Scheduling Platform
title_full_unstemmed Self-Healing Distributed Scheduling Platform
title_sort Self-Healing Distributed Scheduling Platform
dc.creator.fl_str_mv Frincu, Marc E.
Rouvoy, Romain
Müller, Hausi A.
Petcu, Dana
Villegas Machado, Norha Milena
dc.contributor.author.spa.fl_str_mv Frincu, Marc E.
Rouvoy, Romain
Müller, Hausi A.
Petcu, Dana
Villegas Machado, Norha Milena
dc.subject.spa.fl_str_mv Computer programming
Systems and communications engineering
Technology platform
topic Computer programming
Systems and communications engineering
Technology platform
Systems engineering
dc.subject.eng.fl_str_mv Systems engineering
description Distributed systems require effective mechanisms to manage the reliable provisioning of computational resources from different and distributed providers. Moreover, the dynamic environment that affects the behaviour of such systems and the complexity of these dynamics demand autonomous capabilities to ensure the behaviour of distributed scheduling platforms and to achieve business and user objectives. In this paper we propose a self-adaptive distributed scheduling platform composed of multiple agents implemented as intelligent feedback control loops to support policy-based scheduling and expose self-healing capabilities. Our platform leverages distributed scheduling processes by (i) allowing each provider to maintain its own internal scheduling process, and (ii) implementing self-healing capabilities based on agent module recovery. Simulated tests are performed to determine the optimal number of agents to be used in the negotiation phase without affecting the scheduling cost function. Test results on a real-life platform are presented to evaluate recovery times and optimize platform parameters.
publishDate 2011
dc.date.issued.none.fl_str_mv 2011-01-01
dc.date.accessioned.none.fl_str_mv 2016-06-29T02:01:14Z
dc.date.available.none.fl_str_mv 2016-06-29T02:01:14Z
dc.type.eng.fl_str_mv info:eu-repo/semantics/bookPart
dc.type.coar.none.fl_str_mv http://purl.org/coar/resource_type/c_3248
dc.type.local.spa.fl_str_mv Book part
dc.type.version.eng.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.coarversion.none.fl_str_mv http://purl.org/coar/version/c_970fb48d4fbd8a85
format http://purl.org/coar/resource_type/c_3248
status_str publishedVersion
dc.identifier.none.fl_str_mv http://dx.doi.org/10.1109/CCGrid.2011.23
dc.identifier.isbn.none.fl_str_mv 9780769543956
dc.identifier.other.spa.fl_str_mv http://ieeexplore.ieee.org/articleDetails.jsp?arnumber=5948613
dc.identifier.uri.none.fl_str_mv http://hdl.handle.net/10906/79548
dc.identifier.instname.none.fl_str_mv instname: Universidad Icesi
dc.identifier.reponame.none.fl_str_mv reponame: Biblioteca Digital
dc.identifier.repourl.none.fl_str_mv repourl: https://repository.icesi.edu.co/
url http://dx.doi.org/10.1109/CCGrid.2011.23
http://ieeexplore.ieee.org/articleDetails.jsp?arnumber=5948613
http://hdl.handle.net/10906/79548
identifier_str_mv 9780769543956
instname: Universidad Icesi
reponame: Biblioteca Digital
repourl: https://repository.icesi.edu.co/
dc.language.iso.eng.fl_str_mv eng
language eng
dc.relation.ispartof.eng.fl_str_mv 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2011) - 2011
dc.rights.uri.none.fl_str_mv https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.accessrights.eng.fl_str_mv info:eu-repo/semantics/openAccess
dc.rights.license.none.fl_str_mv Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
dc.rights.coar.none.fl_str_mv http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-nd/4.0/
Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv IEEE
dc.publisher.faculty.spa.fl_str_mv Facultad de Ingeniería
dc.publisher.program.spa.fl_str_mv Ingeniería Telemática
dc.publisher.department.spa.fl_str_mv Departamento Académico de Tecnologías de Información y Comunicaciones (TICs)
publisher.none.fl_str_mv IEEE
institution Universidad ICESI
bitstream.url.fl_str_mv http://repository.icesi.edu.co/biblioteca_digital/bitstream/10906/79548/1/villegas_scheduling_platform_2011.pdf
bitstream.checksum.fl_str_mv efc24347d14ffd7f7a074f708d1a2f9e
bitstream.checksumAlgorithm.fl_str_mv MD5
repository.name.fl_str_mv Biblioteca Digital - Universidad icesi
repository.mail.fl_str_mv cdcriollo@icesi.edu.co
_version_ 1814094866997575680