Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment

Wimpy virtual instances equipped with small numbers of cores and RAM are popular public and private cloud offerings because of their low cost for hosting applications. The challenge is how to run latency-sensitive applications using such instances, which trade off performance for cost. In this study...

Full description

Autores:
Tipo de recurso:
Fecha de publicación:
2017
Institución:
Universidad del Rosario
Repositorio:
Repositorio EdocUR - U. Rosario
Idioma:
eng
OAI Identifier:
oai:repository.urosario.edu.co:10336/22864
Acceso en línea:
https://doi.org/10.1109/ICDCS.2017.231
https://repository.urosario.edu.co/handle/10336/22864
Palabra clave:
Cloning
Cost effectiveness
Costs
Economic and social effects
Application performance
Closed-form analysis
Fine granularity
Multiple queries
Query replications
Sensitive application
Trace driven simulation
VM provisioning
Distributed computer systems
Rights
License
http://purl.org/coar/access_right/c_abf2
id EDOCUR2_f919c84c5faa7840a0d02bd1e53db288
oai_identifier_str oai:repository.urosario.edu.co:10336/22864
network_acronym_str EDOCUR2
network_name_str Repositorio EdocUR - U. Rosario
repository_id_str
spelling Dual Scaling VMs and Queries: Cost-Effective Latency CurtailmentCloningCost effectivenessCostsEconomic and social effectsApplication performanceClosed-form analysisFine granularityMultiple queriesQuery replicationsSensitive applicationTrace driven simulationVM provisioningDistributed computer systemsWimpy virtual instances equipped with small numbers of cores and RAM are popular public and private cloud offerings because of their low cost for hosting applications. The challenge is how to run latency-sensitive applications using such instances, which trade off performance for cost. In this study, we analytically and experimentally show that simultaneously scaling resources at coarse granularity and workloads, i.e., submitting multiple query clones to different servers, at fine granularity can overcome the performance disadvantages of wimpy VM instances and achieve stringent latency targets that are even lower than the average execution times of wimpy servers. To such an end, we first derive a closed-form analysis for the latency under any given VM provisioning and query replication level, considering cloning policies that can (not) terminate outstanding clones with (without) an overhead. Validated on trace-driven simulations, our analysis is able to accurately predict the latency and efficiently search for the optimal number of VMs and clones. Secondly, we develop a dual elastic scaler, DuoScale, that dynamically scales VMs and clones according to the workload dynamics so as to achieve the target latency in a cost-effective manner. The effectiveness of DuoScale lies on the observation that the application performance only scales sub-linearly with increasing vertical or horizontal resource provisioning, i.e., resources per VM or number of VMs. We evaluate DuoScale against VM-only scaling strategies via extensive trace-driven simulations as well as experimental results on a cloud test-bed. Our results show that DuoScale is able to achieve the stringent target latency by using clones on wimpy VMs with cost savings up to 50%, compared to scaling brawny VMs that have better performance at a higher unit cost. © 2017 IEEE.Institute of Electrical and Electronics Engineers Inc.20172020-05-25T23:58:26Zinfo:eu-repo/semantics/conferenceObjecthttp://purl.org/coar/version/c_970fb48d4fbd8a85http://purl.org/coar/resource_type/c_c94fapplication/pdfhttps://doi.org/10.1109/ICDCS.2017.2312016https://repository.urosario.edu.co/handle/10336/22864instname:Universidad del Rosarioreponame:Repositorio Institucional EdocURenghttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85027252830&doi=10.1109%2fICDCS.2017.231&partnerID=40&md5=8d14657ed2f91f9a62f278a6c681a59chttp://purl.org/coar/access_right/c_abf2Pérez, Juan F.Birke R.Bjorkqvist M.Chen L.Y.oai:repository.urosario.edu.co:10336/228642022-05-02T07:37:17Z
dc.title.none.fl_str_mv Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
title Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
spellingShingle Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
Cloning
Cost effectiveness
Costs
Economic and social effects
Application performance
Closed-form analysis
Fine granularity
Multiple queries
Query replications
Sensitive application
Trace driven simulation
VM provisioning
Distributed computer systems
title_short Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
title_full Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
title_fullStr Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
title_full_unstemmed Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
title_sort Dual Scaling VMs and Queries: Cost-Effective Latency Curtailment
dc.subject.none.fl_str_mv Cloning
Cost effectiveness
Costs
Economic and social effects
Application performance
Closed-form analysis
Fine granularity
Multiple queries
Query replications
Sensitive application
Trace driven simulation
VM provisioning
Distributed computer systems
topic Cloning
Cost effectiveness
Costs
Economic and social effects
Application performance
Closed-form analysis
Fine granularity
Multiple queries
Query replications
Sensitive application
Trace driven simulation
VM provisioning
Distributed computer systems
description Wimpy virtual instances equipped with small numbers of cores and RAM are popular public and private cloud offerings because of their low cost for hosting applications. The challenge is how to run latency-sensitive applications using such instances, which trade off performance for cost. In this study, we analytically and experimentally show that simultaneously scaling resources at coarse granularity and workloads, i.e., submitting multiple query clones to different servers, at fine granularity can overcome the performance disadvantages of wimpy VM instances and achieve stringent latency targets that are even lower than the average execution times of wimpy servers. To such an end, we first derive a closed-form analysis for the latency under any given VM provisioning and query replication level, considering cloning policies that can (not) terminate outstanding clones with (without) an overhead. Validated on trace-driven simulations, our analysis is able to accurately predict the latency and efficiently search for the optimal number of VMs and clones. Secondly, we develop a dual elastic scaler, DuoScale, that dynamically scales VMs and clones according to the workload dynamics so as to achieve the target latency in a cost-effective manner. The effectiveness of DuoScale lies on the observation that the application performance only scales sub-linearly with increasing vertical or horizontal resource provisioning, i.e., resources per VM or number of VMs. We evaluate DuoScale against VM-only scaling strategies via extensive trace-driven simulations as well as experimental results on a cloud test-bed. Our results show that DuoScale is able to achieve the stringent target latency by using clones on wimpy VMs with cost savings up to 50%, compared to scaling brawny VMs that have better performance at a higher unit cost. © 2017 IEEE.
publishDate 2017
dc.date.none.fl_str_mv 2017
2020-05-25T23:58:26Z
dc.type.none.fl_str_mv info:eu-repo/semantics/conferenceObject
dc.type.coarversion.fl_str_mv http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.coar.fl_str_mv http://purl.org/coar/resource_type/c_c94f
dc.identifier.none.fl_str_mv https://doi.org/10.1109/ICDCS.2017.231
2016
https://repository.urosario.edu.co/handle/10336/22864
url https://doi.org/10.1109/ICDCS.2017.231
https://repository.urosario.edu.co/handle/10336/22864
identifier_str_mv 2016
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://www.scopus.com/inward/record.uri?eid=2-s2.0-85027252830&doi=10.1109%2fICDCS.2017.231&partnerID=40&md5=8d14657ed2f91f9a62f278a6c681a59c
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv http://purl.org/coar/access_right/c_abf2
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers Inc.
publisher.none.fl_str_mv Institute of Electrical and Electronics Engineers Inc.
dc.source.none.fl_str_mv instname:Universidad del Rosario
reponame:Repositorio Institucional EdocUR
instname_str Universidad del Rosario
institution Universidad del Rosario
reponame_str Repositorio Institucional EdocUR
collection Repositorio Institucional EdocUR
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1803710545748033536