Hu, C, Zhu, J, Yang, R et al. (6 more authors) (2020) TOPOSCH: Latency-Aware Scheduling Based on Critical Path Analysis on Shared YARN Clusters. In: Procedings of 2020 IEEE 13th International Conference on Cloud Computing (CLOUD). 2020 IEEE 13th International Conference on Cloud Computing (CLOUD), 19-23 Oct 2020, Beijing, China. IEEE , pp. 619-627. ISBN 978-1-7281-8780-8
Abstract
Balancing resource utilization and application QoS is a long-standing research topic in cluster resource management. Big data YARN clusters need to co-schedule diverse workloads on shared resources including batch processing jobs, streaming jobs, and other long-running applications such as web services, database services, etc. Current resource managers are only responsible for resource allocation among applications/jobs but completely unaware of runtime QoS requirements of interactive and latency-sensitive applications. Prior works to maximize the QoS of monolithic applications ignore inherent dependencies and temporal-spatio performance variability of components, characteristics of distributed applications primarily driven by microservices. In this paper, we present Toposch, a new resource management system to adaptively co-locate batch tasks and microservices by harvesting runtime latency. In particular, Toposch tracks full footprints of every request across microservices over time. A latency graph is periodically generated for identifying victim microservices through an end-to-end latency critical path analysis. We then exploit per-microservice and per-node risk assessment to gauge the visible resources to the capacity scheduler in YARN. Execution of batch tasks are adaptively throttled or delayed, thereby avoiding latency increase due to node over-saturation. TOPOSCH is integrated with YARN and experiments show that the latency of DLRAs can be reduced by up to 39.8% against the default capacity scheduling in YARN.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Keywords: | latency sensitivity, workload co-location, microservice, cluster management |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 09 Nov 2020 11:28 |
Last Modified: | 07 Aug 2023 13:24 |
Status: | Published |
Publisher: | IEEE |
Identification Number: | 10.1109/CLOUD49709.2020.00091 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:167724 |