DRLCap: Runtime GPU Frequency Capping with Deep Reinforcement Learning

Abstract

Power and energy consumption is the limiting factor of modern computing systems. As the GPU becomes a mainstream computing device, power management for GPUs becomes increasingly important. Existing work focuses on GPU kernel-level power management, which is hard to port because it depends on architecture-specific considerations. We present DRLCap, a general runtime power management framework intended to support power management across various GPU architectures. It periodically monitors system-level information to dynamically detect program phase changes and to model the workload and GPU system behavior. Removing kernel-specific constraints enhances adaptability and responsiveness. The framework leverages dynamic GPU frequency capping, the most widely used power knob, to control power consumption. DRLCap employs deep reinforcement learning (DRL) to adapt to program phase changes by automatically adjusting its power policy through online learning, aiming to reduce GPU power consumption without significantly compromising application performance. We evaluate DRLCap on three NVIDIA GPU architectures and one AMD GPU architecture. Experimental results show that DRLCap improves on prior GPU power optimization strategies by a large margin. On average, it reduces GPU energy consumption by 22% with less than 3% performance slowdown on NVIDIA GPUs. This translates to a 20% improvement in energy efficiency, measured by the energy-delay product (EDP), over the NVIDIA default GPU power management strategy. For the AMD GPU architecture, DRLCap reduces energy consumption by 10%, on average, with a 4% performance loss, and improves energy efficiency by 8%.
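As a rough illustration of how the energy-delay product ties the reported energy saving and slowdown together, the sketch below multiplies normalized energy by normalized runtime. The function names are hypothetical and not taken from the paper, and the calculation treats the reported averages as if they came from a single run. The paper's figures are averages over benchmarks, so multiplying the averages only approximates the reported EDP improvement.

```python
def edp(energy: float, delay: float) -> float:
    """Energy-delay product; lower values mean better energy efficiency."""
    return energy * delay


def edp_improvement(energy_saving: float, slowdown: float) -> float:
    """Fractional EDP reduction against a baseline normalized to energy=1, delay=1.

    energy_saving and slowdown are fractions, e.g. 0.22 for 22%.
    """
    baseline = edp(1.0, 1.0)
    capped = edp(1.0 - energy_saving, 1.0 + slowdown)
    return (baseline - capped) / baseline


# Assumption: plug in the abstract's NVIDIA averages (22% energy saving,
# 3% slowdown as an upper bound) as if they described one run.
nvidia = edp_improvement(energy_saving=0.22, slowdown=0.03)
print(f"Approximate EDP improvement: {nvidia:.1%}")
```

With these inputs the estimate comes out a little under 20%, in line with the figure quoted for NVIDIA GPUs.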

Metadata

Item Type:	Article
Authors/Creators:	Wang, Y. (https://orcid.org/0000-0003-3298-5134); Hao, M. (https://orcid.org/0000-0003-0043-4370); He, H. (https://orcid.org/0000-0002-6494-775X); Zhang, W. (https://orcid.org/0000-0003-4783-876X); Tang, Q.; Sun, X.; Wang, Z. (https://orcid.org/0000-0001-6157-0662)
Copyright, Publisher and Additional Information:	© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Keywords:	GPUs, deep reinforcement learning, power and energy optimization, GPU power optimization
Dates:	Published (online): 6 February 2024
Institution:	The University of Leeds
Academic Units:	The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds)
Depositing User:	Symplectic Publications
Date Deposited:	13 Feb 2024 10:41
Last Modified:	13 Feb 2024 10:41
Status:	Published
Publisher:	Institute of Electrical and Electronics Engineers (IEEE)
Identification Number:	10.1109/tsusc.2024.3362697
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:209061
