TY - JOUR
T1 - Freyr+
T2 - Harvesting Idle Resources in Serverless Computing via Deep Reinforcement Learning
AU - Yu, Hanfei
AU - Wang, Hao
AU - Li, Jian
AU - Yuan, Xu
AU - Park, Seung Jong
N1 - Publisher Copyright:
©2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Serverless computing has revolutionized online service development and deployment with ease-to-use operations, auto-scaling, fine-grained resource allocation, and pay-as-you-go pricing. However, a gap remains in configuring serverless functions - the actual resource consumption may vary due to function types, dependencies, and input data sizes, thus mismatching the static resource configuration by users. Dynamic resource consumption against static configuration may lead to either poor function execution performance or low utilization. This paper proposes Freyr+, a novel resource manager (RM) that dynamically harvests idle resources from over-provisioned functions to accelerate under-provisioned functions for serverless platforms. Freyr+ monitors each function's resource utilization in real-time and detects the mismatches between user configuration and actual resource consumption. We design deep reinforcement learning (DRL) algorithms with attention-enhanced embedding, incremental learning, and safeguard mechanism for Freyr+ to harvest idle resources safely and accelerate functions efficiently. We have implemented and deployed a Freyr+ prototype in a 13-node Apache OpenWhisk cluster using AWS EC2. Freyr+ is evaluated on both large-scale simulation and real-world testbed. Experimental results show that Freyr+ harvests 38% of function invocations' idle resources and accelerates 39% of invocations using harvested resources. Freyr+ reduces the 99th-percentile function response latency by 26% compared to the baseline RMs.
AB - Serverless computing has revolutionized online service development and deployment with ease-to-use operations, auto-scaling, fine-grained resource allocation, and pay-as-you-go pricing. However, a gap remains in configuring serverless functions - the actual resource consumption may vary due to function types, dependencies, and input data sizes, thus mismatching the static resource configuration by users. Dynamic resource consumption against static configuration may lead to either poor function execution performance or low utilization. This paper proposes Freyr+, a novel resource manager (RM) that dynamically harvests idle resources from over-provisioned functions to accelerate under-provisioned functions for serverless platforms. Freyr+ monitors each function's resource utilization in real-time and detects the mismatches between user configuration and actual resource consumption. We design deep reinforcement learning (DRL) algorithms with attention-enhanced embedding, incremental learning, and safeguard mechanism for Freyr+ to harvest idle resources safely and accelerate functions efficiently. We have implemented and deployed a Freyr+ prototype in a 13-node Apache OpenWhisk cluster using AWS EC2. Freyr+ is evaluated on both large-scale simulation and real-world testbed. Experimental results show that Freyr+ harvests 38% of function invocations' idle resources and accelerates 39% of invocations using harvested resources. Freyr+ reduces the 99th-percentile function response latency by 26% compared to the baseline RMs.
KW - Attention
KW - incremental learning
KW - reinforcement learning
KW - resource harvesting
KW - serverless computing
UR - http://www.scopus.com/inward/record.url?scp=85204485761&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85204485761&partnerID=8YFLogxK
U2 - 10.1109/TPDS.2024.3462294
DO - 10.1109/TPDS.2024.3462294
M3 - Article
AN - SCOPUS:85204485761
SN - 1045-9219
VL - 35
SP - 2254
EP - 2269
JO - IEEE Transactions on Parallel and Distributed Systems
JF - IEEE Transactions on Parallel and Distributed Systems
IS - 11
ER -