TY - JOUR
T1 - VESTA
T2 - A Secure and Efficient FHE-based Three-Party Vectorized Evaluation System for Tree Aggregation Models
AU - Zhao, Haosong
AU - Huang, Junhao
AU - Chen, Zihang
AU - Zhu, Kunxiong
AU - Chen, Donglong
AU - Ji, Zhuoran
AU - Liu, Hongyuan
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/3/10
Y1 - 2025/3/10
N2 - Machine Learning as a Service (MLaaS) platforms simplify the development of machine learning applications across multiple parties. However, the model owner, compute server, and client user may not trust each other, creating a need for privacy-preserving approaches that allow applications to run without revealing proprietary data. In this work, we focus on a widely used classical machine learning model - tree ensembles. While previous efforts have applied Fully Homomorphic Encryption (FHE) to this model, these solutions suffer from slow inference speeds and excessive memory consumption. To address these issues, we propose VESTA, which includes a compiler and a runtime to reduce tree evaluation time and memory usage. VESTA includes two key techniques: First, VESTA precomputes a portion of the expensive FHE operations at compile-time, improving inference speed. Second, VESTA uses a partitioning pass in its compiler to divide the ensemble model into sub-models, enabling task-level parallelism. Comprehensive evaluation shows that VESTA achieves a 2.1× speedup and reduces memory consumption by 59.4% compared to the state-of-the-art.
AB - Machine Learning as a Service (MLaaS) platforms simplify the development of machine learning applications across multiple parties. However, the model owner, compute server, and client user may not trust each other, creating a need for privacy-preserving approaches that allow applications to run without revealing proprietary data. In this work, we focus on a widely used classical machine learning model - tree ensembles. While previous efforts have applied Fully Homomorphic Encryption (FHE) to this model, these solutions suffer from slow inference speeds and excessive memory consumption. To address these issues, we propose VESTA, which includes a compiler and a runtime to reduce tree evaluation time and memory usage. VESTA includes two key techniques: First, VESTA precomputes a portion of the expensive FHE operations at compile-time, improving inference speed. Second, VESTA uses a partitioning pass in its compiler to divide the ensemble model into sub-models, enabling task-level parallelism. Comprehensive evaluation shows that VESTA achieves a 2.1× speedup and reduces memory consumption by 59.4% compared to the state-of-the-art.
KW - domain-specific compiler
KW - homomorphic encryption
KW - parallelism
KW - tree ensembles
KW - vectorization
UR - http://www.scopus.com/inward/record.url?scp=105003217810&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105003217810&partnerID=8YFLogxK
U2 - 10.1145/3711707
DO - 10.1145/3711707
M3 - Article
AN - SCOPUS:105003217810
SN - 2476-1249
VL - 9
JO - Proceedings of the ACM on Measurement and Analysis of Computing Systems
JF - Proceedings of the ACM on Measurement and Analysis of Computing Systems
IS - 1
M1 - 14
ER -