TY - GEN
T1 - Efficient Point Cloud Analytics on Edge Devices
AU - Zhu, Kunxiong
AU - Wu, Zhenlin
AU - Liu, Hongyuan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Point clouds are crucial for 3D geometry representation, and vital in applications like autonomous driving and augmented reality. Despite advancements in deep learning-based analytics, their high computational cost limits deployment on edge devices with constrained resources. To this end, we analyze PointNet++, a leading point cloud analytics framework, identifying two major bottlenecks: 1) GPU is underutilized due to limited parallelism and excessive kernel launches in the sampling stage and voting stage, and 2) irregular memory accesses in the grouping stage. To address these, we propose parallel sampling and voting to enhance GPU utilization and fuse subroutines in grouping to improve memory efficiency. Experimental results demonstrate that our optimizations result in significant speedup (up to 5.0 ×, 3.2 × on average) across various point cloud workloads on edge devices.
AB - Point clouds are crucial for 3D geometry representation, and vital in applications like autonomous driving and augmented reality. Despite advancements in deep learning-based analytics, their high computational cost limits deployment on edge devices with constrained resources. To this end, we analyze PointNet++, a leading point cloud analytics framework, identifying two major bottlenecks: 1) GPU is underutilized due to limited parallelism and excessive kernel launches in the sampling stage and voting stage, and 2) irregular memory accesses in the grouping stage. To address these, we propose parallel sampling and voting to enhance GPU utilization and fuse subroutines in grouping to improve memory efficiency. Experimental results demonstrate that our optimizations result in significant speedup (up to 5.0 ×, 3.2 × on average) across various point cloud workloads on edge devices.
UR - http://www.scopus.com/inward/record.url?scp=85212507970&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85212507970&partnerID=8YFLogxK
U2 - 10.1109/ICPADS63350.2024.00057
DO - 10.1109/ICPADS63350.2024.00057
M3 - Conference contribution
AN - SCOPUS:85212507970
T3 - Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
SP - 382
EP - 389
BT - Proceedings - 2024 IEEE 30th International Conference on Parallel and Distributed Systems, ICPADS 2024
T2 - 30th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2024
Y2 - 10 October 2024 through 14 October 2024
ER -