Efficient Point Cloud Analytics on Edge Devices

Kunxiong Zhu, Zhenlin Wu, Hongyuan Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Point clouds are crucial for 3D geometry representation, and vital in applications like autonomous driving and augmented reality. Despite advancements in deep learning-based analytics, their high computational cost limits deployment on edge devices with constrained resources. To this end, we analyze PointNet++, a leading point cloud analytics framework, identifying two major bottlenecks: 1) GPU is underutilized due to limited parallelism and excessive kernel launches in the sampling stage and voting stage, and 2) irregular memory accesses in the grouping stage. To address these, we propose parallel sampling and voting to enhance GPU utilization and fuse subroutines in grouping to improve memory efficiency. Experimental results demonstrate that our optimizations result in significant speedup (up to 5.0 ×, 3.2 × on average) across various point cloud workloads on edge devices.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 30th International Conference on Parallel and Distributed Systems, ICPADS 2024
Pages382-389
Number of pages8
ISBN (Electronic)9798331515966
DOIs
StatePublished - 2024
Event30th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2024 - Belgrade, Serbia
Duration: 10 Oct 202414 Oct 2024

Publication series

NameProceedings of the International Conference on Parallel and Distributed Systems - ICPADS
ISSN (Print)1521-9097

Conference

Conference30th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2024
Country/TerritorySerbia
CityBelgrade
Period10/10/2414/10/24

Fingerprint

Dive into the research topics of 'Efficient Point Cloud Analytics on Edge Devices'. Together they form a unique fingerprint.

Cite this