An Efficient On-Device Federated Learning System Through the Interplay of Client Selection and Batch Size With Watermarked Data

  • Tao Ling
  • , Siping Shi
  • , Hao Wang
  • , Chuang Hu
  • , Dan Wang

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Federated Learning (FL) enables edge devices to collaboratively train a global model using local data. However, the increasing prevalence of watermarks in datasets presents a new challenge to efficient FL. While watermarks assert data ownership and copyright, they introduce complexities that can lead to shortcut learning problems and mislead utility measurements for client selection. These issues are further exacerbated by batch size variations in efficient FL frameworks, ultimately undermining their time-to-accuracy performance. We introduce LotusFL, an FL system designed to address the challenges posed by watermarked datasets in efficient FL. Specifically, it tackles the increased time-to-accuracy due to erroneous client selection and the accuracy degradation observed with larger batch sizes. LotusFL first estimates the characteristics of watermarks through statistical estimation and then adjusts the batch size using this estimated watermark information to balance the negative impact of the watermark against device idle waiting time. Additionally, its client selection mechanism, based on historical information, avoids the misleading utility signals from watermarks. This mechanism, working in conjunction with batch size adjustment, aims to accurately predict device runtime and identify potentially valuable devices. We evaluated LotusFL through a real-world deployment on 40 edge devices. Compared to state-of-the-art efficient FL frameworks, LotusFL achieves superior performance, enhancing accuracy by up to 8.2% and reducing training time by 1.97×.

Original languageEnglish
Pages (from-to)11480-11493
Number of pages14
JournalIEEE Transactions on Mobile Computing
Volume24
Issue number11
DOIs
StatePublished - 2025

Keywords

  • Federated learning
  • batch size
  • client selection
  • data and system heterogeneity
  • machine learning systems
  • watermark

Fingerprint

Dive into the research topics of 'An Efficient On-Device Federated Learning System Through the Interplay of Client Selection and Batch Size With Watermarked Data'. Together they form a unique fingerprint.

Cite this