Lube: Mitigating bottlenecks in wide area data analytics

Hao Wang, Baochun Li

Research output: Contribution to conferencePaperpeer-review

14 Scopus citations

Abstract

Over the past decade, we have witnessed exponential growth in the density (petabyte-level) and breadth (across geo-distributed datacenters) of data distribution. It becomes increasingly challenging but imperative to minimize the response times of data analytic queries over multiple geo-distributed datacenters. However, existing scheduling-based solutions have largely been motivated by pre-established mantras (e.g., bandwidth scarcity). Without data-driven insights into performance bottlenecks at runtime, schedulers might blindly assign tasks to workers that are suffering from unidentified bottlenecks. In this paper, we present Lube, a system framework that minimizes query response times by detecting and mitigating bottlenecks at runtime. Lube monitors geo-distributed data analytic queries in real-time, detects potential bottlenecks, and mitigates them with a bottleneck-aware scheduling policy. Our preliminary experiments on a real-world prototype across Amazon EC2 regions have shown that Lube can detect bottlenecks with over 90% accuracy, and reduce the median query response time by up to 33% compared to Spark’s built-in locality-based scheduler.

Original languageEnglish
StatePublished - 2017
Event9th USENIX Workshop on Hot Topics in Cloud Computing, HotCloud 2017 - Santa Clara, United States
Duration: 10 Jul 201711 Jul 2017

Conference

Conference9th USENIX Workshop on Hot Topics in Cloud Computing, HotCloud 2017
Country/TerritoryUnited States
CitySanta Clara
Period10/07/1711/07/17

Fingerprint

Dive into the research topics of 'Lube: Mitigating bottlenecks in wide area data analytics'. Together they form a unique fingerprint.

Cite this