TY - JOUR
T1 - A distributed algorithm for solving large-scale p-median problems using expectation maximization
AU - Gwalani, Harsha
AU - Helsing, Joseph
AU - Alshammari, Sultanah M.
AU - Tiwari, Chetan
AU - Mikler, Armin R.
N1 - Publisher Copyright:
© 2024 Gwalani et al.
PY - 2024
Y1 - 2024
N2 - The p-median problem selects p source locations to serve n destinations such that the average distance between the destinations and corresponding sources is minimized. It is a well-studied NP-hard combinatorial optimization problem with many existing heuristic solutions, however, existing algorithms are not scalable for large-scale problems. The fast interchange (FI) heuristic which yields results close to the optimal solution with respect to the objective function value becomes suboptimal with respect to time requirements for large-scale problems. We present a novel distributed divide and conquer algorithm, EM-FI, to solve large-scale p-median problems quickly even with limited computing resources. The algorithm identifies the existing spatial clusters of the destination locations using expectation maximization (EM) and solves them as independent p-median problems using integer programming or FI concurrently. The proposed algorithm showed an order of magnitude improvement in time without the loss of quality in terms of the objective function value on synthetic and real datasets.
AB - The p-median problem selects p source locations to serve n destinations such that the average distance between the destinations and corresponding sources is minimized. It is a well-studied NP-hard combinatorial optimization problem with many existing heuristic solutions, however, existing algorithms are not scalable for large-scale problems. The fast interchange (FI) heuristic which yields results close to the optimal solution with respect to the objective function value becomes suboptimal with respect to time requirements for large-scale problems. We present a novel distributed divide and conquer algorithm, EM-FI, to solve large-scale p-median problems quickly even with limited computing resources. The algorithm identifies the existing spatial clusters of the destination locations using expectation maximization (EM) and solves them as independent p-median problems using integer programming or FI concurrently. The proposed algorithm showed an order of magnitude improvement in time without the loss of quality in terms of the objective function value on synthetic and real datasets.
KW - Distributed algorithms
KW - Heuristic search
KW - Location allocation
KW - P-median problem
KW - Parallel computing
KW - Spatial data mining
UR - http://www.scopus.com/inward/record.url?scp=85210743563&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85210743563&partnerID=8YFLogxK
U2 - 10.7717/peerj-cs.2446
DO - 10.7717/peerj-cs.2446
M3 - Article
AN - SCOPUS:85210743563
VL - 10
SP - 1
EP - 24
JO - PeerJ Computer Science
JF - PeerJ Computer Science
M1 - e2446
ER -