TY - GEN
T1 - Online matrix completion for signed link prediction
AU - Wang, Jing
AU - Shen, Jie
AU - Li, Ping
AU - Xu, Huan
N1 - Publisher Copyright:
© 2017 ACM.
PY - 2017/2/2
Y1 - 2017/2/2
N2 - This work studies the binary matrix completion problem underlying a large body of real-world applications such as signed link prediction and information propagation. That is, each entry of the matrix indicates a binary preference such as "like" or "dislike", "trust" or "distrust". However, the performance of existing matrix completion methods may be hindered owing to three practical challenges: 1) the observed data are with binary label (i.e., not real value); 2) the data are typically sampled non-uniformly (i.e., positive links dominate the negative ones) and 3) a network may have a huge volume of data (i.e., memory and computational issue). In order to remedy these problems, we propose a novel framework which i) maximizes the resemblance between predicted and observed matrices as well as penalizing the logistic loss to fit the binary data to produce binary estimates; ii) constrains the matrix max-norm to handle nonuniformness and iii) presents online optimization technique, hence mitigating the memory cost. Extensive experiments performed on four large-scale datasets with up to hundreds of thousands of users demonstrate the superiority of our framework over the state-of-the-art matrix completion based methods and popular link prediction approaches.
AB - This work studies the binary matrix completion problem underlying a large body of real-world applications such as signed link prediction and information propagation. That is, each entry of the matrix indicates a binary preference such as "like" or "dislike", "trust" or "distrust". However, the performance of existing matrix completion methods may be hindered owing to three practical challenges: 1) the observed data are with binary label (i.e., not real value); 2) the data are typically sampled non-uniformly (i.e., positive links dominate the negative ones) and 3) a network may have a huge volume of data (i.e., memory and computational issue). In order to remedy these problems, we propose a novel framework which i) maximizes the resemblance between predicted and observed matrices as well as penalizing the logistic loss to fit the binary data to produce binary estimates; ii) constrains the matrix max-norm to handle nonuniformness and iii) presents online optimization technique, hence mitigating the memory cost. Extensive experiments performed on four large-scale datasets with up to hundreds of thousands of users demonstrate the superiority of our framework over the state-of-the-art matrix completion based methods and popular link prediction approaches.
KW - Link prediction
KW - Low-rank
KW - Matrix completion
KW - Online learning
UR - http://www.scopus.com/inward/record.url?scp=85015300186&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85015300186&partnerID=8YFLogxK
U2 - 10.1145/3018661.3018681
DO - 10.1145/3018661.3018681
M3 - Conference contribution
AN - SCOPUS:85015300186
T3 - WSDM 2017 - Proceedings of the 10th ACM International Conference on Web Search and Data Mining
SP - 475
EP - 484
BT - WSDM 2017 - Proceedings of the 10th ACM International Conference on Web Search and Data Mining
T2 - 10th ACM International Conference on Web Search and Data Mining, WSDM 2017
Y2 - 6 February 2017 through 10 February 2017
ER -