TY - GEN
T1 - Modeling Complex Clickstream Data by Stochastic Models
T2 - 25th International Conference on World Wide Web, WWW 2016
AU - Lakshminarayan, Choudur
AU - Kosuru, Ram
AU - Hsu, Meichun
N1 - Publisher Copyright:
© 2016 International World Wide Web Conference Committee (IW3C2).
PY - 2016/4/11
Y1 - 2016/4/11
N2 - As the website is a primary customer touch-point, millions are spent to gather web data about customer visits. Sadly, the trove of data and corresponding analytics have not lived up to the promise. Current marketing practice relies on ambiguous summary statistics or small-sample usability studies. Idiosyncratic browsing and low conversion (browser-to-buyer) make modeling hard. In this paper, we model browsing patterns (sequence of clicks) via Markov chain theory to predict users' propensity to buy within a session. We focus on model complexity, imputing missing values, data augmentation, and other attendant issues that impact performance. The paper addresses the following aspects; (1) Determine appropriate order of the Markov chain (assess the influence of prior history in prediction), (2) Impute missing transitions by exploiting the inherent link structure in the page sequences, (3) predict the likelihood of a purchase based on variable-length page sequences, and (4) Augment the training set of buyers (which is typically very small: 2% by viewing the page transitions as a graph and exploiting its link structure to improve performance. The cocktail of solutions address important issues in practical digital marketing. Extensive analysis of data applied to a large commercial web-site shows that Markov chain based classifiers are useful predictors of user intent.
AB - As the website is a primary customer touch-point, millions are spent to gather web data about customer visits. Sadly, the trove of data and corresponding analytics have not lived up to the promise. Current marketing practice relies on ambiguous summary statistics or small-sample usability studies. Idiosyncratic browsing and low conversion (browser-to-buyer) make modeling hard. In this paper, we model browsing patterns (sequence of clicks) via Markov chain theory to predict users' propensity to buy within a session. We focus on model complexity, imputing missing values, data augmentation, and other attendant issues that impact performance. The paper addresses the following aspects; (1) Determine appropriate order of the Markov chain (assess the influence of prior history in prediction), (2) Impute missing transitions by exploiting the inherent link structure in the page sequences, (3) predict the likelihood of a purchase based on variable-length page sequences, and (4) Augment the training set of buyers (which is typically very small: 2% by viewing the page transitions as a graph and exploiting its link structure to improve performance. The cocktail of solutions address important issues in practical digital marketing. Extensive analysis of data applied to a large commercial web-site shows that Markov chain based classifiers are useful predictors of user intent.
KW - click streams
KW - imputation
KW - link analysis
KW - markov chains
KW - prediction
UR - http://www.scopus.com/inward/record.url?scp=85018390891&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85018390891&partnerID=8YFLogxK
U2 - 10.1145/2872518.2891070
DO - 10.1145/2872518.2891070
M3 - Conference contribution
AN - SCOPUS:85018390891
T3 - WWW 2016 Companion - Proceedings of the 25th International Conference on World Wide Web
SP - 879
EP - 884
BT - WWW 2016 Companion - Proceedings of the 25th International Conference on World Wide Web
Y2 - 11 May 2016 through 15 May 2016
ER -