Modeling Complex Clickstream Data by Stochastic Models: Theory and Methods

Choudur Lakshminarayan, Ram Kosuru, Meichun Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

As the website is a primary customer touch-point, millions are spent to gather web data about customer visits. Sadly, the trove of data and corresponding analytics have not lived up to the promise. Current marketing practice relies on ambiguous summary statistics or small-sample usability studies. Idiosyncratic browsing and low conversion (browser-to-buyer) make modeling hard. In this paper, we model browsing patterns (sequence of clicks) via Markov chain theory to predict users' propensity to buy within a session. We focus on model complexity, imputing missing values, data augmentation, and other attendant issues that impact performance. The paper addresses the following aspects; (1) Determine appropriate order of the Markov chain (assess the influence of prior history in prediction), (2) Impute missing transitions by exploiting the inherent link structure in the page sequences, (3) predict the likelihood of a purchase based on variable-length page sequences, and (4) Augment the training set of buyers (which is typically very small: 2% by viewing the page transitions as a graph and exploiting its link structure to improve performance. The cocktail of solutions address important issues in practical digital marketing. Extensive analysis of data applied to a large commercial web-site shows that Markov chain based classifiers are useful predictors of user intent.

Original languageEnglish
Title of host publicationWWW 2016 Companion - Proceedings of the 25th International Conference on World Wide Web
Pages879-884
Number of pages6
ISBN (Electronic)9781450341448
DOIs
StatePublished - 11 Apr 2016
Event25th International Conference on World Wide Web, WWW 2016 - Montreal, Canada
Duration: 11 May 201615 May 2016

Publication series

NameWWW 2016 Companion - Proceedings of the 25th International Conference on World Wide Web

Conference

Conference25th International Conference on World Wide Web, WWW 2016
Country/TerritoryCanada
CityMontreal
Period11/05/1615/05/16

Keywords

  • click streams
  • imputation
  • link analysis
  • markov chains
  • prediction

Fingerprint

Dive into the research topics of 'Modeling Complex Clickstream Data by Stochastic Models: Theory and Methods'. Together they form a unique fingerprint.

Cite this