Machine learning for survival analysis: A survey

Ping Wang, Yan Li, Chandan K. Reddy

Research output: Contribution to journalArticlepeer-review

399 Scopus citations

Abstract

Survival analysis is a subfield of statistics where the goal is to analyze and model data where the outcome is the time until an event of interest occurs. One of the main challenges in this context is the presence of instances whose event outcomes become unobservable after a certain time point or when some instances do not experience any event during the monitoring period. This so-called censoring can be handled most effectively using survival analysis techniques. Traditionally, statistical approaches have been widely developed in the literature to overcome the issue of censoring. In addition, many machine learning algorithms have been adapted to deal with such censored data and tackle other challenging problems that arise in real-world data. In this survey, we provide a comprehensive and structured review of the statistical methods typically used and the machine learning techniques developed for survival analysis, along with a detailed taxonomy of the existing methods. We also discuss several topics that are closely related to survival analysis and describe several successful applications in a variety of real-world application domains. We hope that this article will give readers a more comprehensive understanding of recent advances in survival analysis and offer some guidelines for applying these approaches to solve new problems arising in applications involving censored data.

Original languageEnglish
Article number110
JournalACM Computing Surveys
Volume51
Issue number6
DOIs
StatePublished - Feb 2019

Keywords

  • Censoring
  • Concordance index
  • Cox model
  • Hazard rate
  • Machine learning
  • Regression
  • Survival analysis
  • Survival data

Fingerprint

Dive into the research topics of 'Machine learning for survival analysis: A survey'. Together they form a unique fingerprint.

Cite this