Abstract
Forecasting large-scale societal events like civil unrest movements, disease outbreaks, and elections is an important and challenging problem. From the perspective of human analysts and policy makers, forecasting algorithms must not only make accurate predictions but must also provide supporting evidence, e.g., the causal factors related to the event of interest. We develop a novel multiple instance learning based approach that jointly tackles the problem of identifying evidence-based precursors and forecasts events into the future. Specifically, given a collection of streaming news articles from multiple sources we develop a nested multiple instance learning approach to forecast significant societal events such as protests. Using data from three countries in Latin America, we demonstrate how our approach is able to consistently identify news articles considered as precursors for protests. Our empirical evaluation demonstrates the strengths of our proposed approach in filtering candidate precursors, in forecasting the occurrence of events with a lead time advantage and in accurately predicting the characteristics of civil unrest events.
| Original language | English |
|---|---|
| Title of host publication | KDD 2016 - Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining |
| Pages | 1095-1104 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781450342322 |
| DOIs | |
| State | Published - 13 Aug 2016 |
| Event | 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2016 - San Francisco, United States Duration: 13 Aug 2016 → 17 Aug 2016 |
Publication series
| Name | Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining |
|---|---|
| Volume | 13-17-August-2016 |
Conference
| Conference | 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2016 |
|---|---|
| Country/Territory | United States |
| City | San Francisco |
| Period | 13/08/16 → 17/08/16 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Event detection
- Multi-instance learning
- Text mining
Fingerprint
Dive into the research topics of 'Modeling precursors for event forecasting via nested multi-instance learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver