Paper at CIKM 2009 — Predicting the Volume of Comments on Online News Stories

Manos Tsagkias, Wouter Weerkamp, and Maarten de Rijke

University of Amsterdam
03 September 2009
Keywords: paper, cikm, predictive analytics

Abstract

Online news agents provide commenting facilities so that readers can express their views on news stories. The number of user-supplied comments on a news article may be indicative of its importance or impact. We report on exploratory work that predicts the comment volume of news articles prior to publication using five feature sets. We address the prediction task as a two-stage classification task: a first binary classification identifies articles with the potential to receive comments, and a second binary classification takes the output of the first step and labels those articles as having “low” or “high” comment volume. The results show solid performance for the former task, while performance degrades for the latter.
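The two-stage setup described in the abstract can be sketched roughly as follows. This is a minimal illustration of a classifier cascade, not the authors' implementation: the function `two_stage_predict`, the feature name `entity_count`, the toy threshold classifiers, and the extra `"none"` label for articles predicted to receive no comments are all assumptions introduced here for illustration.

```python
from typing import Callable, Dict, List

# Hypothetical types: an article is a flat mapping of feature name to value,
# and a classifier is any callable returning a yes/no decision.
Features = Dict[str, float]
BinaryClassifier = Callable[[Features], bool]


def two_stage_predict(
    articles: List[Features],
    will_get_comments: BinaryClassifier,
    is_high_volume: BinaryClassifier,
) -> List[str]:
    """Label each article "none", "low", or "high".

    Stage 1 decides whether an article is likely to receive comments at all.
    Stage 2 runs only on articles that pass stage 1 and splits them into
    "low" and "high" comment volume.
    """
    labels = []
    for features in articles:
        if not will_get_comments(features):
            labels.append("none")  # assumed label; stage 2 is skipped
        elif is_high_volume(features):
            labels.append("high")
        else:
            labels.append("low")
    return labels


# Toy stand-ins for trained classifiers (assumption: a single made-up
# feature and arbitrary thresholds, chosen only to make the sketch runnable).
def toy_stage_one(features: Features) -> bool:
    return features.get("entity_count", 0.0) >= 2


def toy_stage_two(features: Features) -> bool:
    return features.get("entity_count", 0.0) >= 5


articles = [{"entity_count": 0}, {"entity_count": 3}, {"entity_count": 8}]
labels = two_stage_predict(articles, toy_stage_one, toy_stage_two)
print(labels)
```

Because stage 2 only sees what stage 1 lets through, any stage 1 error carries over to the final label. This is a general property of cascades like this one.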

References

[1] Manos Tsagkias, Wouter Weerkamp, and Maarten de Rijke. 2009. Predicting the volume of comments on online news stories. In Proceedings of the 18th ACM conference on Information and knowledge management (CIKM ’09). Association for Computing Machinery, New York, NY, USA, 1765–1768.