Where do people tell stories online?

Very excited to be part of a new collaboration authored by Maria Antoniak: “Where Do People Tell Stories Online? Story Detection Across Online Communities.” We release a dataset to facilitate the detection of stories across Reddit communities. Notably, the dataset supports story detection at the span level, i.e., predicting how likely a given passage within a longer post is to be narrative.

Our best model is a fine-tuned RoBERTa model that achieves an accuracy of 0.86. Using this model, we see that rates of storytelling differ strongly across subreddits. Most importantly, we see this resource as enabling future studies of online storytelling behavior.
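
To give a rough sense of what comparing storytelling rates across communities involves, here is a minimal Python sketch. It is not the code from the paper: the function name `storytelling_rates`, the community names, and the labels are all made up for illustration, and it assumes a classifier has already assigned each post a binary story/non-story prediction.

```python
from collections import defaultdict


def storytelling_rates(predictions):
    """Return the fraction of posts predicted to be stories, per community.

    `predictions` is an iterable of (community, is_story) pairs. This input
    format is an assumption made for this sketch, not the dataset's format.
    """
    totals = defaultdict(int)
    stories = defaultdict(int)
    for community, is_story in predictions:
        totals[community] += 1
        stories[community] += int(is_story)
    return {name: stories[name] / totals[name] for name in totals}


# Hypothetical predictions, invented for illustration only.
example = [
    ("community_a", True),
    ("community_a", True),
    ("community_a", True),
    ("community_a", False),
    ("community_b", False),
    ("community_b", True),
]

rates = storytelling_rates(example)
for name, rate in sorted(rates.items()):
    print(f"{name}: {rate:.2f}")
```

With real predictions, the same aggregation could be run over spans instead of whole posts to estimate how much of each community's text is narrative.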