Recent Posts

Detecting footnotes in 32 million pages of ECCO

Detecting footnotes in 32 million pages of ECCO

I’m very proud to be part of the collaborative project that produced this new article in the Journal of Cultural Analytics where we outline the process we used to visually detect footnotes in 32 million page images. (Full disclosure: I am the editor and to avoid conflict […]

How do we Model Stereotypes without Stereotyping (Again)? How about information theory.

How do we Model Stereotypes without Stereotyping (Again)? How about information theory.

In a previous post, we explored how using language models and the idea of “perplexity” can allow us to study stereotypes in movie character roles using their dialogue as a basis. We examined a corpus of 750 Hollywood films, released between 1970 and 2014, and […]

Does the decline of gender within literary studies matter?

Does the decline of gender within literary studies matter?

A little while back I posted on the unmoving ratio between male/female pronouns in a data set of ~60,000 articles in literary studies. The ratio has been stable at about 2:1 since the mid-1990s. While pronouns do not tell us everything we need to know […]

A Sense of an Ending: Poetry and Periods

A Sense of an Ending: Poetry and Periods

As part of my on-going fascination with punctuation, in Enumerations I look at the words that are most likely to be followed by a period in a collection of 75,000 twentieth-century poems. What we see are the very pronounced ways that poems tend to end […]

On Colons, or Standardization in Literary Studies

On Colons, or Standardization in Literary Studies

Sometimes things don’t need to be complicated. Have you ever wondered about the convention of using colons in titles of academic articles? As in, “Here’s my big idea: now let me narrow it down for you”? Well, I’ve never actually seen something that exhibits this […]

Novel Worlds: Theory + Computation

Novel Worlds: Theory + Computation

I am very excited for our *fifth* (!) annual NovelTM conference coming up this week. The aim of this year’s conference is to begin the long overdue conversation between data-driven research and literary scholarship more generally. The particular theme of the conference will centre around the […]

Did we already know that?

Did we already know that?

For anyone who has ever given a talk using computational methods the response, “But didn’t we already know that?”, looms large in your consciousness. Nothing feels more deflating…and frustrating. I always want to respond, well, we believed we knew it and now we have more […]

How do we model stereotypes without stereotyping?

How do we model stereotypes without stereotyping?

We recently put out a paper on how racial bias functions in Hollywood films. This work was based on a few studies that came before it, namely this one, from USC Annenberg. We presented numerical analyses like the number of characters in different racial and […]


%d bloggers like this: