Academic Publishing

The Coding Turn in the Humanities

The Coding Turn in the Humanities

As part of my new book, I have made the code and all derived text data freely available online. The underlying text data has been shared as far as copyright restrictions would allow. As I mentioned in my initial post, this entailed a massive amount […]

The great ________ novel: How scholars classify the novel

The great ________ novel: How scholars classify the novel

Ever since Fotis Jannidis posted a graph on novel classification a few years ago I have been inspired to do something similar in English. What are the ways in which scholars over the past half-century have classified “the novel”? Using a collection of over 60,000 […]

Gender Trouble: Literary Studies’ He/She Problem

Gender Trouble: Literary Studies’ He/She Problem

Pronouns have become a hot topic of late and I thought it would be interesting to explore their use in the new JSTOR data set that I have been working on that represents 60 years of literary studies articles. Previous work has shown how men […]

Topic Stability, Part 2

Topic Stability, Part 2

In my previous post I tried to illustrate how different runs of the same topic modelling process can produce topics that appear to be slightly semantically different from one another. If you keep k and all other parameters constant, but change your initial seed, you’ll […]

Topic Modelling Literary Studies: Topic Stability, Part 1

Topic Modelling Literary Studies: Topic Stability, Part 1

I’ve started working with a new data set of ca. 60,000 articles from the field of literary studies published between 1950 and 2010 courtesy of JSTOR. I’ll have more to say about the data set in the coming weeks, but for now I want to […]

Where’s the data? Notes from an international forum on limited use text mining

Where’s the data? Notes from an international forum on limited use text mining

I’m attending a two-day workshop on issues related to data access for text and data mining (TDM). We are 25 participants from different areas, including researchers who do TDM, librarians who oversee digital content, and content providers who package and sell data to academic libraries […]

The Replication Crisis I: Restoring confidence in research through replication clusters

The Replication Crisis I: Restoring confidence in research through replication clusters

Much has been written about the so-called “replication crisis” going on across the sciences today. There are many ways that these issues impact literary and cultural studies, but not always in the most straightforward way. “Replication” has a complicated fit with more interpretive disciplines and […]