Recent Posts

.txtLAB’s Racial Lines featured by the CBC

.txtLAB’s Racial Lines featured by the CBC

Very pleased that new research on racial bias in Hollywood cinema by lab members Vicky Svaikovsky, Anne Meisner and Eve Kraicer has been featured on the CBC today. A really moving set of interviews with Canadian actors to discuss impressions of the lack of diversity in […]

Our first “collaboration”: Racial Lines

Our first “collaboration”: Racial Lines

I am very pleased to announce the launch of a new series of papers that will be coming out of .txtLAB in the months (and hopefully years) to come. We are calling them “collaborations,” and not just because it’s a pun on the word “lab.” […]

Gender Trouble: Literary Studies’ He/She Problem

Gender Trouble: Literary Studies’ He/She Problem

Pronouns have become a hot topic of late and I thought it would be interesting to explore their use in the new JSTOR data set that I have been working on that represents 60 years of literary studies articles. Previous work has shown how men […]

Topic Stability, Part 2

Topic Stability, Part 2

In my previous post I tried to illustrate how different runs of the same topic modelling process can produce topics that appear to be slightly semantically different from one another. If you keep k and all other parameters constant, but change your initial seed, you’ll […]

Topic Modelling Literary Studies: Topic Stability, Part 1

Topic Modelling Literary Studies: Topic Stability, Part 1

I’ve started working with a new data set of ca. 60,000 articles from the field of literary studies published between 1950 and 2010 courtesy of JSTOR. I’ll have more to say about the data set in the coming weeks, but for now I want to […]

Bad Graphs, or, I Have a BA in English

Bad Graphs, or, I Have a BA in English

Not to brag, but it’s almost impressive how many times I got something so simple so profoundly wrong, in so many odd ways.

Where’s the data? Notes from an international forum on limited use text mining

Where’s the data? Notes from an international forum on limited use text mining

I’m attending a two-day workshop on issues related to data access for text and data mining (TDM). We are 25 participants from different areas, including researchers who do TDM, librarians who oversee digital content, and content providers who package and sell data to academic libraries […]


%d bloggers like this: