Using #AI and #NLP to study storytelling at McGillU. Author of Enumerations: Data and Literary Study (2018) and director of .txtLAB.


  • Can We Be Wrong? The Problem of Textual Evidence in a Time of Data (Cambridge 2020)
  • Enumerations: Data and Literary Study (Chicago 2018)
  • HATHI 1M: Introducing a Million Page Historical Prose Dataset in English from the Hathi Trust (2022)
  • Cultural Capitals: Modeling Minor European Literature
  • The CONLIT Dataset of Contemporary Literature
  • Biodiversity is not declining in fiction
  • MultiHATHI: A Complete Collection of Multilingual Prose Fiction in the HathiTrust Digital Library

Usage metrics

Co-workers & collaborators

Andrew Piper's public data