Seshat NLP Dataset Pre-Release

modified on 2024-01-29, 18:01

This is a pre-release of Seshat NLP, a dataset of labeled text segments derived from the Seshat Databank.

The Seshat_NLP.sql file is a PostgreSQL dump that can be used to instantiate both dataset tables (the labeled descriptions and the labeled text segments).
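
As a rough sketch of how the dump might be loaded, the Python snippet below wraps the standard PostgreSQL client tools createdb and psql. It assumes that Seshat_NLP.sql is a plain-text SQL dump (as the .sql extension suggests), that both tools are on the PATH, and that connection settings come from the environment. The database name seshat_nlp and the helper names are illustrative and not part of the release.

```python
import subprocess
from pathlib import Path


def build_restore_commands(dump_path, db_name="seshat_nlp"):
    """Return the createdb and psql invocations as argument lists.

    db_name is a hypothetical default; pick any name you like.
    """
    dump = str(Path(dump_path))
    return [
        # Create an empty database to receive the tables.
        ["createdb", db_name],
        # Replay the plain-text dump; stop at the first SQL error.
        ["psql", "-v", "ON_ERROR_STOP=1", "-d", db_name, "-f", dump],
    ]


def restore(dump_path, db_name="seshat_nlp"):
    """Run the commands in order, raising if either one fails."""
    for cmd in build_restore_commands(dump_path, db_name):
        subprocess.run(cmd, check=True)


# Example call (requires a reachable PostgreSQL server):
# restore("Seshat_NLP.sql")
```

If the file turns out to be a custom-format archive rather than plain SQL, pg_restore would be needed in place of psql.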

The hierarchy_graph.gexf file is an XML-based export of the hierarchy graph that can be used to look up the hierarchical position of text labels with respect to the Seshat codebook (