On the Principles of a Digital Text Corpus: New Opportunities in Working on Heroic Epics of the Shors
- Volume 28, Number 2
- Dmitri A. Funk
- View PDF | Download PDF
This essay discusses the main principles of a Digital Text Corpus initiated in 2011 with support from the Department of Northern and Siberian Studies at the Institute of Ethnology and Anthropology of the Russian Academy of Sciences. With special focus on the vast Shor (a Turkic people in the south of Western Siberia) materials, the essay showcases how this Corpus offers unique and varied means for analyzing folklore texts in lesser used, mostly endangered Siberian languages.
Proportions of the numbers of the Shor epic texts recorded by scholars between 1861-2006.
Graph: by the author.
Volume of the Shor Corpus (with a list of the 20 most frequent word-forms).
The structure of the Shor Corpus.
Vladimir Tannagashev in his apartment (in the kitchen) in the town of Myski, Kemerovo region, 2003.
Photo: D. Funk.
List of epic texts from the Tannagashev’s repertoire in the Shor Corpus.
Torbokov - taken by an anonymous photographer on June 15, 1969.
Photo: Folklore Archive of the State Literature Museum.