On Reconstructions of Academic Canonicity: Explorations of University Course Catalogs for German Literature

1. Introduction

The aim of this paper is a partial and exploratory quantitative reconstruction of a literary canon. A canon is understood as the body of literary texts and authors that are considered particularly valuable, important or influential by a group of people who are interested in passing them on (Winko 2002: 19). Texts are not canonical per se, but are made canonical by various actors belonging to different groups and institutions. The theorist Simone Winko describes this process using Adam Smith's metaphor of the invisible hand: many actions on a micro level, not necessarily intended to canonize an author, together lead to canonization on a macro level (Winko 2002). At the same time, those actions can serve as indicators for reconstructing canonicity at a certain time. In accordance with canon theory, it can be assumed that there is not one canon but several, which differ, among other things, in the institution to which they are linked, for example schools or universities.

Reconstructions of a canon can therefore be categorized according to which canon and which time period they address, and which indicators they use (Kampmann 2013). In Computational Literary Studies, various indicators have been used in such reconstructions.¹

In this paper, I am concerned with the current academic canon of German-language literature. I use an indicator that has hardly been explored quantitatively so far:² the study of authors in university courses. As an indicator of which authors are covered, I consider mentions of them in course descriptions.

2. Data and Methods

Six German universities, two Austrian universities, and one Swiss university were randomly chosen. A total of 6127 descriptions were then scraped from online catalogs for courses in German Literature. Figure 1 illustrates the number of descriptions per university. Figure 2 shows the number of descriptions obtained per semester and university.

Figure 1. Number of descriptions per university.

Figure 2. Number of descriptions per university and semester.

Since the data is distributed unevenly across time (see Figure 2), only descriptions from between 2018/19 and 2023/24 are used for the explorations.

Named entities (NEs) that refer to people and, as a subset, to writers were manually annotated in 54 randomly selected descriptions. Five models were then evaluated against these annotations (see Figure 3). Subsequently, the best-performing model was used to annotate a total of 7689 NEs in all descriptions.
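The model comparison can be illustrated with a minimal sketch. The paper does not state which evaluation metric was used; the sketch assumes exact-match precision, recall and F1 over annotated spans, which is a common choice for NER. The names `Span`, `precision_recall_f1` and `best_model` are hypothetical and serve only as illustration.

```python
from typing import Dict, NamedTuple, Set, Tuple


class Span(NamedTuple):
    """One annotated entity: document id, character offsets, label (hypothetical layout)."""
    doc_id: str
    start: int
    end: int
    label: str


def precision_recall_f1(gold: Set[Span], predicted: Set[Span]) -> Tuple[float, float, float]:
    """Exact-match scores: a prediction counts only if offsets and label both agree."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


def best_model(gold: Set[Span], predictions_by_model: Dict[str, Set[Span]]) -> str:
    """Return the name of the model with the highest F1 (assumed selection criterion)."""
    return max(
        predictions_by_model,
        key=lambda name: precision_recall_f1(gold, predictions_by_model[name])[2],
    )
```

Exact matching is strict: a model that finds the right person but includes a title or drops a surname is penalized twice, once as a false positive and once as a false negative. A more lenient overlap-based score would be an equally defensible assumption.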

Since not all references to people refer to writers, the identified entities were linked to the ‘Gemeinsame Normdatei’ (GND). For the following analyses, only entities that the GND metadata identifies as writers (a total of 1845) are considered.

Figure 3. Evaluations for five German NER-models.

3. Explorations

Nation and gender have often been described as relevant factors in canon formation (e.g. Heydebrand/Winko 1994, Starre 2013). For German-language literature, differences between German, Austrian and Swiss canons could be expected. In the following, the data is explored with regard to these categories. The underlying assumption is that the more often an author is mentioned, the more canonical they are.

Figure 4. Numbers of mentions and mentioned writers per university.

The table in Figure 4 shows the number of mentions and mentioned writers per university. Figure 5 shows the shares of all mentions per university by gender. An assessment of these numbers raises the question of a suitable benchmark. Compared to the expectations of a modern society that strives for equality, the proportion of female writers, with a mean of 16.8%, seems very low. For historical periods in which women did not have equal access to education and were not working as writers in equivalent numbers, however, a different ‘baseline’ would probably have to be applied. This paper, which sees itself as descriptive, cannot set this benchmark. It must be chosen by the universities according to the aims of their programs.

Figure 5. Proportion of mentions of female and male writers among all mentions per university.
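The shares by gender can be sketched as follows. The input format (one `(university, gender)` pair per writer mention) and the function names are hypothetical. The sketch also assumes that the reported mean is an unweighted average of the per-university shares; the paper does not specify whether it is weighted by the number of mentions.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Tuple


def gender_shares(mentions: Iterable[Tuple[str, str]]) -> Dict[str, Dict[str, float]]:
    """Share of each gender among all writer mentions, computed per university."""
    counts = defaultdict(Counter)
    for university, gender in mentions:
        counts[university][gender] += 1
    shares = {}
    for university, by_gender in counts.items():
        total = sum(by_gender.values())
        shares[university] = {g: n / total for g, n in by_gender.items()}
    return shares


def mean_share(shares: Dict[str, Dict[str, float]], gender: str) -> float:
    """Unweighted mean over universities (assumption); a missing gender counts as 0."""
    values = [by_gender.get(gender, 0.0) for by_gender in shares.values()]
    return sum(values) / len(values)
```

With an unweighted mean, a small catalog influences the result as much as a large one. Weighting by the number of mentions would instead describe the pooled corpus rather than the average university.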

It is striking that the proportion of women at the Austrian universities (wien and graz) is higher than at the other universities. Figure 6 shows the 20 most frequently mentioned writers per nation.

Figure 6. Mean relative number of mentions per writer and country (top 20 per country), colored by gender.
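The paper does not define the "mean relative number of mentions" formally. One plausible reading, sketched here as an assumption, is that a writer's mentions are first divided by all writer mentions at a university and these relative frequencies are then averaged over the universities of a country. The function name and input format are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, Iterable, Tuple


def mean_relative_mentions(
    mentions: Iterable[Tuple[str, str]],
    country_of: Dict[str, str],
) -> Dict[str, Dict[str, float]]:
    """mentions: (university, writer) pairs; country_of: university -> country."""
    per_university = defaultdict(Counter)
    for university, writer in mentions:
        per_university[university][writer] += 1

    # Relative frequency of each writer within each university, grouped by country.
    by_country = defaultdict(list)
    for university, counts in per_university.items():
        total = sum(counts.values())
        relative = {w: n / total for w, n in counts.items()}
        by_country[country_of[university]].append(relative)

    # Average over universities; a writer never mentioned at a university counts as 0.
    result = {}
    for country, share_dicts in by_country.items():
        writers = set().union(*share_dicts)
        result[country] = {
            w: sum(s.get(w, 0.0) for s in share_dicts) / len(share_dicts)
            for w in writers
        }
    return result
```

Normalizing per university before averaging keeps universities with large catalogs from dominating their country's list, which matters here because the number of descriptions differs considerably between universities.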

While there is only one female writer in the top 20 at the German and Swiss universities, there are three in Austria. With Elfriede Jelinek, a female writer even occupies second place. Like the other two, Jelinek is Austrian. When the graph is colored according to whether Austria is or was one of the writers' countries of residence (Figure 7), it is clearly visible that the Austrian academic canon differs from the others in that it covers more Austrian writers.

Figure 7. Mean relative number of mentions per writer and country (top 20 per country). The green bars stand for writers with an Austrian background.

Finally, Figure 8 shows the rank differences between the Austrian and German lists of the 100 writers with the most mentions. The accumulation of green bars on the left-hand side, which again stand for an Austrian place of residence, shows that the writers who are treated much more frequently in Austria than in Germany, and thus occupy a higher rank there, are primarily Austrian writers.

Figure 8. Rank differences between Austrian and German ranked lists of Top 100 writers with highest numbers of mentions. The bars on the left represent writers with a much higher rank in the Austrian top list than in the German top list.
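A rank comparison of this kind can be sketched as follows. The paper does not say how ties or writers that appear in only one of the two lists are handled; the sketch assumes that ties are broken alphabetically and that only writers present in both top lists are compared. All names are hypothetical.

```python
from typing import Dict, List, Tuple


def rank_by_mentions(mention_counts: Dict[str, float], top_n: int = 100) -> Dict[str, int]:
    """Rank writers by descending count (1 = most mentioned); ties broken by name."""
    ordered = sorted(mention_counts.items(), key=lambda item: (-item[1], item[0]))
    return {writer: rank for rank, (writer, _) in enumerate(ordered[:top_n], start=1)}


def rank_differences(ranks_a: Dict[str, int], ranks_b: Dict[str, int]) -> List[Tuple[str, int]]:
    """Rank in list A minus rank in list B, for writers present in both lists.

    Negative values mean a higher (better) rank in A than in B.
    The result is sorted so that these writers come first.
    """
    shared = ranks_a.keys() & ranks_b.keys()
    diffs = {writer: ranks_a[writer] - ranks_b[writer] for writer in shared}
    return sorted(diffs.items(), key=lambda item: (item[1], item[0]))
```

If list A is the Austrian ranking and list B the German one, the first entries of the result correspond to the bars on the left of such a plot. Restricting the comparison to shared writers discards exactly those who are prominent in only one country, so assigning absent writers a rank just below the cutoff would be an alternative worth considering.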

4. Conclusion

The data described shows which writers appear how frequently in course descriptions of German studies courses in a randomly selected but non-representative sample of universities. This data is meant to serve as an indicator for academic canonicity.

The explorations empirically support the assumption that there are differences between the national canons. The Austrian courses differ from the others, especially in that they cover relatively more female and more Austrian writers.

In the future, the data can serve as one building block in reconstructing and describing academic canonicity, whether to describe corpora or to further investigate canon formation.

Appendix A

Bibliography

Algee-Hewitt, Mark / Allison, Sarah / Gemma, Marissa / Heuser, Ryan / Moretti, Franco / Walser, Hannah (2016): "Canon/Archive: Large-Scale Dynamics in the Literary Field", in: Pamphlets of the Stanford Literary Lab 11 http://publikationen.ub.uni-frankfurt.de/frontdoor/index/index/docId/47005 [15.03.2021].
Barré, Jean / Camps, Jean-Baptiste / Poibeau, Thierry (2023): "Operationalizing Canonicity: A Quantitative Study of French 19th and 20th Century Literature", in: Journal of Cultural Analytics 8, 1 https://doi.org/10.22148/001c.88113 [17.10.2023].
Brottrager, Judith / Stahl, Annina / Arslan, Arda (2021): "Predicting Canonization: Comparing Canonization Scores Based on Text-Extrinsic and -Intrinsic Features", in: Computational Humanities Research Conference.
Ghosh, Arjun (2022): "Reforming the 'Eng Lit' canon: Measuring the myths and realities of English literary studies in India through a computational analysis of university curricula", in: Digital Humanities 2022. Conference Abstracts. Tokyo.
Gemeinsame Normdatei (GND), see: https://www.dnb.de/gnd [28.11.2023].
González, José Eduardo / Jacobson, Elliott / García García, Laura / Brandolini Kujman, Leonardo (2021): "Measuring Canonicity: Graduate Reading Lists in Departments of Hispanic Studies", in: Journal of Cultural Analytics 6, 1 https://doi.org/10.22148/001c.21599 [19.03.2021].
Hein, Jürgen (1990): "Kanon-Diskussion in Literaturdidaktik und Öffentlichkeit. Eine Bestandsaufnahme", in: Labroisse, Gerd (ed.): Literaturdidaktik, Lektürekanon, Literaturunterricht. Amsterdamer Beiträge zur neueren Germanistik, 30. Amsterdam: Rodopi, 311–346.
Heydebrand, Renate von / Winko, Simone (1994): "Geschlechterdifferenz und literarischer Kanon. Historische Beobachtungen und systematische Überlegungen", in: IASL 19, 2: 96–173.
Kampmann, Elisabeth (2013): "Wie lässt sich ein Kanon rekonstruieren?", in: Rippl, Gabriele / Winko, Simone (eds.): Handbuch „Kanon und Wertung“. Theorien, Instanzen, Geschichte. Stuttgart, Weimar: Metzler, 407–412.
Porter, J.D. (2018): "Popularity/Prestige", in: Pamphlets of the Stanford Literary Lab 17.
Starre, Alexander (2013): "Kontextbezogene Modelle: Bildung, Ökonomie, Nation und Identität als Kanonisierungsfaktoren", in: Rippl, Gabriele / Winko, Simone (eds.): Handbuch „Kanon und Wertung“. Theorien, Instanzen, Geschichte. Stuttgart, Weimar: Metzler, 58–66.
Stuck, Elisabeth (2004): Kanon und Literaturstudium. Explicatio. Paderborn: mentis.
Winko, Simone (2002): "Literatur-Kanon als invisible hand-Phänomen", in: Arnold, Heinz Ludwig / Korte, Hermann (eds.): Literarische Kanonbildung. München: 9–24.
Winko, Simone / Rippl, Gabriele (eds.) (2013): Handbuch Kanon und Wertung. Stuttgart: Metzler.

Notes

¹ Indicators used include, for example, lists of ‘best novels’ (Algee-Hewitt/McGurl 2015), entries in bibliographies (Porter 2018), mentions on Goodreads (Porter 2018), mentions in literary histories (Brottrager et al. 2021), mentions on reading lists (González et al. 2021, Brottrager et al. 2021), or occurrences in exams and anthologies and the winning of literary prizes (Barré et al. 2023).

² Ghosh's study quantitatively examines English Studies course catalogs in India, focusing more on course programs than on canonicity itself (Ghosh 2022). For German-language literature, older non-computational works cover previous periods (Hein 1990, Stuck 2004).