Exploring Fictional Critique and Literary Evaluations

1. Motivation

One of the most prestigious functions of literary fiction is to criticize contemporary social affairs (see Jannidis et al. 2009, 26). Many writers publicly contend to take a critical stance on social developments by means of their fictions and lay people as well as literary critics regularly claim that certain novels or other fictional texts are “socially critical”, “critical of contemporary affairs”, or “critical of culture”. However, there is neither a widely acknowledged technical term for this kind of literature – “littérature engage” is bound to a specific aesthetic/philosophical tradition (see Hucke/Kutzmutz 2003; Ilić 2017; Wegmann 1996) – nor a corresponding field of research. Against this background, our contribution aims at identifying on the one hand ascriptions of such “fictional critique” by literary critics and on the other hand textual correlates of such “fictional critique” that are meaningful to literary scholars. The main hypothesis is that “literary evaluations” (Hunt/Vipond 1986; Winko 1991; Prinz/Winko 2013), i.e. evaluations of characters, spaces, time periods etc. that occur in fictional works of literature (e.g., "the courageous savior", "a wonderful road", but also more implicit forms) play a crucial role for the expression of fictional critique.

2. Approach

To analyze ascriptions of fictional critique (e.g., “the socially critical novel X”) we extract paragraphs that contain relevant compounds with "critique" from a German-language corpus of literary studies reference works and literary histories (> 100 books, 2005-2023) using regular expression. These paragraphs are annotated for ascriptions of fictional critique (2 annotators, Cohen’s κ = 0,69), including a gold standard creation. In a second step, we use the resulting metadata (the literary works the secondary literature is referencing, Gittel 2025) to build a digital corpus of works of fictional critique and annotate a subset of it for literary evaluations (including evaluation target, evaluated aspect – “courageous savior” for example being a moral evaluation –, and linguistic trigger) using the platform INCEpTION (de Castilho 2018). According to the annotation guidelines, minimal spans are NP’s and clauses.

Since sentiment analysis is often used for large-scale analysis of “evaluations” (Liu 2017, 12) in the non-literary domain (e.g., product reviews) and even literary researchers sometimes describe its merit in terms of analyzing “evaluation[s]” (Grisot/Herrmann 2023, 5), we hypothesize that it can be used in the literary domain to detect literary evaluations (Hunt/Vipond 1986; Winko 1991; Prinz/Winko 2013). Although a growing body of sentiment analysis studies has determined sentiment scores for whole literary texts or calculated affective variation throughout a narrative (Kim et al. 2017; Reagan et al. 2016; Jacobs 2019; Mohammad 2012; Klinger et al. 2016), it is not clear whether sentiment analysis is appropriate to measure literary evaluations. Thus, we evaluate which of the most popular sentiment analysis tools for German – like SentiWS (Remus et al. 2010), SentiMerge (Emerson/Declerck 2014) or German-Sentiment-BERT (Guhr 2020) – works best using the gold standard annotation of literary evaluations as ground truth (6 finalized texts of an ongoing annotation with 2 annotators). Text passages both annotated as evaluative and not annotated as such are run through sentiment analysis to check whether a matching (negative / positive / neutral) sentiment score is assigned.

In a third step, we use the best performing model to compare the fictional critique corpus and corresponding subsets of this corpus for social critique, cultural critique (germ. “Kulturkritik”), contemporary critique (germ. “Zeitkritik”) and others to the German ELTeC Reference Corpus (ELTeC 2020, 100 texts) regarding frequency and distribution patterns of literary evaluations. If the main hypothesis, that literary evaluations play a crucial role for the expression of fictional critique, is true, the fictional critique corpus / its subsets should show characteristic patterns in comparison to the ELTeC corpus. Among others, we test the hypotheses, that i) fictional critique (or one of its subtypes) contains on average more negative evaluations than non-critique fiction, and ii) that fictional critique (or one of its subtypes) contains stronger evaluative contrasts between positively and negatively evaluated entities than non-critique fiction. To the second end, we average sentiment scores in the surrounding text of mentions of named entities and calculate the difference between the means of positively and negatively evaluated entities of one text.

Appendix A

Bibliography
  1. de Castilho, Richard Eckart; Klie, Jan-Christoph; Kumar, Naveen; Boullosa, Beto; Gurevych, Iryna (2018): Linking Text and Knowledge using the INCEpTION annotation platform. In: Proceedings of the 14th eScience IEEE International Conference, pp. 327-328, The 14th eScience IEEE International Conference, Amsterdam, Netherlands, 29.10.2018--01.11.2018, DOI: 10.1109/eScience.2018.00077.
  2. Emerson, Guy; Declerck, Thierry (2014): SentiMerge: Combining sentiment lexicons in a Bayesian framework. In: Proceedings of the Workshop on Lexical and Grammatical Resources for Language Processing, COLING 2014, pp. 30-38.
  3. European Literary Text Collection (ELTeC), version 1.0.0, (2020), edited by Carolin Odebrecht, Lou Burnard and Christof Schöch. COST Action Distant Reading for European Literary History (CA16204). https://github.com/COST-ELTeC. DOI: doi.org/10.5281/zenodo.4274954.
  4. Gittel, Benjamin: „Gesellschaftskritik“, „Kulturkritik“, „Sozialkritik“, „Zeitkritik“ ... Zuschreibungen und textuelle Korrelate fiktional-literarischer Kritik. In: Zeitschrift für Germanistik 1 (2025) [accepted for publication].
  5. Grisot, Giulia; Herrmann, Berenike (2023): Examining the Representation of Landscape and Its Emotional Value in German-Swiss Fiction between 1840 and 1940. Journal of Cultural Analytics, vol. 8, no. 1, https://doi.org/10.22148/001c.84475.
  6. Guhr, Oliver; Schumann, Anne-Kathrin; Bahrmann, Frank; Böhme, Hans-Joachim (2020): Training a Broad-Coverage German Sentiment Classification Model for Dialog Systems. In: International Conference on Language Resources and Evaluation. https://api.semanticscholar.org/CorpusID:218974283.
  7. Hucke, Karl-Heinz; Kutzmutz, Olaf (2003): Engagierte Literatur. In: Harald Fricke, Klaus Grubmüller, Jan-Dirk Müller und Klaus Weimar (Hg.): Reallexikon der deutschen Literaturwissenschaft. Neubearbeitung des Reallexikons der deutschen Literaturgeschichte. 3. Ed. 3 Vol., Berlin, Vol. 1, pp. 446–447.
  8. Hunt, Russell A.; Vipond, Douglas (1986): Evaluations in literary reading. In: Text - Interdisciplinary Journal for the Study of Discourse 6 (1), 53-72. https://doi.org/10.1515/text.1.1986.6.1.53.
  9. Ilić, Biljana Vlašković (2017): Engaged Literature as Art, Prerogative, and Obligation. In: Zeljka Babic, Tatjana Bijelic and Petar Penda (ed.): Rethinking Tradition in English Language and Literary Studies. Cambridge.
  10. Jacobs, Arthur (2019): Sentiment Analysis for Words and Fiction Characters From the Perspective of Computational (Neuro-)Poetics. Frontiers in Robotics and AI. 6:53. doi: 10.3389/frobt.2019.00053.
  11. Jannidis, Fotis; Lauer, Gerhard; Winko, Sabine (2009): Radikal historisiert. Für einen pragmatischen Literaturbegriff. In: Fotis Jannidis, Gerhard Lauer und Sabine Winko (ed.): Grenzen der Literatur. Berlin, New York, pp. 3–37.
  12. Kim, Evgeny, et al. (2017): Prototypical Emotion Developments in Literary Genres. Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pp. 17–26.
  13. Liu, Bing (2017): Many Facets of Sentiment Analysis. In: E. Cambria et al. (Eds.): A Practical Guide to Sentiment Analysis, Springer, pp. 11–39, https://doi.org/10.1007/978-3-319-55394-8_2.
  14. Mohammad, Saif M. (2012): From Once upon a Time to Happily Ever after: Tracking Emotions in Mail and Books. In: Decision Support Systems, vol. 53 (4), pp. 730–41, https://doi.org/10.1016/j.dss.2012.05.030.
  15. Prinz, Katharina; Winko, Simone (2013): Wie rekonstruiert man Wertungen und Werte in literarischen Texten. In: Gabriele Rippl, Simone Winko (Eds.): Handbuch Kanon und Wertung. Theorien, Instanzen, Geschichte. Stuttgart, Weimar, pp. 402–407.
  16. Remus, Robert; Quasthoff, Uwe; Heyer, Gerhard (2010): SentiWS - A Publicly Available German-language Resource for Sentiment Analysis. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10). Valletta, Malta: European Language Resources Association (ELRA). http://www.lrec-conf.org/proceedings/lrec2010/pdf/490_Paper.pdf.
  17. Winko, Simone (1991): Wertungen und Werte in Texten. Axiologische Grundlagen und literaturwissenschaftliches Rekonstruktionsverfahren. Braunschweig, Wiesbaden.
Benjamin Gittel (benjamingittel@gmail.com), Trier University, Germany und Gesa Bei der Wieden (g.beiderwieden@stud.uni-goettingen.de), Göttingen University, Germany