Reinventing Cultural Analytics Curriculum: Identifying Core Competencies and Sequencing for Multiple Constituencies

In recent years, the label “cultural analytics” (CA) has risen in prominence as a catchall for the application of data-driven, computational, and/or quantitative tools and methods to revisit research areas associated with the study of culture in various disciplines. The label's first use is credited to Lev Manovich, but its use has arguably grown beyond his original definition (Manovich 2016; Manovich 2020; Piper 2016). Although CA has been described as part of the “big tent” of digital humanities (DH), it operates as both a subfield within DH and an emerging field of its own with constituencies in the humanities, the social sciences, and information & computer science. Ted Underwood has gone as far as arguing that CA must be acknowledged as “a bridge between the humanities and quantitative social science” that belongs equally to both (Underwood). Alternative labels for the focal area (e.g. computational humanities, computational literary studies, distant reading, and macroanalysis) have done comparatively less to signal this confluence of multiple constituencies (see Da, Jockers, Manovich 2020, Michel et al., and Moretti). Whatever title is used, debates around CA have tended to focus on: (1) applying computational methods with sufficient quantitative rigor; (2) preserving humanities sensibilities in the face of methods that may appear to require embracing a positivist and overly reductive worldview; and (3) questioning whether quantitative and computational methods can deliver results of interest and relevance to the humanities (See Bode, Da, and Levy-Eichel & Scheinerman). All three of these topic areas suggest a clash of competing values, with one set of stakeholders typically emphasizing item 1 and another set of stakeholders typically emphasizing items 2 and 3. Failing to address this dynamic directly has slowed the progress of a core area of concern for CA: formalizing how aspiring CA practitioners enter the field. As Underwood has described, a lack of formal training for humanists in data-driven or quantitative inquiry creates a structural inequality whereby practitioners with a background in these areas tend to have a competitive advantage and feel more welcome in the field. This context creates obstacles for both established humanities researchers hoping to move into CA and graduate students who want to specialize in CA approaches.

In this long paper presentation, I address the challenges of addressing this structural inequality, as well as areas where I believe progress can be made. My central argument is that CA must work toward identifying core methodological areas (competencies) and work toward optimizing the order in which specific topics are taught (sequencing). Although this process will likely be contentious (and may read as hostile to the idea of DH’s “big tent”), it has become increasingly clear that this process is overdue. Competencies designated as core to CA will not be equally valuable to all DH practitioners, but acknowledging this fact need not precede an argument to excise CA from DH or otherwise divorce the two from one another. My paper is divided into three sections: (1) summary of CA’s current socio-technical infrastructure; (2) overview of well-functioning formalization processes; and (3) recommendations for the future of CA curriculum.

1. Socio-Technical Infrastructure

In this section, I describe the state of CA curriculum and sequencing, which has developed in the absence of the relatively well-functioning formalization processes for field entry that can be found in comparable quantitative disciplines. This summary is based on surveying publicly available syllabi and academic program websites to ascertain norms such as:

Where a cultural analytics, humanities analytics, or computational humanities courses seem to be taught , and at what level (e.g., undergraduate vs. graduate)
Whether CA (or equivalent) operates as standalone academic program or certificate, a specialty within a DH program, a specialty within some other program (such as data science or informatics), etc.
Are out-of-program prerequisites such as “Introduction to Computer Science” or “Applied Math” used to cover methodological competencies?
What is the current mix of computer programming, mathematics, and applied qualitative/quantitative research methods being covered in the program curriculum?
In what sequences are prerequisites, CA-specific courses, and other program requirements typically offered?

Although I have found that many course materials are not available online (and others may be available but hard to find), attempting to answer these questions is crucial to understanding the de facto state of CA curriculum, as well as the values that appear to be informing current practices.

I supplement this section with a literature review of scholarship relevant to or directly engaged with questions of CA training. This task is more complicated than a typical literature review, however, as it has involved locating publications in a broad range of publications, and evaluating a diverse range of material for both relevance and quality. The most relevant DH and CA materials tend to fit into one of three categories:

Pedagogical discourse within DH addressing how computational and quantitative methods should be taught, often under the assumption of a “big tent” DH context (see Hirsch and Birnbaum & Langmead)
DH or CA literature discussing the methodological or skill dependencies of rigorous research, which is on topic but not specifically focused on teaching (see Jarausch & Hardy, Haskins & Jeffrey, Siddiqui)
Materials designed to function as how-to guides, cource textbooks, lessons, or tutorials, many of which implicitly or explicitly establish core competencies (see Dombrowski, Graham et al., Posner, Programming Historian, and Walsh)

2. Analogous Formalization Processes

This section offers a fourth area for relevant course materials and scholarly literature that is often under-appreciated: curricular and pedagogical materials from analogous disciplines (e.g. computer science, applied mathematics, computational social sciences, econometrics, etc.), as well as materials documenting the processes by which these disciplines have formalized areas of core competency.

This section is informed by my research in the field of CA, my experience teaching data analytics (DA) in an interdisciplinary program at Denison University. Denison University is a diverse, highly selective, residential liberal arts college (undergraduate students only) enrolling approximately 2,300 students. The DA program has faculty with training in applied statistics, applied mathematics, astrophysics, computational biology, cultural analytics, and econometrics. Coursework in the major includes core DA courses, math and computer science prerequisites, several courses cross-listed with math and computer science, and a subject area concentration of 3-4 courses in an outside department (“Degree Requirements”) In this context, I have taught courses across the DA program; adapted CA topics and assignments into other data analytics and computer science classes; and piloted an undergraduate elective called “Introduction to Cultural Analytics” (Lavin).

Locating CA as a confluence of stakeholders from multiple backgrounds allows questions of competencies, sequencing, and synthesis to come to the foreground. In Underwood’s words, CA requires learning “elements of mathematics, computer programming, and quantitative research design” (Underwood). It has not been my experience, however, that CA can simply appropriate mathematics, computer science, and quantitative social science training from other disciplines; rather, a genuinely interdisciplinary, multi-institutional intervention is called for. As the discipline formalizes, it can look to its closest counterparts for examples and templates of the kind of socio-technical infrastructure that would best preserve inclusiveness while simultaneously facilitating a formal process by which competency to the enter the field is enabled and evaluated. Achieving this goal requires establishing processes by which such outputs are created and maintained; fostering constructive discussions among stakeholders; and developing criteria by which competing stakeholder values can be reconciled or synthesized.

3. Recommendations

This section of my paper serves as a conclusion and a call to action. It attempts to synthesize import takeaways from the previous two sections, namely:

Why a formalization effort is necessary, and why the time is now. Here I will return to the issue of structural inequality, as well as the question of how are de facto curriculum is missing core elements of quantitative research training.
How socio-technical infrastructures could support this process, and how we can begin to create them. Here I will return to questions of inclusiveness and making core competencies easier to obtain and evaluate.
My “first pass” at gaps in our de facto curriculum; that is, skills and sequencing for CA that are arguably crucial and yet seem to be absent from or undervalued by existing approaches. This will also include ideas for assignments that emphasize synthesis, problem-based learning, and project-based learning in the context of a CA program.

Appendix A

Bibliography

Birnbaum, David J., and Alison Langmead. “Task-Driven Programming Pedagogy in the Digital Humanities.” New Directions for Computing Education: Embedding Computing Across Disciplines, ed. Samuel B. Fee, Amanda M. Holland-Minkley, and Thomas E. Lombardi, 63–85. Springer, 2017.
Bode, Katherine. “The Equivalence of ‘Close’ and ‘Distant’ Reading; or, Toward a New Object for Data-Rich Literary History.” Modern Language Quarterly 78, no. 1 (March 1, 2017): 77–106. https://doi.org/10.1215/00267929-3699787 .
Da, Nan Z. “The Computational Case against Computational Literary Studies.” Critical Inquiry 45, no. 3 (March 2019): 601–39. https://doi.org/10.1086/702594 .
“Degree Requirements.” Data Analytics Program, Denison University. https://denison.edu/academics/data-analytics/degree-requirements
Denison University, “Vision & Values.” https://denison.edu/campus/about/vision-values .
Dombrowski, Quinn. The Data Sitters Club. https://datasittersclub.github.io/site/
Graham, Shawn, Ian Milligan, Scott B. Weingart, and Kimberley Martin. Exploring Big Historical Data: The Historian’s Macroscope (Second Edition). World Scientific, 2022.
Haskins, Loren, and Kirk Jeffrey. Understanding Quantitative History. Wipf and Stock Publishers, 2011.
Hirsch, Brett D. Digital Humanities Pedagogy: Practices, Principles and Politics. Open Book Publishers, 2012
Jarausch, Konrad H., and Kenneth A. Hardy. Quantitative Methods for Historians: A Guide to Research, Data, and Statistics. UNC Press Books, 2016.
Jockers, Matthew L. Macroanalysis: Digital Methods and Literary History. University of Illinois Press, 2013.
Lavin, Matthew J. “Introduction to Cultural Analytics.” Course Syllabus and Weekly Schedule. ^[ https://humanitiesanalytics.com/da-245/spring-2022/ .
Levy-Eichel, Mordechai and Daniel Scheinerman, “Digital Humanists Need to Learn How to Count: A prominent recent book in the field suffers serious methodological pitfalls.” Review of Redlining Culture: A Data History of Racial Inequality and Postwar Fiction. Chronicle of Higher Education 17 May 2022. https://www.chronicle.com/article/digital-humanists-need-to-learn-how-to-count .
Manovich, Lev. “The Science of Culture? Social Computing, Digital Humanities and Cultural Analytics.” Journal of Cultural Analytics 1, no. 1 (May 23, 2016). https://doi.org/10.22148/16.004 .
—. Cultural Analytics. MIT Press, 2020.
Michel, Jean-Baptiste, Yuan Kui Shen, Aviva Presser Aiden, Adrian Veres, Matthew K. Gray, The Google Books Team, Joseph P. Pickett, et al. “Quantitative Analysis of Culture Using Millions of Digitized Books.” Science 331, no. 6014 (January 14, 2011): 176–82. https://doi.org/10.1126/science.1199644 .
Moretti, Franco. Distant Reading. Verso Books, 2013.
Piper, Andrew. “There Will Be Numbers.” Journal of Cultural Analytics, May 23, 2016. https://doi.org/10.22148/16.006 .
Posner, Miriam. “How Did They Make That?” https://miriamposner.com/blog/how-did-they-make-that/ .
The Programming Historian. https://programminghistorian.org/ .
Siddiqui, Nabeel. “The General Linear Model and Minimal Research Compendiums: An Approach to Assure Statistical Validity in Digital Humanities Research,” 2023. https://www.researchsquare.com/article/rs-2626035/latest .
Underwood, Ted . “A Broader Purpose,” Varieties of Digital Humanities panel, MLA, Jan 5, 2018. https://tedunderwood.com/2018/01/04/a-broader-purpose/ .
Walsh, Melanie, Introduction to Cultural Analytics & Python ( https://melaniewalsh.github.io/Intro-Cultural-Analytics/welcome.html ).