Unraveling the Threads of Language: Exploring Patterns in Digital Humanities Research
Chair: Zhou, Mengyuan

Many-to-Many Sentence Alignments

Dähne, Janis; Ritter, Jörg

Martin-Luther-Universität Halle-Wittenberg, Germany

In this paper, we tackle an issue with sentence collation of two text witnesses. The issue occurs when sentences are split into two (or more) sentences during revision, e.g., to insert a new sentence in between, or when sentences are combined into one sentence.


Disentangling Babel: How patterns of disagreement in parallel corpora can contribute to improving Translation Alignment guidelines and Gold Standard design for Ancient Greek

Palladino, Chiara (1); Shamsian, Farnoosh (2); Yousef, Tariq (3)

1: Furman University, United States of America; 2: Leipzig University, Germany; 3: University of Southern Denmark, Denmark

HTML XML

We present an investigation into a Gold Standard parallel corpus of Ancient Greek texts, focusing on disagreement across the two annotators who created the dataset, with the goal of investigating causes of inconsistency and of contributing to the improvement of Gold Standards and Alignment Guidelines for ancient corpora.


From a publishing tool to a virtual research environment: producing scholarly editions with TEI Publisher

Meier, Wolfgang (1); Wicentowski, Joseph C. (2); Turska, Magdalena (1); Bermúdez Sabel, Helena (3)

1: e-editiones; 2: Office of the Historian, U.S. Department of State; 3: JinnTec

TEI Publisher provides the means for data storage, modelling, curation, analysis and visualisation while enabling collaborative work. The recent additions to it include form-based editing for metadata and curation of authority files as well as NER support, including training the model and visual editor for adding semantic markup.