Many-to-Many Sentence Alignments
Dähne, Janis; Ritter, Jörg
Martin-Luther-Universität Halle-Wittenberg, Germany
In this paper, we tackle an issue in the sentence-level collation of two text witnesses. The issue occurs when a sentence is split into two or more sentences during revision, e.g., to insert a new sentence in between, or when several sentences are combined into one.
Disentangling Babel: How patterns of disagreement in parallel corpora can contribute to improving Translation Alignment guidelines and Gold Standard design for Ancient Greek
Palladino, Chiara (1); Shamsian, Farnoosh (2); Yousef, Tariq (3)
1: Furman University, United States of America; 2: Leipzig University, Germany; 3: University of Southern Denmark, Denmark
We present an investigation into a Gold Standard parallel corpus of Ancient Greek texts, focusing on disagreement between the two annotators who created the dataset. The goal is to identify causes of inconsistency and to contribute to the improvement of Gold Standards and Alignment Guidelines for ancient corpora.
From a publishing tool to a virtual research environment: producing scholarly editions with TEI Publisher
Meier, Wolfgang (1); Wicentowski, Joseph C. (2); Turska, Magdalena (1); Bermúdez Sabel, Helena (3)
1: e-editiones; 2: Office of the Historian, U.S. Department of State; 3: JinnTec
TEI Publisher provides the means for data storage, modelling, curation, analysis and visualisation while enabling collaborative work. Recent additions include form-based editing for metadata and curation of authority files, as well as NER support, including model training and a visual editor for adding semantic markup.