Datasets and resources created by the Language Technology for Accessibility research group.
Training data for Swiss German Sign Language (DSGS) and German (DE) for a novel task that goes beyond the standard paradigm of text-to-text machine translation by requiring the processing of visual information (video frames, human pose estimation).
Usage examples, documentation, and code available on WMT-SLT and GitHub.
Mathias Müller, Sarah Ebling, Necati Cihan Camgöz, Zifan Jiang, Alessia Battisti, Amit Moryossef, Annette Rios, Richard Bowden, Ryan Wong
Available upon request on Zenodo.
Mathias Müller, Sarah Ebling, Necati Cihan Camgöz, Zifan Jiang, Alessia Battisti, Katja Tissi, Sandra Sidler-Miserez, Regula Perrollaz, Michèle Berger, Sabine Reinhard, Amit Moryossef, Annette Rios, Richard Bowden, Ryan Wong, Robin Ribback, Severine Schori
Available upon request on Zenodo.
Mathias Müller, Sarah Ebling, Necati Cihan Camgöz, Zifan Jiang, Alessia Battisti, Katja Tissi, Sandra Sidler-Miserez, Regula Perrollaz, Michèle Berger, Sabine Reinhard, Amit Moryossef, Annette Rios, Richard Bowden, Ryan Wong
Available upon request on Zenodo.
Andreas Säuberli
Mobile (Android/iOS) app enabling participation in cloze tests, lexical decision tasks, multiple-choice question answering, n-back working memory tasks, picture naming, reaction time tests, and Simon games (working memory task) in English, Standard German, and Swiss German.
Available on GitHub.
Nicolas Spring, Annette Rios, Sarah Ebling
Alignments, extracted with LHA (Nikolov and Hahnloser, 2019), of texts at CEFR levels A2 and B1 to the original Standard German text of Austria Press Agency (Austria Presse Agentur, APA) news items from August 2018 to April 2021.
Available upon request on Zenodo.
Exploring German Multi-Level Text Simplification (2021)
Available on GitHub.
Alessia Battisti, Dominik Pfütze, Andreas Säuberli, Marek Kostrzewa, Sarah Ebling
Corpus of parallel and monolingual-only (simplified German) data compiled from web sources, with additional information on text structure, typography, and images.
A Corpus for Automatic Readability Assessment and Text Simplification of German (2020)
Available upon request on Zenodo.
Sarah Ebling, Necati Cihan Camgöz, Penny Boyes Braem, Katja Tissi, Sandra Sidler-Miserez, Stephanie Stoll, Simon Hadfield, Tobias Haug, Richard Bowden, Sandrine Tornay, Marzieh Razavi, Mathew Magimai Doss
Large-scale dataset of videotaped repeated productions of 100 vocabulary test items, with associated transcriptions and annotations, collected from 11 adult L1 signers and 19 adult L2 learners of DSGS.
SMILE Swiss German Sign Language Dataset (2018)
Available upon request on Zenodo.
Annette Rios, Nicolas Spring, Tannon Kew, Marek Kostrzewa, Andreas Säuberli, Mathias Müller, Sarah Ebling
Dataset of full articles (in German) from the Swiss news magazine '20 Minuten' paired with simplified summaries.
Scripts and instructions for downloading the data available on GitHub.
20 Minuten: A Multi-task News Summarisation Dataset for German (2023)
A New Dataset and Efficient Baselines for Document-level Text Simplification in German (2021)