Kolloquiumsplan FS 2019

Kolloquium FS 2018: Berichte aus der aktuellen Forschung am Institut, BA/MA-Arbeiten, Programmierprojekte, Gastvorträge

Zeit/Ort: Circa alle 14 Tage am Dienstag von 10.15 Uhr bis 12.00 Uhr, BIN - 2.A.10 (!)

Verantwortlich: Martin Volk

Kontakt: Martin Volk

[Subject to changes]

Datum

Vortragende / Thema

19. Feb. 2019

Martin Volk: The Ideal Colloquium in the Age of Divided Attention / Binomial Adverbs in Parsing and Machine Translation

5. March 2019

Raphael Balimann (CL, UZH): NMT for Subtitles

Johannes Graën & Gerold Schneider (CL, UZH): From Parallel Corpora to Language Learning Applications

19. March 2019

Matthias Baumgartner (IfI, UZH): Integrating Structured and Unstructured Data Sources via Representation Learning

Tanja Samardzic (CorpusLab, UZH): Spatial Prominence and Text Frequency

2. April 2019

Phillip Ströbel (CL, UZH): The Impact of OCR on Topic Models

Mathias Müller (CL, UZH): Domain Robustness in Neural Machine Translation

16. April 2019

Susie Xi Rao (ETH): Understanding the Web of Science Using Deep Learning

Nora Hollenstein (ETH): Improving NLP with Human Data

30. April 2019

Olga Sozinova (CorpusLab, UZH): Measuring Inflectional and Derivational Complexity on the Universal Dependencies Dataset

Samuel Läubli (CL, UZH): Document-level Evaluation of Neural Machine Translation

14. May 2019 Canceled because of ...
16. May 2019 (17:15h im IfI-Kollo) Tobias Kuhn (Free University of Amsterdam): Semantic Publishing with Nanopublications

28. May 2019

Ensieh Davoodijam (Isfahan University, Iran; CL, UZH): Summarization of Biomedical Texts

Tatyana Ruzsics (CorpusLab, UZH): Stratified Text Normalization

Abstracts

 

19. February 2019

Martin Volk

Title: The ideal colloquium in the age of divided attention

Abstract: Some thoughts about the goals of the colloquium plus discussion.


Title: Binomial Adverbs in Parsing and Machine Translation

Abstract: Binomial adverbs are a subclass of multiword adverbs. They pose special problems for NLP since they sometimes consist of particles that are mostly used as prepositions (e.g. EN: by and large, DE: ab und zu). We present our study on how to identify idiomatic binomial adverbs and how well a selection of EN and DE adverbs are handled by a state-of-the-art dependency parser and an online MT system.



19. March 2019

Tanja Samardžić

Title: Spatial prominence and text frequency

Abstract: In this talk, we quantify and thus evaluate the relation between text frequency and properties of the outer-text, geographic setting by comparing text frequencies of mountain names to the respective geomorphometric characteristics. We focus on some 2000 unique mountain names that appear some 50,000 times in a large compilation of texts on Swiss alpine history. The results on the full data set suggest only a weak relation: only 5–10% of the variation in the text frequency being explained by the respective geomorphometric characteristics. However, an analysis of multiple scales allows us to identify a Simpson’s Paradox. What appears to be ‘noise’ in the analysis of all mountains in the whole of Switzerland shows significant local signals. Small spatial extents, found all over Switzerland, can show considerably strong correlations between text frequency and spatial prominence, with up to 90% of the total variation explained.