

NLP Shared Tasks activities
Shared tasks are a good means to collaborate with other researchers, they let students work on a defined task in a competitive way, and generally assess the methodological competences in solving practical problems. From 2017 to 2022, we had several successful shared task participations and helped to co-organized shared tasks as well:
- CLEF-HIPE-2022 (Identifying Historical People, Places and other Entities): Shared Task on Named Entity Recognition and Linking in Multilingual Historical Documents.
- Second SIGMORPHON Shared Task on Grapheme-to-Phoneme Conversions 2021for different resource settings and languages in collaboration with Peter Makarov. We co-organized the shared task, annotated data, provided the baseline system and also had a submission with extensions of the baseline solution.
- CLEF-HIPE-2020 (Identifying Historical People, Places and other Entities) evaluation campaign on named entity processing on historical newspapers in French, German and English; co-organization of the task, providing a NER baseline system and the evaluation framework.
- CoNLL-SIGMORPHON-2020 Shared Task 1 on Multilingual Grapheme-to-Phoneme Conversion on 15 languages in collaboration with Peter Makarov; our neural solution achieved 2nd rank. Our solution is described in a short paper at the SIGMORPHON workshop.
- CoNLL-SIGMORPHON-2018 Shared Task on morphological inflection on 103 languages in collaboration with Peter Makarov; our neural solution achieved 1st rank in all settings in Task I and was competitive in Task II (our system 2018 paper). Our innovative solution for imitation learning based training was also published as a short paper at EMNLP 2018.
- VarDial Shared Task (co-located with EACL 2017) on identification of written Swiss German dialects (CGI) in collaboration with Peter Makarov; our solution achieved 3rd rank (our system paper).
- CoNLL-SIGMORPHON-2017 Shared Task on morphological inflection on 52 languages in collaboration with Tatyana Ruzsics and Peter Makarov; our neural solution achieved overall 1st rank (our system paper)
- ICDAR2017 Competition on Post-OCR Text Correction (English and French) in collaboration with Chantal Amrhein; our neural solution was the best performing system in the Error Correction Task and performed well on Error Detection (official competition paper)
- TAC KBP 2017 Event Nugget Task (Text Analysis Conference) (English, Spanish and Chinese) in collaboration with Peter Makarov; our neural solution was 1st rank for Spanish and Chinese in all subtasks, and 1st for English in the subtasks that included realis value predictions (our system paper)
Projects and Work
Teaching
Lectures
- Text Mining: Spring 2020, 2022
- Machine Learning for Natural Language Processing I & II (MA) HS 2019 to Spring 2022
- Text Mining: Semantische Rollen und relationale Fakten (BA): FS 2019 FS 2017
- Deep Learning in der Sprachtechnologie (MA)HS 2018 HS 2016
- Sentimentanalyse und Media Monitoring (BA) FS 2018 FS 2016
- Aktuelle Forschungsmethodik in der Computerlinguistik (MA) HS 2015
- Maschinelle Lernverfahren für die Sprachtechnologie (MA) FS 2018 FS 2016FS 2014
- Programmiertechniken in der CL I (BA): HS 2021 HS 2020 HS 2019 HS 2018 (partly) HS 2017 (partly) HS 2016 (partly)HS 2015 (partly)HS 2014 (partly)HS 2013 (partly) HS 2012 (partly)HS 2011 (partly)
- Thema "Automatische Erschliessungsverfahren" im Aufbaumodul "Information Retrieval" im MAS Bibliotheks- und Informationswissenschaften Automatische Erschliessung (FS 2011)
- Einführung in die Computerlinguistik I (BA):HS 2021 (partly), HS 2020 (partly) HS 2019 (partly),HS 2018 (partly), HS 2017 (partly), HS 2016 (partly)HS 2015 (partly)HS 2014 (partly)HS 2013 (partly)HS 2012 (partly)HS 2011 (partly),HS 2010,HS 2009, HS 2008, HS 2007, WS 2006)
- Finite-State-Methoden in der Sprachtechnologie (BA/MA):FS 2019 FS 2017 FS 2015,FS 2013, FS 2011,FS 2010
- Morphologie und Lexikographie (FS 2009, FS 2008, SS 2007, SS 2006)
- Programmiertechniken in der CL (only final versions) WS 2005 , SS 2005)
Seminars and colloquia : FS 2015: Crowd-sourcing für Sprachtechnologie
HS 2013: Modernes Information Retrieval und Computerlinguistik
(involved as an assistant: SS 2005, SS 2003, SS 2201, SS 2000, WS2000)
Further education: Lectures on Computational Linguistics and Text Mining in MAS "Bibliotheks- und Informationswissenschaft" and DAS Datenmanagement und Informationstechnologien
Presentations and Talks
(incomplete)
- COLING 2018: Neural Transition-based String Transduction for Limited-Resource Setting in Morphology
- KONVENS 2018: A Simple and Effective biLSTM Approach to Aspect-Based Sentiment Analysis in Social Media Customer Feedback
- NLPCS 2013 : Disambiguation of the Semantics of German Prepositions: a Case Study
- RANLP 2013
- 2nd CALBC Workshop: OntoGene at CALBC II and Some Thoughts on the Need of Document-Wide Harmonization, Second CALBC Workshop – 16/17/18 March 2011 – EBI, Hinxton
- "Electoral campaigns, relation mining, and dependency parsing: extracting semantic network data from Swiss newspaper articles", together with B. Wüest and D. Laupper, Computer-aided methods of textual analysis RECON WP 6 Workshop, Berlin, 27-28 May 2010
- WASSA 2010 (1st Workshop on Computational Approaches to Subjectivity and Sentiment Analysis) ECAI 2010: Evaluation and Extension of a Polarity Lexicon for German
- TACOS 2010: Erweiterung des Adjektivbestands eines Polaritätslexikons für Deutsch
- LV im Modul IR im MAS Bibliotheks- und Informationswissenschaften 2009
- (Colloquium talk in Konstanz from July 2007)
- Koordination und syntaktische Disambiguierung
- Computerlinguistik in Information und Dokumentation (2006)
- Automatische Termextraktion (2004 Terminologiekurs ZHW)
- Markov Models And PoS Tagging with Markov Models (englisch)
- Probabilistic Context-Free Grammars (englisch)
- GermaNet und UniNet – Anknüpfen an semantische Netze
Publications
See my ORC-ID page for references.
The following list is generated from the Zurich Open Access Repository and contains all publications up to 2021 (most of them in full text):
ZORA Publikationsliste
Download-Optionen
Publikationen
-
Searching for Legal Documents at Paragraph Level: Automating Label Generation and Use of an Extended Attention Mask for Boosting Neural Models of Semantic Similarity. In: Proceedings of the Natural Legal Language Processing Workshop 2021, Punta Cana, Dominican Republic, 10 November 2021. Association for Computational Linguistics, 114-122.
-
CLUZH at SIGMORPHON 2021 Shared Task on Multilingual Grapheme-to-Phoneme Conversion: Variations on a Baseline. In: 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Online, 5 August 2021. Association for Computational Linguistics, 148-153.
-
Results of the Second SIGMORPHON Shared Task on Multilingual Grapheme-to-Phoneme Conversion. In: Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Online, 5 August 2021. Association for Computational Linguistics, 115-125.
-
Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers. Journal of Data Mining in Genomics & Proteomics:online.
-
Text Zoning and Classification for Job Advertisements in German, French and English. In: Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science, Online, 1 November 2020 - 30 November 2020. Association for Computational Linguistics, 83-93.
-
Extended Overview of CLEF HIPE 2020: Named Entity Processing on Historical Newspapers. In: Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum, Thessaloniki, Greece, 22 September 2020 - 25 September 2020, CEUR-WS.
-
Semi-supervised Contextual Historical Text Normalization. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 1 July 2020, Association for Computational Linguistics.
-
CLUZH at SIGMORPHON 2020 Shared Task on Multilingual Grapheme-to-Phoneme Conversion. In: Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Online, 1 July 2020, Association for Computational Linguistics.
-
Ranking Georeferences for Efficient Crowdsourcing of Toponym Annotations in a Historical Corpus of Alpine Texts. In: 5th Swiss Text Analytics Conference (SwissText) & 16th Conference on Natural Language Processing (KONVENS), Zurich, 23 June 2020 - 25 June 2020. CEUR-WS, online.
-
How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCR. In: Proceedings of the 12th Language Resources and Evaluation Conference, Marseille, 1 May 2020 - 2 May 2020. ACL Anthology, 3551-3559.
-
Overview of CLEF HIPE 2020: Named Entity Recognition and Linking on Historical Newspapers. In: Arampatzis, Avi; Kanoulas, Evangelos; Tsikrika, Theodora; Vrochidis, Stefanos; Joho, Hideo; Lioma, Christina; Eickhoff, Carsten; Névéol, Aurélie; Cappellato, Linda; Ferro, Nicola. Experimental IR Meets Multilinguality, Multimodality, and Interaction. Cham: Springer, 288-310.
-
Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers. In: Jose, Joemon M; Yilmaz, Emine; Magalhães, João; Castells, Pablo; Ferro, Nicola; Silva, Mário J; Martins, Flávio. Advances in Information Retrieval: 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, April 14–17, 2020, Proceedings, Part II. Cham: Springer, 524-532.
-
Geotagging a diachronic corpus of alpine texts: comparing distinct approaches to toponym recognition. In: RANLP 2019, Workshop on Language technology for digital historical archives with a special focus on Central-, (South-)Eastern Europe, Middle East and North Africa, Varna, Bulgaria, 5 September 2019. RANLP, 11-18.
-
Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection. In: Challenges in the Management of Large Corpora (CMLC-7), Cardiff, Wales, 22 Juli 2019 - 22 Juli 2019.
-
Variable article use with acronyms and initialisms: a contrastive analysis of English, German and Italian. Languages in Contrast, 19(1):48-78.
-
Improving OCR of Black Letter in Historical Newspapers: The Unreasonable Effectiveness of HTR Models on Low-Resolution Images. Utrecht: Digital Humanities 2019.
-
A Simple and Effective biLSTM Approach to Aspect-Based Sentiment Analysis in Social Media Customer Feedback. In: Barbaresi, Adrien; Biber, Hanno; Neubarth, Friedrich; Osswald, Rainer. 14th Conference on Natural Language Processing - KONVENS 2018. Vienna: Verlag der Österreichischen Akademie der Wissenschaften, 29-33.
-
Imitation Learning for Neural Morphological String Transduction. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October 2018 - 4 November 2018. Association for Computational Linguistics, 2877-2882.
-
UZH at CoNLL-SIGMORPHON 2018 Shared Task on Universal Morphological Reinflection. In: CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection, Brussels, Belgium, 31 October 2018. Association for Computational Linguistics, 69-75.
-
Neural Transition-based String Transduction for Limited-Resource Setting in Morphology. In: Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA, 20 August 2018 - 26 August 2018. Association for Computational Linguistics, 83-93.
-
A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish. In: Mitkov, Ruslan; Monti, Johanna; Corpas Pastor, Gloria; Seretan, Violeta. Multiword Units in Machine Translation and Translation Technology. Amsterdam: John Benjamins, 125-145.
-
Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus. Journal for Language Technology and Computational Linguistics, 33(1):25-47.
-
Supervised OCR Error Detection and Correction Using Statistical and Neural Machine Translation Methods. Journal for Language Technology and Computational Linguistics, 33(1):49-76.
-
Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanitie. In: Teaching NLP for Digital Humanitie, Berlin, 12 September 2017 - 12 September 2017, 17-22.
-
Align and Copy: UZH at SIGMORPHON 2017 Shared Task for Morphological Reinflection. In: 15th Annual SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology at CoNLL 2017, Vancouver, Canada, 3 August 2017 - 4 August 2017. Association for Computational Linguistics, 49-57.
-
Verb-mediated Composition of Attitude Relations Comprising Reader and Writer Perspective. In: 18th International Conference on Computational Linguistics and Intelligent Text Processing, Budapest, 17 April 2017 - 23 April 2017, ResearchBib.
-
CLUZH at VarDial GDI 2017: Testing a Variety of Machine Learning Tools for the Classification of Swiss German Dialects. In: Fourth Workshop on NLP for Similar Languages, Varieties and Dialects, Valencia, 3 April 2017. Association for Computational Linguistics, 170-177.
-
Stance Detection in Facebook Posts of a German Right-wing Party. In: LSDSem 2017/LSD-Sem Linking Models of Lexical, Sentential and Discourse-level Semantics, Valencia, 3 April 2017, ResearchBib.
-
Crowdsourcing Swiss Dialect Transcriptions for Assessing Factors in Writing Variations. In: Proceedings of the 13th Conference on Natural Language Processing (KONVENS) Bochum, Germany September 19–21, 2016, Bochum, 19 September 2016 - 21 September 2016. Universitätsverlag Ruhr-Universität Bochum, 62-67.
-
Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs. In: KONVENS 2016, Bochum, 19 September 2016 - 21 September 2016.
-
How factuality determines sentiment inferences. In: Proceedings of *SEM 2016: The fifth joint conference on lexical and computational semantics, Berlin, 7 August 2016 - 12 August 2016. s.n., 75-84.
-
Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia, 23 May 2016 - 28 May 2016. European Language Resources Association (ELRA), 975-982.
-
Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database. In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016 - 28 May 2016, 20-23.
-
Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora. In: Corpas Pastor, Gloria. Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües. Geneva: Tradulex, n/a.
-
Track 4 Overview: Extraction of Causal Network Information in Biological Expression Language (BEL). In: BioCreative V, Sevilla, 9 September 2015 - 11 September 2015, 333-346.
-
Ontogene Term and Relation Recognition for CDR. In: BioCreative V, Sevilla, 9 September 2015 - 11 September 2015, 305-310.
-
Challenges in the alignment, management and exploitation of large and richly annotated multi-parallel corpora. In: 3rd Workshop on the Challenges in the Management of Large Corpora, Lancaster, 20 July 2015 - 20 July 2015, 15-20.
-
Reflections and a Proposal for a Query and Reporting Language for Richly Annotated Multiparallel Corpora. In: Gintare, Grigonyte; Clematide, Simon; Utka, Andrius; Volk, Martin. Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania. Linköping, Sweden: Linköping University Electronic Press, Linköpings universitet, 6-16.
-
Detecting Code-Switching in a Multilingual Alpine Heritage Corpus. In: Proceedings of the First Workshop on Computational Approaches to Code Switching, Doha, Qatar, 25 October 2014. Association for Computational Linguistics, 24-33.
-
Tagging Complex Non-Verbal German Chunks with Conditional Random Fields. In: Proceedings of the 12th Edition of the KONVENS Converence, Hildesheim, Germany, October 8-10, 2014, Hildesheim, Germany, 8 October 2014 - 10 October 2014, 48-57.
-
Collection‐Wide Extraction of Protein‐Protein Interactions. In: 6th International Symposium on Semantic Mining in Biomedicine, Aveiro, Portugal, 6 October 2014 - 7 October 2014. s.n., 61-66.
-
OntoGene web services for biomedical text mining. BMC Bioinformatics, 15(Suppl 14):S6.
-
Using Large Biomedical Databases as Gold Annotations for Automatic Relation Extraction. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland, 2014. European Language Resources Association (ELRA), 3736-3741.
-
Disambiguation of the Semantics of German Prepositions: a Case Study. In: Proceedings of NLPCS 2013: 10th International Workshop on Natural Language Processing and Cognitive Science, Marseille, France — Octobre 2013, Marseille, France, 15 October 2013 - 16 October 2013. s.n., 137-150.
-
ODIN: a customizable literature curation tool. In: Fourth BioCreative Challenge Evaluation Workshop, Bethesda, MD, US, 7 October 2013 - 9 October 2013, 219-223.
-
Assisted curation of growth conditions that affect gene expression in E. coli K-12. In: Proceedings of the Fourth BioCreative Challenge Evaluation Workshop, Bethesda, MD, US, 7 October 2013 - 9 October 2013, 214-218.
-
OntoGene: CTD entity and action term recognition. In: Fourth BioCreative Challenge Evaluation Workshop, Bethesda, MD, US, 7 October 2013 - 9 October 2013, 90-94.
-
Creating Multilingual Gold Standard Corpora for Biomedical Concept Recognition. In: CLEF 2013: Evaluation Labs and Workshop: Online Working Notes, Valencia, Spain, 23 September 2013 - 26 September 2013, CLEF.
-
Deriving an English Biomedical Silver Standard Corpus for CLEF-ER. In: CLEF 2013: Evaluation Labs and Workshop: Online Working Notes, Valencia, Spain, 23 September 2013 - 26 September 2013, s.n..
-
Exploiting BabelNet for multilingual biomedical synonym expansion. In: CLEF 2013: Evaluation Labs and Workshop: Online Working Notes, September 23-26, 2013, Valencia, Spain, Valencia, Spain, 23 September 2013, 156.
-
A Pilot Study on the Semantic Classification of Two German Prepositions: Combining Monolingual and Multilingual Evidence. In: Proceedings of the International Conference Recent Advances in Natural Language Processing RANLP 2013, Hissar, Bulgaria, 7 September 2013 - 13 September 2013. Bulgaria, 148-155.
-
UZH in BioNLP 2013. In: Proceedings of the BioNLP Shared Task 2013 Workshop, Sophia, Bulgaria, 9 August 2013 - 9 August 2013, 116-120.
-
How preferred are preferred terms?. In: Kosem, I; Kallas, J; Gantar, P; Krek, S; Langemets, M; Tuulik, M. Electronic lexicography in the 21st century: thinking outside the paper. Proceedings of the eLex 2013 conference, 17-19 October 2013, Tallinn, Estonia. Ljubljana/Tallinn: eLex, 452-459.
-
A case study in tagging case in german: an assessment of statistical approaches. In: Mahlow, Cerstin; Piotrowski, Michael. Systems and Frameworks for Computational Morphology. Heidelberg New York Dordrecht London: Springer, 22-34.
-
Entity recognition in parallel multi-lingual biomedical corpora: The CLEF-ER laboratory overview. In: Forner, Pamela; Mueller, Henning; Rosso, Paolo; Paredes, Roberto. Information Access Evaluation. Multilinguality, Multimodality, and Visualization. Valencia: Springer, 353-367.
-
Using the OntoGene pipeline for the triage task of BioCreative 2012. Database, 2013:bas053.
-
Dependency parsing for interaction detection in pharmacogenomics. In: LREC 2012: The eighth international conference on Language Resources and Evaluation, Istanbul, 21 May 2012 - 25 May 2012.
-
Ranking of CTD articles and interactions using the OntoGene pipeline. In: 2012 {BioCreative} workshop, Washington D.C., 4 April 2012 - 5 April 2012.
-
Compositional syntax-based phrase-level polarity annotation for German. In: The 10th International Workshop on Treebanks and Linguistic Theories (TLT 2012), Heidelberg, 6 January 2012 - 7 January 2012, online.
-
Using ODIN for a PharmGKB re-validation experiment. Database: The Journal Of Biological Databases And Curation, 2012:bas021.
-
Relation Mining Experiments in the Pharmacogenomics Domain. Journal of Biomedical Informatics, 45(5):851-861.
-
Ranking Interactions for a Curation Task. In: 10th International Conference on Machine Learning and Applications and Workshops, Honolulu, Hawaii USA, 18 December 2011 - 21 December 2011. IEEE Computer Society, 100-105.
-
Detection of interaction articles and experimental methods in biomedical literature. BMC Bioinformatics, 12(Suppl 8):S13.
-
Generating inflection variants of multi-word terms for French and German. In: Conference of the German Society for Computational Linguistics and Language Technology (GSCL) 2011, Hamburg, Germany, 28 September 2011 - 30 September 2011, 33-37.
-
Semi-automatic test generation for tandem learning. In: Speech and Language Technology in Education, Venice, 24 August 2011 - 26 August 2011.
-
An incremental model for the coreference resolution task of BioNLP 2011. In: BioNLP 2011, Portland, Oregon, USA, 23 June 2011 - 24 June 2011. Association for Computational Linguistics (ACL), 151-152.
-
OntoGene at CALBC II and Some Thoughts on the Need of Document-Wide Harmonization. In: Second CALBC Workshop, Hinxton, Cambridgeshire, UK, 16 March 2011 - 18 March 2011, 48-51.
-
Mining complex Drug/Gene/Disease relations. In: Pacific Symposium on Biocomputing Workshop "Mining the Pharmacogenomics Literature", Hawaii, 3 January 2011 - 7 January 2011.
-
BioCreative III interactive task: an overview. BMC Bioinformatics, 12(Suppl 8):S4.
-
Electoral Campaigns and Relation Mining: Extracting Semantic Network Data from Newspaper Articles. Journal of Information Technology & Politics, 8(4):444-463.
-
Assessment of NER solutions against the first and second CALBC Silver Standard Corpus. Journal of Biomedical Semantics, 2(Suppl 5):S11.
-
ODIN: an advanced interface for the curation of biomedical literature. Nature Precedings:online.
-
OntoGene (Team 65): preliminary analysis of participation in BioCreative III. In: BioCreative III workshop, Bethesda, Maryland, 13 September 2010 - 15 September 2010.
-
Evaluation and extension of a polarity lexicon for German. In: Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA); Held in conjunction to ECAI 2010 Portugal, Lisbon, Portugal, 17 August 2010 - 17 August 2010, 7-13.
-
OntoGene in BioCreative II.5. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 7(3):472-480.
-
OntoGene in CALBC. In: First CALBC Workshop, Hinxton, Cambridgeshire, UK, 17 June 2010 - 18 June 2010, 30-31.
-
Constructing a Constructional MWE Lexicon for psycho-conceptual Annotation: Evaluation of CPA and DuELME for Lexicographic Description. In: Dykstra, A; Schoonheim, T. Proceedings of the XIV Euralex International Congress. Leeuwarden, NL: Fryske Akademy, 402-410.
-
Effective Mining of Protein Interactions. In: Third international symposium on languages in biology and medecine (LBM 2009), Jeju Island, South Korea, 8 November 2009 - 10 November 2009, 115-118.
-
A morpho-syntactic generation service for German glossary entries. In: Clematide, S; Klenner, M; Volk, Martin. Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday. Münster, Germany: Monsenstein und Vannerdat, 33-43.
-
Koordination im Deutschen und ihre syntaktische Desambiguierung. 2009, University of Zurich, Faculty of Arts.
-
Towards automatic detection of experimental methods from biomedical literature. In: Third International Symposium on Semantic Mining in Biomedicine (SMBM 2008), Turku, Finland, 1 September 2008 - 3 September 2008, 61-68.
-
OntoGene in BioCreative II. Genome Biology, 9(Suppl 2):S13.
-
Ein elektronisches Lexikon im OLIF-Format für die Erzählanalyse. In: XIII. Euralex International Congress, Barcelona, Spain, 15 July 2008 - 19 July 2008, 729-735.
-
An OLIF-based open inflectional resource and yet another morphological system for German. In: Storrer, A; Geyken, A; Siebert, A; Würzner, K M. Text Resources and Lexical Knowledge. Berlin, Germany: Mouton de Gruyter, 183-194.
-
What (the Hell) is Wrong? An Approach to Semi-automatic Construction of Self Correction Tests. In: Workshop on NLP for Educational Resources. In conjunction with RANLP07, Borovets, Bulgaria, 2007 - 2007, 15-22.
-
The importance of how-questions in technical domains. In: Proc of the Question-Answering workshop of TALN 04, Fez, Morocco, April 2004 - April 2004, 451-460.
-
GermaNet und UniNet. LDV-Forum, 19(1/2):137-142.
-
Selektive Evaluation von robusten Parsern. In: Konvens 2002, 6. Konferenz zur Verarbeitung natürlicher Sprache, Proceedings, Saarbrcken, September 2002, 23-29.
-
Linguistische und semantische Annotation eines Zeitungskorpus. In: GLDV-Jahrestagung, Giessen, 28 March 2001 - 30 March 2001, 201-209.
-
Learn-filter-apply-forget. Mixed approaches to named entity recognition. In: 6th International Workshop on Applications of Natural Language for Informations Systems, Madrid, Spain, 2001.
-
LUIS - Ein natürlichsprachliches, universitäres Informationssystem. In: Unternehmen Hochschule, Wien, 2001, 115-126.