Simon Clematide

Simon Clematide, Dr.

Senior Academic Associate

Tel.: +41 44 635 7132
Room: AND-2-30

E-Mail

NLP Shared Tasks activities

Shared tasks are a good way to collaborate with other researchers: they let students work on a well-defined task in a competitive setting and, more generally, assess methodological competence in solving practical problems. From 2017 to 2022, we participated successfully in several shared tasks and helped co-organize others:

CLEF-HIPE-2022 (Identifying Historical People, Places and other Entities): Shared Task on Named Entity Recognition and Linking in Multilingual Historical Documents.
Second SIGMORPHON Shared Task on Grapheme-to-Phoneme Conversion 2021 for different resource settings and languages, in collaboration with Peter Makarov. We co-organized the shared task, annotated data, provided the baseline system, and also submitted a system that extends the baseline.
CLEF-HIPE-2020 (Identifying Historical People, Places and other Entities) evaluation campaign on named entity processing on historical newspapers in French, German and English; co-organization of the task, providing a NER baseline system and the evaluation framework.
CoNLL-SIGMORPHON-2020 Shared Task 1 on Multilingual Grapheme-to-Phoneme Conversion on 15 languages in collaboration with Peter Makarov; our neural solution achieved 2nd rank. Our solution is described in a short paper at the SIGMORPHON workshop.
CoNLL-SIGMORPHON-2018 Shared Task on morphological inflection on 103 languages in collaboration with Peter Makarov; our neural solution achieved 1st rank in all settings in Task I and was competitive in Task II (our 2018 system paper). Our innovative imitation-learning-based training approach was also published as a short paper at EMNLP 2018.
VarDial Shared Task (co-located with EACL 2017) on identification of written Swiss German dialects (GDI) in collaboration with Peter Makarov; our solution achieved 3rd rank (our system paper).
CoNLL-SIGMORPHON-2017 Shared Task on morphological inflection on 52 languages in collaboration with Tatyana Ruzsics and Peter Makarov; our neural solution achieved overall 1st rank (our system paper).
ICDAR2017 Competition on Post-OCR Text Correction (English and French) in collaboration with Chantal Amrhein; our neural solution was the best-performing system in the Error Correction Task and performed well on Error Detection (official competition paper).
TAC KBP 2017 Event Nugget Task (Text Analysis Conference) (English, Spanish and Chinese) in collaboration with Peter Makarov; our neural solution ranked 1st for Spanish and Chinese in all subtasks, and 1st for English in the subtasks that included realis value predictions (our system paper).

Projects and Work

NFP 77 project "Task and Skill Profiles in the Digital Economy"; Sinergia project impresso; Citizen Linguistics Project on Swiss German Dialects and Swiss French Citizen Linguistics (www.tonaccent.ch, www.dindialaekt.ch); Exploitation of linguistically annotated multilingual multiparallel corpora: SPARCLING; Multilingual Sentiment Analysis: KTI project together with Eurospider; Biomedical Text Mining: MANTRA, SASEBio; Political Text Mining: COSA; German Sentiment Analysis: IGGSA; Sentence Extension Tests: SET; Web-based Virtual Laboratory for Computational Linguistics: CLab

I generally enjoy interdisciplinary work and sometimes consult for scholars who apply text analytics methods to their problems. One example is the concept extraction for the materials science journal article "A framework for evaluating the accessibility of raw materials from end-of-life products and the Earth’s crust" (work with Sandra R. Mueller). Another example, from the social sciences, is "Text Zoning for Job Advertisement with Bidirectional LSTMs" in 2017 (work with Ann-Sophie Gnehm).

Teaching

Lectures

Text Mining: Spring 2020, 2022
Machine Learning for Natural Language Processing I & II (MA): Fall 2019 to Spring 2022
Text Mining: Semantic Roles and Relational Facts (BA): Spring 2019, Spring 2017
Deep Learning in Language Technology (MA): Fall 2018, Fall 2016
Sentiment Analysis and Media Monitoring (BA): Spring 2018, Spring 2016
Current Research Methodology in Computational Linguistics (MA): Fall 2015
Machine Learning Methods for Language Technology (MA): Spring 2018, Spring 2016, Spring 2014
Programming Techniques in Computational Linguistics I (BA): Fall 2021, Fall 2020, Fall 2019, Fall 2018 (partly), Fall 2017 (partly), Fall 2016 (partly), Fall 2015 (partly), Fall 2014 (partly), Fall 2013 (partly), Fall 2012 (partly), Fall 2011 (partly)
Topic "Automatic Indexing Methods" in the advanced module "Information Retrieval" of the MAS in Library and Information Science: Automatic Indexing (Spring 2011)
Introduction to Computational Linguistics I (BA): Fall 2021 (partly), Fall 2020 (partly), Fall 2019 (partly), Fall 2018 (partly), Fall 2017 (partly), Fall 2016 (partly), Fall 2015 (partly), Fall 2014 (partly), Fall 2013 (partly), Fall 2012 (partly), Fall 2011 (partly), Fall 2010, Fall 2009, Fall 2008, Fall 2007, Winter 2006
Finite-State Methods in Language Technology (BA/MA): Spring 2019, Spring 2017, Spring 2015, Spring 2013, Spring 2011, Spring 2010
Morphology and Lexicography: Spring 2009, Spring 2008, Summer 2007, Summer 2006
Programming Techniques in Computational Linguistics (only final versions): Winter 2005, Summer 2005

Seminars and colloquia: Spring 2015: Crowdsourcing for Language Technology
Fall 2013: Modern Information Retrieval and Computational Linguistics
(involved as an assistant: Summer 2005, Summer 2003, Summer 2201, Summer 2000, Winter 2000)

Further education: Lectures on Computational Linguistics and Text Mining in the MAS "Library and Information Science" and the DAS "Data Management and Information Technologies"

Presentations and Talks

(incomplete)

COLING 2018: Neural Transition-based String Transduction for Limited-Resource Setting in Morphology
KONVENS 2018: A Simple and Effective biLSTM Approach to Aspect-Based Sentiment Analysis in Social Media Customer Feedback
NLPCS 2013: Disambiguation of the Semantics of German Prepositions: a Case Study
RANLP 2013
2nd CALBC Workshop: OntoGene at CALBC II and Some Thoughts on the Need of Document-Wide Harmonization, Second CALBC Workshop – 16/17/18 March 2011 – EBI, Hinxton
"Electoral campaigns, relation mining, and dependency parsing: extracting semantic network data from Swiss newspaper articles", together with B. Wüest and D. Laupper, Computer-aided methods of textual analysis RECON WP 6 Workshop, Berlin, 27-28 May 2010
WASSA 2010 (1st Workshop on Computational Approaches to Subjectivity and Sentiment Analysis) ECAI 2010: Evaluation and Extension of a Polarity Lexicon for German
TACOS 2010: Extending the Adjective Inventory of a Polarity Lexicon for German
Lecture in the IR module of the MAS in Library and Information Science, 2009
(Colloquium talk in Konstanz from July 2007)
E-Learning in Computational Linguistics at the University of Zurich (PDF, 947 KB) (PDF, 925 KB)
Coordination and Syntactic Disambiguation
Computational Linguistics in Information and Documentation (2006)
Automatic Term Extraction (2004, terminology course at ZHW)
Markov Models and PoS Tagging with Markov Models (in English)
Probabilistic Context-Free Grammars (in English)
GermaNet and UniNet: Connecting to Semantic Networks

Publications

See my ORCID page for references.

The following list is generated from the Zurich Open Access Repository and contains all publications up to 2021 (most of them in full text):

Tang, L., & Clematide, S. (2021). Searching for Legal Documents at Paragraph Level: Automating Label Generation and Use of an Extended Attention Mask for Boosting Neural Models of Semantic Similarity 114–122. https://doi.org/10.18653/v1/2021.nllp-1.12
Clematide, S., & Makarov, P. (2021). CLUZH at SIGMORPHON 2021 Shared Task on Multilingual Grapheme-to-Phoneme Conversion: Variations on a Baseline 148–153. https://doi.org/10.18653/v1/2021.sigmorphon-1.17
Ashby, L. F. E., Bartley, T. M., Clematide, S., Del Signore, L., Gibson, C., Gorman, K., Lee-Sikka, Y., Makarov, P., Malanoski, A., Miller, S., Ortiz, O., Raff, R., Sengupta, A., Seo, B., Spektor, Y., & Yan, W. (2021). Results of the Second SIGMORPHON Shared Task on Multilingual Grapheme-to-Phoneme Conversion 115–125. https://doi.org/10.18653/v1/2021.sigmorphon-1.13
Lüngen, H., Kupietz, M., Bański, P., Barbaresi, A., Clematide, S., & Pisetta, I. (Eds.). (2021). Proceedings of the Workshop on Challenges in the Management of Large Corpora (CMLC-9) 2021. Limerick, 12 July 2021 (Online-Event) Leibniz-Institut für Deutsche Sprache. https://doi.org/10.14618/ids-pub-10467
Barman, R., Ehrmann, M., Clematide, S., Oliveira, S. A., & Kaplan, F. (2021). Combining Visual and Textual Features for Semantic Segmentation of Historical Newspapers Journal of Data Mining & Digital Humanities, online. https://doi.org/10.46298/jdmdh.6107
Gnehm, A.-S., & Clematide, S. (2020). Text Zoning and Classification for Job Advertisements in German, French and English 83–93. https://doi.org/10.18653/v1/2020.nlpcss-1.10
Ehrmann, M., Romanello, M., Flückiger, A., & Clematide, S. (2020, September 25). Extended Overview of CLEF HIPE 2020: Named Entity Processing on Historical Newspapers CEUR Workshop Proceedings, Article 2696. Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum, Thessaloniki. http://ceur-ws.org/Vol-2696/paper_255.pdf
Makarov, P., & Clematide, S. (2020, July 1). CLUZH at SIGMORPHON 2020 Shared Task on Multilingual Grapheme-to-Phoneme Conversion Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Online. https://doi.org/10.18653/v1/2020.sigmorphon-1.19
Makarov, P., & Clematide, S. (2020, July 1). Semi-supervised Contextual Historical Text Normalization Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online. https://doi.org/10.18653/v1/2020.acl-main.650
Goldzycher, J., Meraner, I., Volk, M., & Clematide, S. (2020). Ranking Georeferences for Efficient Crowdsourcing of Toponym Annotations in a Historical Corpus of Alpine Texts CEUR Workshop Proceedings, online. http://ceur-ws.org/Vol-2624/paper11.pdf
Ströbel, P. B., Clematide, S., & Volk, M. (2020). How Much Data Do You Need? About the Creation of a Ground Truth for Black Letter and the Effectiveness of Neural OCR 3551–3559. https://www.aclweb.org/anthology/2020.lrec-1.436
Bański, P., Barbaresi, A., Clematide, S., Kupietz, M., Lüngen, H., & Pisetta, I. (Eds.). (2020). Proceedings of the LREC 2020: 8th Workshop on Challenges in the Management of Large Corpora (CMLC-8) European Language Resources Association. https://www.aclweb.org/anthology/2020.cmlc-1.0
Ehrmann, M., Romanello, M., Flückiger, A., & Clematide, S. (2020). Overview of CLEF HIPE 2020: Named Entity Recognition and Linking on Historical Newspapers In A. Arampatzis, E. Kanoulas, T. Tsikrika, S. Vrochidis, H. Joho, C. Lioma, C. Eickhoff, A. Névéol, L. Cappellato, & N. Ferro (Eds.), Experimental IR Meets Multilinguality, Multimodality, and Interaction (No. 12260; pp. 288–310). Springer. https://doi.org/10.1007/978-3-030-58219-7_21
Ehrmann, M., Romanello, M., Bircher, S., & Clematide, S. (2020). Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers In J. M. Jose, E. Yilmaz, J. Magalhães, P. Castells, N. Ferro, M. J. Silva, & F. Martins (Eds.), Advances in Information Retrieval: 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, April 14–17, 2020, Proceedings, Part II (No. 12036; Vol. 12036, pp. 524–532). Springer. https://doi.org/10.1007/978-3-030-45442-5_68
Kew, T., Shaitarova, A., Meraner, I., Clematide, S., Goldzycher, J., & Volk, M. (2019). Geotagging a diachronic corpus of alpine texts: comparing distinct approaches to toponym recognition 11–18. https://doi.org/10.26615/978-954-452-059-5_003
Graën, J., Kew, T., Shaitarova, A., & Volk, M. (2019). Modelling Large Parallel Corpora: The Zurich Parallel Corpus Collection (P. Bański, A. Barbaresi, H. Biber, E. Breiteneder, S. Clematide, M. Kupietz, H. Lüngen, & C. Iliadi, Eds.). Leibniz-Institut für Deutsche Sprache. https://doi.org/10.14618/ids-pub-9020
Banski, P., Barbaresi, A., Biber, H., Breiteneder, E., Clematide, S., Kupietz, M., Lüngen, H., & Iliadi, C. (Eds.). (2019). Challenges in the Management of Large Corpora (CMLC-7) 2019 Leibniz-Institut für Deutsche Sprache. https://doi.org/10.14618/ids-pub-8998
Ströbel, P. B., & Clematide, S. (2019). Improving OCR of Black Letter in Historical Newspapers: The Unreasonable Effectiveness of HTR Models on Low-Resolution Images https://dev.clariah.nl/files/dh2019/boa/0694.html
Callegaro, E., Clematide, S., Hundt, M., & Wick, S. (2019). Variable article use with acronyms and initialisms: a contrastive analysis of English, German and Italian Languages in Contrast, 19, 48–78. https://doi.org/10.1075/lic.16021.cal
Clematide, S. (2018). A Simple and Effective biLSTM Approach to Aspect-Based Sentiment Analysis in Social Media Customer Feedback In A. Barbaresi, H. Biber, F. Neubarth, & R. Osswald (Eds.), 14th Conference on Natural Language Processing - KONVENS 2018 (pp. 29–33). Verlag der Österreichischen Akademie der Wissenschaften. https://epub.oeaw.ac.at/?arp=0x003a238a
Makarov, P., & Clematide, S. (2018). Imitation Learning for Neural Morphological String Transduction 2877–2882. http://www.aclweb.org/anthology/D18-1314
Makarov, P., & Clematide, S. (2018). UZH at CoNLL-SIGMORPHON 2018 Shared Task on Universal Morphological Reinflection 69–75. http://www.aclweb.org/anthology/K18-3008
Makarov, P., & Clematide, S. (2018). Neural Transition-based String Transduction for Limited-Resource Setting in Morphology 83–93. http://aclweb.org/anthology/C18-1008
Banski, P., Kupietz, M., Barbaresi, A., Biber, H., Breiteneder, E., Clematide, S., & Witt, A. (Eds.). (2018). Challenges in the Management of Large Corpora (CMLC-6) European Language Resources Association (ELRA). http://lrec-conf.org/workshops/lrec2018/W17/index.html
Clematide, S., Lehner, S., Graën, J., & Volk, M. (2018). A multilingual gold standard for translation spotting of German compounds and their corresponding multiword units in English, French, Italian and Spanish In R. Mitkov, J. Monti, G. Corpas Pastor, & V. Seretan (Eds.), Multiword Units in Machine Translation and Translation Technology (No. 341; pp. 125–145). John Benjamins. https://doi.org/10.1075/cilt.341
Amrhein, C., & Clematide, S. (2018). Supervised OCR Error Detection and Correction Using Statistical and Neural Machine Translation Methods Journal for Language Technology and Computational Linguistics, 33, 49–76. https://jlcl.org/content/2-allissues/1-heft1-2018/jlcl_2018-1_3.pdf
Clematide, S., Furrer, L., & Volk, M. (2018). Crowdsourcing the OCR Ground Truth of a German and French Cultural Heritage Corpus Journal for Language Technology and Computational Linguistics, 33, 25–47. https://jlcl.org/content/2-allissues/1-heft1-2018/jlcl_2018-1_2.pdf
Clematide, S., Meraner, I., Bubenhofer, N., & Volk, M. (2017). Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanities 17–22.
Makarov, P., Ruzsics, T., & Clematide, S. (2017). Align and Copy: UZH at SIGMORPHON 2017 Shared Task for Morphological Reinflection 49–57. https://doi.org/10.18653/v1/K17-2004
Bański, P., Kupietz, M., Lüngen, H., Rayson, P., Biber, H., Breiteneder, E., Clematide, S., Mariani, J., Stevenson, M., & Sick, T. (Eds.). (2017). Proceedings of the workshop on challenges in the management of large corpora and big data and natural language processing (CMLC-5+BigNLP) 2017 including the papers from the web-as-corpus (WAC-XI) guest section. Birmingham, 24 July 2017 Institut für Deutsche Sprache. http://nbn-resolving.de/urn:nbn:de:bsz:mh39-62434
Klenner, M., Clematide, S., & Tuggener, D. (2017, April 23). Verb-mediated Composition of Attitude Relations Comprising Reader and Writer Perspective 18th International Conference on Computational Linguistics and Intelligent Text Processing, Budapest. https://doi.org/10.1007/978-3-319-77116-8_11
Klenner, M., Tuggener, D., & Clematide, S. (2017, April 3). Stance Detection in Facebook Posts of a German Right-wing Party LSDSem 2017/LSD-Sem Linking Models of Lexical, Sentential and Discourse-level Semantics, Valencia.
Clematide, S., & Makarov, P. (2017). CLUZH at VarDial GDI 2017: Testing a Variety of Machine Learning Tools for the Classification of Swiss German Dialects 170–177. http://www.aclweb.org/anthology/W17-1221
Clematide, S., Frick, K., Aepli, N., & Goldman, J.-P. (2016). Crowdsourcing Swiss Dialect Transcriptions for Assessing Factors in Writing Variations Bochumer Linguistische Arbeitsberichte, 62–67. https://www.linguistics.rub.de/bla/016-konvens2016.pdf
Volk, M., Clematide, S., Graën, J., & Ströbel, P. (2016, September 21). Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs KONVENS 2016, Bochum. https://www.linguistics.rub.de/konvens16/program/accepted.html
Klenner, M., & Clematide, S. (2016). How factuality determines sentiment inferences 75–84. https://aclweb.org/anthology/S/S16/S16-2008.pdf
Graën, J., Clematide, S., & Volk, M. (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database (P. Bański, M. Kupietz, H. Lüngen, A. Witt, A. Barbaresi, H. Biber, E. Breiteneder, & S. Clematide, Eds.; pp. 20–23). s.n. http://www.lrec-conf.org/proceedings/lrec2016/workshops/LREC2016Workshop-CMLC_Proceedings.pdf
Clematide, S., Furrer, L., & Volk, M. (2016). Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus 975–982. http://www.lrec-conf.org/proceedings/lrec2016/pdf/917_Paper.pdf
Rinaldi, F., Ellendorff, T. R., Madan, S., Clematide, S., van der Lek, A., Mevissen, T., & Fluck, J. (2016). BioCreative V track 4: a shared task for the extraction of causal network information using the Biological Expression Language Database, 2016, baw067. https://doi.org/10.1093/database/baw067
Clematide, S., Graën, J., & Volk, M. (2016). Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora In G. Corpas Pastor (Ed.), Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües (p. n/a). Tradulex.
Fluck, J., Madan, S., Ellendorff, T. R., Mevissen, T., Clematide, S., van der Lek, A., & Rinaldi, F. (2015). Track 4 Overview: Extraction of Causal Network Information in Biological Expression Language (BEL) Proceedings of the Fifth BioCreative Challenge Evaluation Workshop, 333–346. http://www.biocreative.org/media/store/files/2015/bel-track-overview_paper.pdf
Ellendorff, T. R., Clematide, S., van der Lek, A., Furrer, L., & Rinaldi, F. (2015). Ontogene Term and Relation Recognition for CDR 305–310. http://www.biocreative.org/media/store/files/2015/BCV2015_paper_49.pdf
Graën, J., & Clematide, S. (2015). Challenges in the alignment, management and exploitation of large and richly annotated multi-parallel corpora (P. Bański, H. Biber, E. Breiteneder, M. Kupietz, H. Lüngen, & A. Witt, Eds.; pp. 15–20). Institut für Deutsche Sprache. http://ids-pub.bsz-bw.de/files/3826/Graen_Clematide_Challenges_in_the_Alignment_management_and_exploitation_2015.pdf
Grigonyte, G., Clematide, S., Utka, A., & Volk, M. (Eds.). (2015). Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania (Vol. 111). Linköping University Electronic Press, Linköpings universitet. http://www.ep.liu.se/ecp/111/ecp15111.pdf
Clematide, S. (2015). Reflections and a Proposal for a Query and Reporting Language for Richly Annotated Multiparallel Corpora In G. Grigonyte, S. Clematide, A. Utka, & M. Volk (Eds.), Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania (No. 111; pp. 6–16). Linköping University Electronic Press, Linköpings universitet. http://www.ep.liu.se/ecp_home/index.en.aspx?issue=111
Volk, M., & Clematide, S. (2014). Detecting Code-Switching in a Multilingual Alpine Heritage Corpus 24–33. https://doi.org/10.3115/v1/W14-3903
Roth, L., & Clematide, S. (2014). Tagging Complex Non-Verbal German Chunks with Conditional Random Fields 48–57.
Furrer, L., Clematide, S., Marques, H., Rodriguez‐Esteban, R., Romacker, M., & Rinaldi, F. (2014). Collection‐Wide Extraction of Protein‐Protein Interactions 61–66.
Ellendorff, T., Rinaldi, F., & Clematide, S. (2014). Using Large Biomedical Databases as Gold Annotations for Automatic Relation Extraction (N. Calzolari, K. Choukri, T. Declerck, H. Loftsson, M. Bente, J. Mariani, A. Moreno, & J. Odijk, Eds.; pp. 3736–3741). European Language Resources Association (ELRA). http://www.lrec-conf.org/proceedings/lrec2014/pdf/1156_Paper.pdf
Rinaldi, F., Clematide, S., Marques, H., Ellendorff, T., Rodriguez-Esteban, R., & Romacker, M. (2014). OntoGene web services for biomedical text mining BMC Bioinformatics, 15, S6. https://doi.org/10.1186/1471-2105-15-S14-S6
Clematide, S., Klenner, M., & Furrer, L. (2013). Disambiguation of the Semantics of German Prepositions: a Case Study 137–150.
Rinaldi, F., Clematide, S., Ellendorff, T. R., & Marques, H. (2013). OntoGene: CTD entity and action term recognition 1, 90–94. http://www.biocreative.org/media/store/files/2013/bc4_v1_12.pdf
Rinaldi, F., Davis, A. P., Southan, C., Clematide, S., Ellendorff, T. R., & Schneider, G. (2013). ODIN: a customizable literature curation tool 1, 219–223. http://www.biocreative.org/media/store/files/2013/bc4_v1_30.pdf
Gama, S.-C., Rinaldi, F., et al. (2013). Assisted curation of growth conditions that affect gene expression in E. coli K-12 1, 214–218. http://www.biocreative.org/news/biocreative-iv/volume-1-proceed-publications/
Kors, J. A., Clematide, S., Akhondi, S. A., van Mulligen, E. M., & Rebholz-Schuhmann, D. (2013, September 26). Creating Multilingual Gold Standard Corpora for Biomedical Concept Recognition CLEF 2013: Evaluation Labs and Workshop: Online Working Notes, Valencia. http://www.clef-initiative.eu/documents/71612/b9297e71-0849-439b-bf16-a379639e4ea5
Lewin, I., & Clematide, S. (2013). Deriving an English Biomedical Silver Standard Corpus for CLEF-ER In P. Forner, R. Navigli, & D. Tufis (Eds.), CLEF 2013 Evaluation Labs and Workshop Online Working Notes 23 - 26 September, Valencia - Spain. s.n. http://www.clef-initiative.eu/documents/71612/7d846ed5-afef-429b-aa61-ed9ca1911e52
Clematide, S., Davtyan, M., Rinaldi, F., & Rebholz-Schuhmann, D. (2013). Exploiting BabelNet for multilingual biomedical synonym expansion 156–156. http://www.clef-initiative.eu/documents/71612/8554bae8-6cc3-4850-a677-b6f31af4850b
Clematide, S., & Klenner, M. (2013). A Pilot Study on the Semantic Classification of Two German Prepositions: Combining Monolingual and Multilingual Evidence (G. Angelova, K. Bontcheva, & R. Mitkov, Eds.; pp. 148–155). Bulgaria. http://aclweb.org/anthology/R/R13/
Schneider, G., Clematide, S., Ellendorff, T., Tuggener, D., Rinaldi, F., & Grigonyte, G. (2013). UZH in BioNLP 2013 116–120. http://www.aclweb.org/anthology/W13-2016
Clematide, S. (2013). A case study in tagging case in german: an assessment of statistical approaches In C. Mahlow & M. Piotrowski (Eds.), Systems and Frameworks for Computational Morphology (pp. 22–34). Springer. https://doi.org/10.1007/978-3-642-40486-3_2
Rinaldi, F., Clematide, S., Hafner, S., Schneider, G., Grigonyte, G., Romacker, M., & Vachon, T. (2013). Using the OntoGene pipeline for the triage task of BioCreative 2012 Database, 2013, bas053. https://doi.org/10.1093/database/bas053
Rebholz-Schuhmann, D., Clematide, S., Rinaldi, F., Kafkas, S., van Mulligen, E. M., Bui, C., Hellrich, J., Lewin, I., Milward, D., Poprat, M., Jimeno-Yepes, A., Hahn, U., & Kors, J. (2013). Entity recognition in parallel multi-lingual biomedical corpora: The CLEF-ER laboratory overview In P. Forner, H. Mueller, P. Rosso, & R. Paredes (Eds.), Information Access Evaluation. Multilinguality, Multimodality, and Visualization (pp. 353–367). Springer. https://doi.org/10.1007/978-3-642-40802-1_32
Grigonyte, G., Clematide, S., & Rinaldi, F. (2013). How preferred are preferred terms? In I. Kosem, J. Kallas, P. Gantar, S. Krek, M. Langemets, & M. Tuulik (Eds.), Electronic lexicography in the 21st century: thinking outside the paper. Proceedings of the eLex 2013 conference, 17-19 October 2013, Tallinn, Estonia (pp. 452–459). eLex. http://eki.ee/elex2013/proceedings/eLex2013_31_Grigonyte+Clematide+Rinaldi.pdf
Schneider, G., Rinaldi, F., & Clematide, S. (2012, May 25). Dependency parsing for interaction detection in pharmacogenomics Proceedings of LREC 2012: The Eighth International Conference on Language Resources and Evaluation. LREC 2012: The eighth international conference on Language Resources and Evaluation, Istanbul.
Rinaldi, F., Clematide, S., & Hafner, S. (2012, April 5). Ranking of CTD articles and interactions using the OntoGene pipeline Proceedings of the 2012 BioCreative Workshop. 2012 BioCreative Workshop, Washington D.C.
Klenner, M., Clematide, S., Petrakis, S., & Luder, M. (2012). Compositional syntax-based phrase-level polarity annotation for German (7/15). online. http://elanguage.net/journals/lilt/article/view/2697
Rinaldi, F., Clematide, S., Garten, Y., Whirl-Carrillo, M., Gong, L., Hebert, J. M., Sangkuhl, K., Thorn, C. F., Klein, T. E., & Altman, R. B. (2012). Using ODIN for a PharmGKB re-validation experiment Database, 2012, bas021. https://doi.org/10.1093/database/bas021
Rinaldi, F., Schneider, G., & Clematide, S. (2012). Relation Mining Experiments in the Pharmacogenomics Domain Journal of Biomedical Informatics, 45, 851–861. https://doi.org/10.1016/j.jbi.2012.04.014
Clematide, S., & Rinaldi, F. (2011). Ranking Interactions for a Curation Task Machine Learning And Applications, Fourth International Conference On, 2, 100–105. https://doi.org/10.1109/ICMLA.2011.119
Schneider, G., Clematide, S., & Rinaldi, F. (2011). Detection of interaction articles and experimental methods in biomedical literature BMC Bioinformatics, 12, S13. https://doi.org/10.1186/1471-2105-12-S8-S13
Clematide, S., & Roth, L. (2011). Generating inflection variants of multi-word terms for French and German In H. Hedeland, T. Schmidt, & K. Wörner (Eds.), Conference of the German Society for Computational Linguistics and Language Technology (GSCL) 2011 (No. 96; pp. 33–37). Universität Hamburg. http://www.corpora.uni-hamburg.de/gscl2011/downloads/AZM96.pdf
Klenner, M., Clematide, S., & Amsler, M. (2011, August 26). Semi-automatic test generation for tandem learning Speech and Language Technology in Education, Venice. http://project.cgm.unive.it/events/SLaTE2011/papers/Klenner--slate.pdf
Tuggener, D., Klenner, M., Schneider, G., Clematide, S., & Rinaldi, F. (2011). An incremental model for the coreference resolution task of BioNLP 2011 151–152. http://aclweb.org/anthology-new/W/W11/W11-1823.pdf
Clematide, S., Rinaldi, F., & Schneider, G. (2011). OntoGene at CALBC II and Some Thoughts on the Need of Document-Wide Harmonization (D. Rebholz-Schuhmann & S. Kafkas, Eds.; pp. 48–51). http://www.ebi.ac.uk/Rebholz-srv/CALBC/CALBC_WorkshopIIProceedings.pdf
Rinaldi, F., Schneider, G., & Clematide, S. (2011, January 7). Mining complex Drug/Gene/Disease relations Pacific Symposium on Biocomputing Workshop “Mining the Pharmacogenomics Literature,” Hawaii. http://psb.stanford.edu/psb11/conference-materials/wkshp-pharma/rinaldi.pdf
Wüest, B., Clematide, S., Bünzli, A., Laupper, D., & Frey, T. (2011). Electoral Campaigns and Relation Mining: Extracting Semantic Network Data from Newspaper Articles Journal of Information Technology & Politics, 8, 444–463. https://doi.org/10.1080/19331681.2011.567387
Arighi, C. N., Roberts, P. M., Agarwal, S., Bhattacharya, S., Cesareni, G., Chatr-aryamontri, A., Clematide, S., Gaudet, P., Giglio, M. G., Harrow, I., Huala, E., Krallinger, M., Leser, U., Li, D., Liu, F., Lu, Z., Maltais, L. J., Okazaki, N., Perfetto, L., … Wu, C. H. (2011). BioCreative III interactive task: an overview BMC Bioinformatics, 12, S4. https://doi.org/10.1186/1471-2105-12-S8-S4
Rebholz-Schuhmann, D., Jimeno, A., Li, C., Kafkas, S., Lewin, I., Kang, N., Corbett, P., Milward, D., Buyko, E., Beisswanger, E., Hornbostel, K., Kouznetsov, A., Witte, R., Laurila, J., Baker, C., Kuo, C., Clematide, S., Rinaldi, F., Farkas, R., … Hahn, U. (2011). Assessment of NER solutions against the first and second CALBC Silver Standard Corpus Journal of Biomedical Semantics, 2, S11. https://doi.org/10.1186/2041-1480-2-S5-S11
Rinaldi, F., Clematide, S., Schneider, G., Romacker, M., & Vachon, T. (2010). ODIN: an advanced interface for the curation of biomedical literature Nature Precedings, online. https://doi.org/10.1038/npre.2010.5169.1
Rinaldi, F., Schneider, G., Clematide, S., Jegen, S., et al. (2010, September 15). OntoGene (Team 65): preliminary analysis of participation in BioCreative III BioCreative III workshop, Bethesda. http://www.biocreative.org/events/biocreative-iii/
Clematide, S., & Klenner, M. (2010). Evaluation and extension of a polarity lexicon for German In A. Montoyo, P. Martínez-Barco, A. Balahur, & E. Boldrini (Eds.), Proceedings of the 1st Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA) (pp. 7–13). http://gplsi.dlsi.ua.es/congresos/wassa2010/fitxers/WASSA2010_Proceedings_.pdf
Rinaldi, F., Schneider, G., Kaljurand, K., Clematide, S., Vachon, T., & Romacker, M. (2010). OntoGene in BioCreative II.5 IEEE - ACM Transactions on Computational Biology and Bioinformatics, 7, 472–480. https://doi.org/10.1109/TCBB.2010.50
Clematide, S., Rinaldi, F., & Schneider, G. (2010). OntoGene in CALBC 30–31. http://workshop.calbc.eu/FirstProceedings.pdf
Luder, M., & Clematide, S. (2010). Constructing a Constructional MWE Lexicon for psycho-conceptual Annotation: Evaluation of CPA and DuELME for Lexicographic Description In A. Dykstra & T. Schoonheim (Eds.), Proceedings of the XIV Euralex International Congress (pp. 402–410). Fryske Akademy.
Rinaldi, F., Schneider, G., Kaljurand, K., & Clematide, S. (2009). Effective Mining of Protein Interactions 115–118. http://lbm2009.biopathway.org/
Clematide, S., Klenner, M., & Volk, M. (Eds.). (2009). Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
Clematide, S. R. (2009). Koordination im Deutschen und ihre syntaktische Desambiguierung (Dissertation, University of Zurich) https://doi.org/10.5167/uzh-26847
Clematide, S. (2009). A morpho-syntactic generation service for German glossary entries In S. Clematide, M. Klenner, & M. Volk (Eds.), Searching Answers: Festschrift in Honour of Michael Hess on the Occasion of His 60th Birthday (pp. 33–43). Monsenstein und Vannerdat. http://www.mv-buchshop.de/catalog/product_info.php/cPath/36_51/products_id/1338
Kappeler, T., Clematide, S., Kaljurand, K., Schneider, G., & Rinaldi, F. (2008). Towards automatic detection of experimental methods from biomedical literature 61–68. http://mars.cs.utu.fi/smbm2008/?q=proceedings
Rinaldi, F., Kappeler, T., Kaljurand, K., Schneider, G., Klenner, M., Clematide, S., Hess, M., von Allmen, J. M., Parisot, P., Romacker, M., & Vachon, T. (2008). OntoGene in BioCreative II Genome Biology, 9, S13. https://doi.org/10.1186/gb-2008-9-s2-s13
Luder, M., Clematide, S., & Distl, B. (2008). Ein elektronisches Lexikon im OLIF-Format für die Erzählanalyse 729–735. http://www.iula.upf.edu/agenda/euralex_08/euralex0202uk.htm
Clematide, S. (2008). An OLIF-based open inflectional resource and yet another morphological system for German In A. Storrer, A. Geyken, A. Siebert, & K. M. Würzner (Eds.), Text Resources and Lexical Knowledge (No. 8; pp. 183–194). Mouton de Gruyter. https://doi.org/10.1515/9783110211818.3.183
Klenner, M., Clematide, S., & Peric, B. (2007). What (the Hell) is Wrong? An Approach to Semi-automatic Construction of Self Correction Tests 15–22.
Clematide, S., Amsler, M., Roth, S., Thöny, L., & Bünzli, A. (2007). CLab - eine web-basierte interaktive Lernplattform für Studierende der Computerlinguistik 301–302.
Schwitter, R., Rinaldi, F., & Clematide, S. (2004). The importance of how-questions in technical domains 451–460.
Clematide, S. (2004). GermaNet und UniNet LDV-Forum, 19, 137–142. http://www.ldv-forum.org/2004_Doppelheft/137-142_Clematide.pdf
Clematide, S. (2002). Selektive Evaluation von robusten Parsern 23–29. http://konvens2002.dfki.de/cd/pdf/38V-Clematide.pdf
Clematide, S., & Volk, M. (2001). Linguistische und semantische Annotation eines Zeitungskorpus 201–209.
Volk, M., & Clematide, S. (2001). Learn-filter-apply-forget. Mixed approaches to named entity recognition 6th International Workshop on Applications of Natural Language for Informations Systems, Madrid.
Arnold, T., Clematide, S., Nespeca, R., Roth, J., & Volk, M. (2001). LUIS - Ein natürlichsprachliches, universitäres Informationssystem 115–126.

Curriculum Vitae

Curriculum Vitae, Simon Clematide (PDF, 62 KB)

Further Information

Test our finite-state morphology systems for Rumantsch Grischun, Standard German and Swiss German.
