Curriculum Vitae
Education and work
since 2022 | Data Scientist at the Federal Office for Migration and Refugees |
2009–2022 | Researcher at the Chair for Corpus and Computational Linguistics, FAU Erlangen-Nürnberg |
2021 | Visiting Lecturer at the department for German Linguistics, University of Göttingen |
2019 | Interim Professor for Computational Linguistics and Cognitive Science at the Institute of Cognitive Science, University of Osnabrück |
2018 | Dr. phil. in Computational Linguistics, FAU Erlangen-Nürnberg. Thesis: The Cooccurrence of Linguistic Structures, awarded the STAEDTLER Promotionspreis and the GSCL-Promotionspreis zum Gedenken an Wolfgang Hoeppner |
2014 | Research Assistant at Symanto Research GmbH & Co. KG |
2009–2014 | Various contracts as Researcher and Lecturer at the Chair for English Linguistics, FAU Erlangen-Nürnberg |
2009 | Certificate: European Masters of Language and Speech |
2005–2009 | M.A. in Computational Linguistics and English Linguistics, FAU Erlangen-Nürnberg. Thesis: Integration von Valenzdaten in die grammatische Analyse unter Verwendung des Valency Dictionary of English |
Events
- Co-organization of the 15th Conference on Natural Language Processing (KONVENS 2019) in Erlangen, 8–11 October 2019 (with Stefan Evert, Andreas Blombach, Natalie Dykes, Paul Greiner, Tim Griebel, Philipp Heinrich, Besim Kabashi and Tanja Schorr)
- Co-organization of the GermEval 2019 shared task on the lemmatization of German web and social media data (EmpiriST-lemmatization 2019) in Erlangen, 8 October 2019 (with Natalie Dykes, Stefan Evert, Philipp Heinrich, Besim Kabashi)
Reviewing activity
Journals
- Digital Scholarship in the Humanities (2019)
- Künstliche Intelligenz (2017)
- Nature Machine Intelligence (2019)
- Language Resources and Evaluation (LREV) (2020, 2021)
- Transactions on Asian and Low-Resource Language Information Processing (TALLIP) (2020, 2021)
- Linguistics Vanguard (2020)
- Language Technology and Computational Linguistics (JLCL) (2021)
Publishers
- De Gruyter
Conferences
- Digital Humanities (DH)
- Digital Humanities im deutschsprachigen Raum (DHd)
- International Conference on Language Resources and Evaluation (LREC)
- KONVENS
Workshops and shared tasks
- EVALITA
- SemEval
- *SEM
- SIGTYP
- Systems and Frameworks for Computational Morphology (SFCM)
- Web as Corpus Workshop (WAC)
- Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA)
Invited talks
- Proisl, Thomas. 2022. “Ein semantischer Tagger für das Deutsche.” Presentation at Oberseminar Computerlinguistik. Erlangen. [bib]
- Proisl, Thomas. 2022. “The Statistical Analysis of Cooccurrences: From Collocations to Arbitrary Structures.” Presentation in the GSCL Research Talks series. https://gscl.org/en/events/talks/februar-2022-research-talk. [bib, pdf]
- Blombach, Andreas, and Thomas Proisl. 2020. “Unexpected Complexity and Romance in Disguise: The Case of Science Fiction Novels and Fanfiction.” Presentation at 9th Hildesheim-Göttingen-Workshop on DH and CL. Göttingen. [bib]
- Proisl, Thomas, and Philipp Heinrich. 2019. “NLP for German CMC Data.” Poster presentation at Amazon Research Days. Berlin. [bib, poster, pdf]
- Proisl, Thomas, Natalie Dykes, Besim Kabashi, Philipp Heinrich, and Andreas Blombach. 2019. “NLP for German CMC Texts: Tokenization, POS Tagging, and a New Gold Standard for Lemmatization.” Presentation at Annotation of Non-Standard Corpora. Bamberg. [bib]
- Büttner, Andreas, and Thomas Proisl. 2016. “Delta und Merkmalsselektion: Welche Wörter unterscheiden arabisch-lateinische Übersetzer?” Presentation at <philtag n="13"/>. Würzburg. http://kallimachos.de/kallimachos/images/kallimachos/f/f5/Abstract%C3%9Cbersetzer.pdf. [bib, pdf]
- Evert, Stefan, and Thomas Proisl. 2016. “Burrows’s Delta verstehen.” Presentation at <philtag n="13"/>. Würzburg. http://kallimachos.de/kallimachos/images/kallimachos/b/bf/AbstractFAU.pdf. [bib, pdf]
- Proisl, Thomas. 2015. “Maschinelles Lernen mit Python.” Presentation at DARIAH-Methodenworkshop Natural Language Processing für Literaturwissenschaftler. Würzburg. [bib]
- Uhrig, Peter, and Thomas Proisl. 2012. “A Fast and User-Friendly Interface for Large Treebanks.” Presentation at Otto-Friedrich-Universität Bamberg. [bib]
- Uhrig, Peter, and Thomas Proisl. 2012. “Sprachstrukturen effizient speichern, verarbeiten und abfragen.” Presentation at Vortragsreihe Digital Humanities. Erlangen. [bib]
- Uhrig, Peter, and Thomas Proisl. 2011. “The Erlangen Treebank.” Presentation at Vortragsreihe Approaches to Corpus Linguistics. Erlangen. [bib]