Marcel Bollmann
Associate Professor
Computational linguist and researcher in natural language processing (NLP).
Natural language processing and machine learning
I am associate professor at the division Artificial Intelligence and Integrated Computer Systems (AIICS) at the Department of Computer and Information Science (IDA).
I am a computational linguist and researcher in natural language processing (NLP). My research interests revolve around NLP and machine learning in challenging scenarios, such as under-resourced languages, multilinguality, or historical documents. I am particularly interested in linguistically-informed approaches to NLP, as well as bringing improvements in NLP technology to a wider range of languages and text genres.
CV in brief
PhD
- I have obtained my PhD degree (Dr. phil.) in Computational Linguistics at Ruhr-Universität Bochum, Germany, in 2018.
- My research was funded by various projects related to historical language corpora of German, and my dissertation is on the normalization of historical language data.
Postdoc
- Between 2018 and 2021, I was a postdoc in the CoAStaL NLP group at University of Copenhagen, Denmark.
- During that time, I received funding from the EU in form of a Marie Skłodowska-Curie Fellowship (MSCA) for a project on “morphologically-informed representations for NLP.”
- Our paper on “Error Analysis and the Role of Morphology” won a “Best Long Paper” award at the EACL 2021 conference.
Assistant professor
- Between 2021 and 2023, I was assistant professor at the Jönköping AI Lab (JAIL) at Jönköping University, Sweden.
- I contributed to the advanced-level study program in “AI Engineering”, creating new courses on “Data Science Programming” and “Natural Language Processing.”
Volunteer
- As a volunteer, I also contribute to the development of the ACL Anthology as Site Development Lead.
Publications
2024
CreoleVal: Multilingual Multitask Benchmarks for Creoles
Transactions of the Association for Computational Linguistics, Vol. 12, p. 950-978
(Article in journal)
https://dx.doi.org/10.1162/tacl_a_00682
2023
Two Decades of the ACL Anthology: Development, Impact, and Open Challenges
Proceedings of the 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023), p. 83-94
(Conference paper)
https://dx.doi.org/10.18653/v1/2023.nlposs-1.10
2021
Moses and the Character-Based Random Babbling Baseline: CoAStaL at AmericasNLP 2021 Shared Task
Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, p. 248-254
(Conference paper)
https://dx.doi.org/10.18653/v1/2021.americasnlp-1.28
Error Analysis and the Role of Morphology
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, p. 1887-1900
(Conference paper)
https://dx.doi.org/10.18653/v1/2021.eacl-main.162
2020
On Forgetting to Cite Older Papers: An Analysis of the ACL Anthology
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, p. 7819-7827
(Conference paper)
https://dx.doi.org/10.18653/v1/2020.acl-main.699