
Hi there! 👋
Originally from the Finnish Arctic, I’m currently a final-year PhD student in the Michtom School of Computer Science at Brandeis University, where I’m advised by Constantine Lignos.
My research centers on multilingual/low-resource NLP and is motivated by the fact that only about 100 out of 7,000 existing languages are supported by current AI and language technology.
In addition to model building and resource creation, I develop methods to better understand what types of tasks/languages models perform well at, what they struggle with and what features explain performance disparities. I am also interested in how factors such as tokenization impact model model performance, especially in low-resource settings.
My research has been published at leading NLP conferences (e.g. NAACL, EMNLP, *ACL, LREC) and has been recognized with several awards:
- A recent LLM evaluation paper won the best paper award at AACL 2025 in Mumbai, India, representing the top 0.25% of all submissions at the conference.
- An earlier tokenization paper was awarded the best paper award at the Insights from Negative Results Workshop held in conjuction with EACL 2023.
- I was also part of the team behind the winning submission to the Multi-Label Similar Language Identification Shared Task at the VarDial 2024 held at NAACL 2024.
I am currently looking for AI/ML engineer and industry research positions starting in 2026. Feel free to reach out if you have an opening that you think would be a good fit! Extra points if the position involves multilinguality or AI model evaluation.