NLP research group @IDSIA
@idsianlp.bsky.social
📤 5
📥 5
📝 7
Presenting at
#EMNLP2025
in a moment, session on "Multilinguality and Language Diversity 2" (A301). Our paper on Tokenization Fairness:
arxiv.org/abs/2509.20045
loading . . .
Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks
Dialectal data are characterized by linguistic variation that appears small to humans but has a significant impact on the performance of models. This dialect gap has been related to various factors (e...
https://arxiv.org/abs/2509.20045
7 months ago
1
1
0
This is the inaugural post for the idsianlp account on Bluesky.
over 1 year ago
1
0
0
you reached the end!!
feeds!
log in