Even with identical sociodemographic info, the way it is presented to an LLM changes downstream bias results. Our new preprint (w/
@veraneplenbroek.bsky.social, Jan Batzner & Sebastian Padó) tests cues with varying external validity across 10 personas, 4 tasks & 7 LLMs:
arxiv.org/abs/2601.18572