Martin Tutek 4 months ago
ManagerBench was accepted to ICLR!
@iclr-conf.bsky.social #ICLR2026
LLMs are still either unsafe, or completely harm avoidant - even when the harm affects furniture ๐๏ธ
Check out our benchmark, online or in Rio ๐ง๐ท
add a skeleton here at some point