Adi Simhi
@adisimhi.bsky.social
📤 21
📥 25
📝 9
NLProc and machine learning. Ph.D. student at the Technion
Check out our new paper, ManagerBench👔, on evaluating LLM agents on their preference for achieving their goal and avoiding human harm
about 1 month ago
🚨New arXiv preprint!🚨 LLMs can hallucinate - but did you know they can do so with high certainty even when they know the correct answer? 🤯 We find those hallucinations in our latest work with @itay-itzhak.bsky.social, @fbarez.bsky.social, @gabistanovsky.bsky.social and Yonatan Belinkov
9 months ago