Adi Simhi
@adisimhi.bsky.social
📤 22
📥 26
📝 11
NLProc, and machine learning. Ph.D. student Technion
Check out our new paper, investigating phenomena (hallucination, refusal, and sycophancy) both externally and internally! Showing a high correlation between the two!
add a skeleton here at some point
14 days ago
0
4
0
ManagerBench was accepted to
#ICLR2026🎉
Check it out⬇️
add a skeleton here at some point
about 2 months ago
0
1
0
Check out our new paper on evaluating LLM agents on their preference for achieving their goal and avoiding human harm, called ManagerBench👔
add a skeleton here at some point
6 months ago
0
2
0
🚨New arXiv preprint!🚨 LLMs can hallucinate - but did you know they can do so with high certainty even when they know the correct answer? 🤯 We find those hallucinations in our latest work with
@itay-itzhak.bsky.social
,
@fbarez.bsky.social
,
@gabistanovsky.bsky.social
and Yonatan Belinkov
about 1 year ago
3
21
12
you reached the end!!
feeds!
log in