Xinpeng Wang
@xinpeng.bsky.social
PhD student @LMU. Eval & LLM Alignment.
https://xinpeng-wang.github.io/
reposted by
Xinpeng Wang
MaiNLP lab, LMU Munich
7 months ago
Reunion in Singapore! 🇸🇬 @barbaraplank.bsky.social, @xinpeng.bsky.social, who's currently on a research stay at NYU, and Chengzhi are presenting their work at @iclr-conf.bsky.social
reposted by
Xinpeng Wang
Barbara Plank
7 months ago
Upcoming ICLR 2025 paper: Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation. We propose a surgical & flexible approach to mitigate false refusal in LLMs with minimal effect on performance and inference cost, led by @xinpeng.bsky.social (1/2)
reposted by
Xinpeng Wang
MaiNLP lab, LMU Munich
8 months ago
MaiNLP is turning 3 today! 🥳 We've grown a lot since @barbaraplank.bsky.social started this group with nothing but three aspiring researchers and a hand-drawn sign on the door. Huge thanks to all the amazing people who have joined or visited us since. Here's to many more years of exciting research!
I'm thrilled to share that our paper on mitigating false refusal in language models has been accepted to ICLR 2025 @iclr-conf.bsky.social!
arxiv.org/abs/2410.03415
Joint work with Chengzhi, @paul-rottger.bsky.social, and @barbaraplank.bsky.social.
10 months ago