Alessandro Stolfo (@alestolfo.bsky.social)

Our paper "Improving Instruction-Following in Language Models through Activation Steering" has been accepted to #ICLR2025! We're also excited to share that our public GitHub repo is now live. Code: github.com/microsoft/ll... Camera-ready: arxiv.org/abs/2410.12877

about 1 year ago