3 months ago
NEW PAPER ALERT: Recent studies have shown that LLMs often lack robustness to distribution shifts in their reasoning. Our paper proposes a new method, AbstRaL, to augment LLMs' reasoning robustness by promoting their abstract thinking with granular reinforcement learning.