One of my favorite takeaways from this work is a “negative result” that bugged us for more than a month: out-of-the-box (zero-shot) LLMs consistently beat their fine-tuned counterparts. We were confused and discouraged by the results, but closer inspection revealed a trivial truth…