Ilia Breitburg (@breitburg.com)

Introducing the "Code Comments Slop" bench, which measures the rate at which LLMs put sloppy section banners in code comments like these:

# ============================================
# CONFIG
# ============================================

evals.breitburg.com/code-comment...
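For a rough sense of what such a measurement could look like, here is a minimal sketch. It is not the benchmark's actual method, which the post does not describe. The names `is_divider` and `slop_rate`, the `min_run` threshold, and the set of decoration characters are all hypothetical choices made for illustration. It only considers full-line `#` comments and does not parse the code, so a `#` line inside a multi-line string would be miscounted.

```python
def is_divider(line: str, min_run: int = 8) -> bool:
    """Return True for a comment line made only of one repeated decoration
    character, e.g. '# ==========' (hypothetical heuristic)."""
    stripped = line.strip()
    if not stripped.startswith("#"):
        return False
    # Drop the leading comment marker(s) and surrounding whitespace.
    body = stripped.lstrip("#").strip()
    return (
        len(body) >= min_run
        and len(set(body)) == 1   # a single character repeated
        and body[0] in "=-*~_"    # assumed set of decoration characters
    )


def slop_rate(source: str) -> float:
    """Fraction of full-line comments that are divider lines."""
    comments = [l for l in source.splitlines() if l.strip().startswith("#")]
    if not comments:
        return 0.0
    return sum(is_divider(l) for l in comments) / len(comments)
```

Applied to a model's generated file, `slop_rate(text)` gives a number between 0 and 1. A real eval would more likely aggregate over many generations and might use a tokenizer to find comments reliably.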


Code Comments Slop - Ilia Breitburg's Evals
Handcrafted evals for AI models.
https://evals.breitburg.com/code-comments-slop/

4 months ago