THE EVAL HAS LANDED. 🦅
pydantic-evals has been a major piece of work from David Montague over the last month.
I'm pretty excited to see it move the dial on how easily Python developers can benchmark and improve AI code.
add a skeleton here at some point
6 months ago