This lack of reliability is, I believe, the real reason most companies haven't seen massive productivity gains from agents.
Just because a system *can* get it right doesn't mean it *will*. And in LLM agent loops, errors earn compound interest.
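To put rough numbers on "compound interest," here is a back-of-the-envelope sketch. It assumes every step in an agent loop succeeds independently with the same probability, which is a simplification I'm making for illustration (real agents can sometimes recover from a bad step, and errors are often correlated). The function name `chain_success_rate` and the 95% figure are hypothetical, not measurements.

```python
def chain_success_rate(per_step_success: float, steps: int) -> float:
    """Chance that every step in a chain succeeds.

    Assumes steps are independent and share one success probability
    (an illustrative simplification, not a model of any real agent).
    """
    if not 0.0 <= per_step_success <= 1.0:
        raise ValueError("per_step_success must be between 0 and 1")
    if steps < 0:
        raise ValueError("steps must be non-negative")
    # Independent successes multiply, so reliability decays geometrically.
    return per_step_success ** steps


if __name__ == "__main__":
    # Hypothetical 95% per-step reliability across loops of different lengths.
    for steps in (1, 5, 10, 20, 50):
        rate = chain_success_rate(0.95, steps)
        print(f"{steps:>2} steps at 95% each: {rate:.1%} end to end")
```

Under these assumptions, a step that is right 95% of the time gives a 20-step loop that finishes cleanly only about 36% of the time. The per-step number looks fine in a demo; the end-to-end number is what a company actually experiences.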