hey, remember all the people who, a year ago, cited METR as a sign that AI was unreliable and offered no time savings for the median dev?
well, guess what METR is saying now?
now is a great opportunity to revise priors rather than cherry-pick the science. either METR is reliable or it isn't. pick one.
2 days ago