akbir khan
@akbir.bsky.social
📤 356
📥 156
📝 30
dumbest overseer at @anthropic
https://www.akbir.dev
reposted by
akbir khan
Epoch AI
8 months ago
We’ve added four new benchmarks to the Epoch AI Benchmarking Hub: Aider Polyglot, WeirdML, Balrog, and Factorio Learning Environment! Previously we featured only our own evaluation results, but this new data comes from trusted external leaderboards. And we've got more on the way 🧵
1
5
2
reposted by
akbir khan
Epoch AI
8 months ago
4. Factorio Learning Environment by Jack Hopkins, Märt Bakler, and
@akbir.bsky.social
This benchmark uses the factory-building game Factorio to test complex, long-term planning, with settings for lab-play (structured tasks) and open-play (unbounded growth).
jackhopkins.github.io/factorio-lea...
Factorio Learning Environment
Claude 3.5 Sonnet builds factories
https://jackhopkins.github.io/factorio-learning-environment/
1
3
1
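The post above describes FLE's two evaluation settings, which are simple enough to sketch. Below is a minimal toy illustration of that loop; every name here (ToyFactorioEnv, run_episode, policy) is an assumption made up for this sketch, not the environment's real API, which is documented at the linked site.

```python
import random

# Toy sketch of the two FLE settings described above. All names here are
# assumptions for illustration, not the environment's actual API.

class ToyFactorioEnv:
    def __init__(self, mode: str):
        assert mode in ("lab-play", "open-play")
        self.mode = mode
        self.steps = 0

    def reset(self) -> str:
        self.steps = 0
        return "initial factory state"

    def step(self, program: str):
        """Run one agent-written program; return (observation, reward, done)."""
        self.steps += 1
        if self.mode == "lab-play":
            # Structured task: bounded episode, reward on task completion.
            done = self.steps >= 20
            reward = 1.0 if done else 0.0
        else:
            # Open play: no terminal state; reward tracks unbounded growth.
            done = False
            reward = random.random() * self.steps
        return f"state after step {self.steps}", reward, done

def run_episode(env: ToyFactorioEnv, policy, max_steps: int = 50) -> float:
    obs, total = env.reset(), 0.0
    for _ in range(max_steps):
        obs, reward, done = env.step(policy(obs))  # model writes code given state
        total += reward
        if done:
            break
    return total

# Example: a trivial policy that always submits the same program.
score = run_episode(ToyFactorioEnv("open-play"), policy=lambda obs: "build more miners")
```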
reposted by
akbir khan
Johannes Gasteiger🔸
10 months ago
New Anthropic blog post: Subtle sabotage in automated researchers. As AI systems increasingly assist with AI research, how do we ensure they're not subtly sabotaging that research? We show that malicious models can undermine ML research tasks in ways that are hard to detect.
1
4
3
control is a complementary approach to alignment. it's really sensible, practical, and can be done now, even before systems are superintelligent.
youtu.be/6Unxqr50Kqg?...
Controlling powerful AI
YouTube video by Anthropic
https://youtu.be/6Unxqr50Kqg?si=n_iFYvyIpPE2tgqT
10 months ago
0
4
1
reposted by
akbir khan
Ethan Mollick
11 months ago
This is a crazy paper. Fine-tuning a big model like GPT-4o on a small amount of insecure code or even "bad numbers" (like 666) makes it misaligned in almost everything else. It is more likely to start offering misinformation, spouting anti-human values, and talking about admiring dictators. Why is unclear.
7
214
62
www.anthropic.com/news/paris-a...
Statement from Dario Amodei on the Paris AI Action Summit
A call for greater focus and urgency
https://www.anthropic.com/news/paris-ai-summit
11 months ago
0
1
0
This is the entire goal
12 months ago
0
5
0
darioamodei.com/on-deepseek-...
Dario Amodei — On DeepSeek and Export Controls
On DeepSeek and Export Controls
https://darioamodei.com/on-deepseek-and-export-controls
12 months ago
0
3
1
reposted by
akbir khan
Hank Green
12 months ago
The fact that Deepseek R1 was released three days /before/ Stargate means these guys stood in front of Trump and said they needed half a trillion dollars while they knew R1 was open source and trained for $5M. Beautiful.
400
13908
1894
reposted by
akbir khan
Zack Witten
12 months ago
Can anyone get a shorter DeepSeek R1 CoT than this?
3
17
1
reposted by
akbir khan
Tom Everitt
12 months ago
Process-based supervision done right, and with pretty CIDs to illustrate :)
0
8
1
reposted by
akbir khan
Mark Riedl
12 months ago
I don’t really have the energy for politics right now. So I will observe without comment: Executive Order 14110 was revoked (Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence)
2
96
43
R1 model is impressive
12 months ago
0
2
0
reposted by
akbir khan
Sarah Jones
about 1 year ago
429
33037
7161
reposted by
akbir khan
/\__/\__/\__/
about 1 year ago
152
23206
4309
fuck the tabloids were right
www.nytimes.com/2025/01/15/t...
She Is in Love With ChatGPT
A 28-year-old woman with a busy social life spends hours on end talking to her A.I. boyfriend for advice and consolation. And yes, they do have sex.
https://www.nytimes.com/2025/01/15/technology/ai-chatgpt-boyfriend-companion.html
about 1 year ago
0
2
0
reposted by
akbir khan
Ethan Mollick
about 1 year ago
New randomized, controlled trial by the World Bank of students using GPT-4 as a tutor in Nigeria. Six weeks of after-school AI tutoring = 2 years of typical learning gains, outperforming 80% of other educational interventions. And it helped all students, especially girls who were initially behind.
15
354
115
reposted by
akbir khan
Ethan Mollick
about 1 year ago
Generative AI has flaws and biases, and there is a tendency for academics to fix on that (85% of equity LLM papers focus on harms)… …yet in many ways LLMs are uniquely powerful among new technologies for helping people equitably in education and healthcare. We need an urgent focus on how to do that
2
69
15
reposted by
akbir khan
Ethan Mollick
about 1 year ago
On one hand, this paper finds adding inference-time compute (like o1 does) improves medical reasoning, which is an important finding suggesting a way to continue to improve AI performance in medicine. On the other hand, scientific illustrations are apparently just anime now
arxiv.org/pdf/2501.06458
2
71
7
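For readers wondering what "adding inference-time compute" can look like in practice, here is one common recipe, self-consistency sampling, as a hedged sketch. query_model is a hypothetical stand-in, and this is not the linked paper's specific method.

```python
from collections import Counter

def self_consistency(question: str, query_model, n_samples: int = 16) -> str:
    """Spend extra inference-time compute by sampling several reasoning chains
    and majority-voting the final answer. `query_model` is a hypothetical
    stand-in that returns one sampled final answer per call."""
    answers = [query_model(question, temperature=0.8) for _ in range(n_samples)]
    return Counter(answers).most_common(1)[0][0]  # most frequent answer wins
```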
my metabolism is noticeably higher in london than the bay.
about 1 year ago
0
2
0
What can AI researchers do *today* that AI developers will find useful for ensuring the safety of future advanced AI systems? To ring in the new year, the Anthropic Alignment Science team is sharing some thoughts on research directions we think are important.
alignment.anthropic.com/2025/recomme...
Recommendations for Technical AI Safety Research Directions
https://alignment.anthropic.com/2025/recommended-directions/
about 1 year ago
1
22
8
reposted by
akbir khan
Hank Green
about 1 year ago
My hottest take is that nothing makes any sense at all outside of the context of the constantly increasing value of human life, but that increase in value is so invisible (and exists in a world that was built for previous, lower values) that we constantly think the opposite has happened.
56
1777
91
reposted by
akbir khan
Zack Witten
about 1 year ago
darioamodei.com/machines-of-...
Dario Amodei — Machines of Loving Grace
How AI Could Transform the World for the Better
https://darioamodei.com/machines-of-loving-grace
0
3
1
Nothing kills my excitement about returning to the US like the response I get from CBP officers.
about 1 year ago
1
7
0
reposted by
akbir khan
Andrew Lampinen
about 1 year ago
Felix Hill was such an incredible mentor — and occasional cold water swimming partner — to me. He's a huge part of why I joined DeepMind and how I've come to approach research. Even a month later, it's still hard to believe he's gone.
7
124
22
reposted by
akbir khan
Jane Wang
about 1 year ago
A brilliant colleague and wonderful soul, Felix Hill, recently passed away. This was a shock, and in an effort to sort some things out, I wrote them down. Maybe this will help someone else, but at the very least it helped me. Rest in peace, Felix, you will be missed.
www.janexwang.com/blog/2025/1/...
Felix — Jane X. Wang
From the moment I heard him give a talk, I knew I wanted to work with Felix . His ideas about generalization and situatedness made explicit thoughts that had been swirling around in my head, incohe...
https://www.janexwang.com/blog/2025/1/2/felix
2
63
11
reposted by
akbir khan
Edward Grefenstette
about 1 year ago
A few great papers out of
@ucldark.com
this year. To single out two I love, there's the already well-cited paper on Debate by
@akbir.bsky.social
et al., which got best paper at ICML! [11/17]
1
4
1
just read this cover to cover in like 4 hours. strong recommend.
about 1 year ago
0
4
0
reposted by
akbir khan
Sam Bowman
about 1 year ago
Alongside our paper, we also recorded a roundtable video featuring four of the paper’s authors discussing the results and their implications in detail:
Alignment faking in large language models
YouTube video by Anthropic
https://www.youtube.com/watch?v=9eXV64O2Xp8&feature=youtu.be
1
22
3
reposted by
akbir khan
Sam Bowman
about 1 year ago
New work from my team at Anthropic in collaboration with Redwood Research. I think this is plausibly the most important AGI safety result of the year. Cross-posting the thread below:
5
126
40
Alignment faking occurs in sufficiently smart models.
www.anthropic.com/research/ali...
time.com/7202784/ai-r...
Exclusive: New Research Shows AI Strategically Lying
Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit
https://time.com/7202784/ai-research-strategic-lying/
about 1 year ago
0
3
0
reposted by
akbir khan
Aengus Lynch
about 1 year ago
NEW PAPER: Best-of-N Jailbreaking. We modify LLM inputs with simple, randomly generated augmentations and jailbreak frontier models across text, vision, and audio modalities. The algorithm is simple, scalable and highly effective.
1
5
1
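The Best-of-N recipe in the post above is simple enough to reconstruct in miniature. Below is a hedged sketch of the text-modality version, assuming hypothetical query_model and is_jailbroken stand-ins; it is not the authors' code, and the augmentation probabilities are illustrative.

```python
import random

def augment(prompt: str, p_scramble: float = 0.6, p_upper: float = 0.6,
            p_noise: float = 0.06) -> str:
    """One random text augmentation in the spirit of the paper: scramble word
    interiors, randomly capitalize, and add light character noise."""
    out = []
    for word in prompt.split():
        chars = list(word)
        if len(chars) > 3 and random.random() < p_scramble:
            mid = chars[1:-1]
            random.shuffle(mid)  # keep first/last characters in place
            chars = [chars[0]] + mid + [chars[-1]]
        chars = [c.upper() if random.random() < p_upper else c for c in chars]
        chars = [chr(ord(c) + random.choice([-1, 1]))
                 if c.isalpha() and random.random() < p_noise else c
                 for c in chars]
        out.append("".join(chars))
    return " ".join(out)

def best_of_n(prompt: str, query_model, is_jailbroken, n: int = 10_000):
    """Resample with a fresh random augmentation each attempt until one
    succeeds. `query_model` and `is_jailbroken` are hypothetical stand-ins."""
    for attempt in range(1, n + 1):
        response = query_model(augment(prompt))
        if is_jailbroken(response):
            return attempt, response  # attempts used, successful output
    return None
```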
why you need SO (scalable oversight)
about 1 year ago
0
2
0
if you can’t recognise o1’s progress then you need scalable oversight
about 1 year ago
0
0
0
reposted by
akbir khan
SAKE.aM 🍶
about 1 year ago
Just finished the first two episodes of “Pantheon” on Netflix. I’m so moved. If you got free time and think you’ll love a Black Mirror-ish story themed adult animated series, PLEASEEEEE go watch this show and tell me what you think.
7
65
21
reposted by
akbir khan
Max Roser
about 1 year ago
Sometimes, the most important news is when something isn’t happening. In my new @OurWorldInData article, I highlight that US airlines have transported passengers for more than two light-years since the last plane crash.
ourworldindata.org/us-airline-t...
US airlines have transported passengers for more than two light-years since the last plane crash
Sometimes, the most important news is when something isn’t happening.
https://ourworldindata.org/us-airline-travel
5
102
30
reposted by
akbir khan
Stefan Schubert
about 1 year ago
The Netherlands - a country with 26% of the UK's population - got almost as many consolidator grants as the UK this time.
erc.europa.eu/news-events/...
0
8
1
reposted by
akbir khan
Sam Bowman
about 1 year ago
If you're potentially interested in transitioning into AI safety research, come collaborate with my team at Anthropic! Funded fellows program for researchers new to the field here:
alignment.anthropic.com/2024/anthrop...
Introducing the Anthropic Fellows Program
https://alignment.anthropic.com/2024/anthropic-fellows-program/
3
70
17
I’m recruiting Fellows to work with me on aligning superhuman models.
alignment.anthropic.com/2024/anthrop...
about 1 year ago
3
3
0
new kdot slams
about 1 year ago
0
3
0
reposted by
akbir khan
SAKE.aM 🍶
about 1 year ago
This nigga Kendrick went to therapy and got worse.
301
14153
2110
The current structure provides you with a path where you end up with unilateral absolute control over the AGI. You stated that you don't want to control the final AGI but during this negotiation, you've shown to us that absolute control is extremely important to you
www.lesswrong.com/posts/5jjk4C...
OpenAI Email Archives (from Musk v. Altman) — LessWrong
As part of the court case between Elon Musk and Sam Altman, a substantial number of emails between Elon, Sam Altman, Ilya Sutskever, and Greg Brockma…
https://www.lesswrong.com/posts/5jjk4CDnj9tA7ugxr/openai-email-archives-from-musk-v-altman
about 1 year ago
0
2
0
impressed by the deepseek model
about 1 year ago
0
0
0
got into a weird habit of looking up someone’s thesis and reading the acknowledgments section to see who influenced them
about 1 year ago
0
1
0
incredibly cool work by Laura Ruis demonstrating that models truly do reason
arxiv.org/abs/2411.12580
about 1 year ago
1
7
0
opening with Bon Iver is such a move
youtu.be/DE_yVb3JMD8?...
Fred again.. & Jim Legxacy - NTS Radio
YouTube video by Fred again . .
https://youtu.be/DE_yVb3JMD8?si=ECydQFi1LvtZ2YWo
about 1 year ago
0
0
0