Children exhibit visual understanding from limited experience, orders of magnitude less than our best models.
We introduce the Zero-shot World Model (ZWM). Trained on a single child's visual experience, BabyZWM rapidly generates competence across diverse benchmarks with no task-specific training. đź§µ
2 days ago