Lexin Zhou 7 months ago
Thrilled to unlock AI Evaluation with explanatory and predictive power through general ability scales!
With a new methodology to
-Explain what common benchmarks really measure
-Extract explainable ability profiles of AI systems
-Predict performance for new task instances, in & out-of-distribution
π§΅