@asaf-yehudai.bsky.social
π€ 83
π₯ 45
π 12
pinned post!
New preprint! β¨ Interested in LLM-as-a-Judge? Want to get the best judge for ranking your system? our new work is just for you: "JuStRank: Benchmarking LLM Judges for System Ranking" πΊπ
arxiv.org/abs/2412.09569
loading . . .
JuStRank: Benchmarking LLM Judges for System Ranking
Given the rapid progress of generative AI, there is a pressing need to systematically compare and choose between the numerous models and configurations available. The scale and versatility of such eva...
https://arxiv.org/abs/2412.09569
10 months ago
1
9
6
New preprint! β¨ Interested in LLM-as-a-Judge? Want to get the best judge for ranking your system? our new work is just for you: "JuStRank: Benchmarking LLM Judges for System Ranking" πΊπ
arxiv.org/abs/2412.09569
loading . . .
JuStRank: Benchmarking LLM Judges for System Ranking
Given the rapid progress of generative AI, there is a pressing need to systematically compare and choose between the numerous models and configurations available. The scale and versatility of such eva...
https://arxiv.org/abs/2412.09569
10 months ago
1
9
6
you reached the end!!
feeds!
log in