@navitas.bsky.social
๐ค 18
๐ฅ 189
๐ 0
reposted by
Institute of Formal and Applied Linguistics
5 months ago
Today,
@tuetschek.bsky.social
shared the work of his team on evaluating LLM text generation with both human annotation frameworks and LLM-based metrics. Their approach tackles the benchmark data leakage problem and how to get unseen data for unbiased LLM testing.
1
8
3
reposted by
Institute of Formal and Applied Linguistics
5 months ago
The ๐Machine Learning Prague 2025๐ is happening right now! Today,
@patuchen.bsky.social
and
@navitas.bsky.social
presented their posters on text generation with LLMs. Also, don't miss
@tuetschek.bsky.social
's invited talk tomorrow at 11 a.m.
1
11
5
reposted by
Zdenฤk Kasner
6 months ago
How do LLMs compare to human crowdworkers in annotating text spans? ๐ง๐ค And how can span annotation help us with evaluating texts? Find out in our new paper:
llm-span-annotators.github.io
Arxiv:
arxiv.org/abs/2504.08697
loading . . .
Large Language Models as Span Annotators
Website for the paper Large Language Models as Span Annotators
https://llm-span-annotators.github.io
1
20
9
you reached the end!!
feeds!
log in