@navitas.bsky.social
๐ค 19
๐ฅ 189
๐ 0
reposted by
Institute of Formal and Applied Linguistics
9 months ago
Today,
@tuetschek.bsky.social
shared the work of his team on evaluating LLM text generation with both human annotation frameworks and LLM-based metrics. Their approach tackles the benchmark data leakage problem and how to get unseen data for unbiased LLM testing.
1
8
3
reposted by
Institute of Formal and Applied Linguistics
9 months ago
The ๐Machine Learning Prague 2025๐ is happening right now! Today,
@patuchen.bsky.social
and
@navitas.bsky.social
presented their posters on text generation with LLMs. Also, don't miss
@tuetschek.bsky.social
's invited talk tomorrow at 11 a.m.
1
11
5
reposted by
Zdenฤk Kasner
10 months ago
How do LLMs compare to human crowdworkers in annotating text spans? ๐ง๐ค And how can span annotation help us with evaluating texts? Find out in our new paper:
llm-span-annotators.github.io
Arxiv:
arxiv.org/abs/2504.08697
loading . . .
Large Language Models as Span Annotators
Website for the paper Large Language Models as Span Annotators
https://llm-span-annotators.github.io
1
20
9
you reached the end!!
feeds!
log in