Maik Fröbe
@maik-froebe.bsky.social
📤 502
📥 374
📝 25
PhD-Student in the webis.de group. Interested in IR and NLP.
reposted by
Maik Fröbe
UofGNews
about 2 months ago
PyTerrier, a software platform developed at
@uofgcompsci.bsky.social
which helps facilitate the development of AI-powered search engines, has won a national award from
@wearebcs.bsky.social
! Read more here:
www.gla.ac.uk/news/headlin..
.
0
7
4
reposted by
Maik Fröbe
Webis Group
3 months ago
We just released "German Commons", the largest openly-licensed German text dataset for LLM training: 154B tokens with clear usage rights for research and commercial use.
huggingface.co/datasets/coral-nlp/german-commons
loading . . .
coral-nlp/german-commons · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/coral-nlp/german-commons
1
21
9
reposted by
Maik Fröbe
Djoerd Hiemstra 🍉
3 months ago
Dutch-Belgian Information Retrieval workshop
#dir3025
today in Nijmegen! Opening by the great Harrie Oosterhuis
https://informagus.nl/dir2025/schedule
0
6
1
reposted by
Maik Fröbe
Svitlana Vakulenko
4 months ago
Check out the slides from our SCAI'2025
#convsearch
workshop collocated with
@ijcai.org
#IJCAI2025
on LLMs, retrieval & QA, recommendations, negotiations, evaluation and transparency
scai.info/scai-2025
@patuchen.bsky.social
@maik-froebe.bsky.social
@tuetschek.bsky.social
@mila-quebec.bsky.social
loading . . .
SCAI 2025
Online Event on Search-Oriented Conversational AI.
https://scai.info/scai-2025/
0
9
3
reposted by
Maik Fröbe
Johanne Trippas
5 months ago
🌟Really excited to share the fourth Strategic Workshop on Information Retrieval (SWIRL) report published in SIGIR Forum! Paper 👉🏻
www.johannetrippas.com/papers/tripp...
More info 👉🏻
sites.google.com/view/swirl20...
#SWIRL2025
#SIGIR2026
#IR
#GenAI
#Research
#CHIIR2026
0
13
10
reposted by
Maik Fröbe
Bhaskar Mitra | ভাস্কর মিত্র
5 months ago
Some exciting news! 🤗 After 3 amazing years at TREC, the Tip-of-the-Tongue (ToT) shared task will be a core task at NTCIR-19 in 2026. The new track will focus on tip-of-the-tongue information needs in English and East Asian languages. More details coming soon. See you all in Tokyo next year!
0
5
3
reposted by
Maik Fröbe
Bhaskar Mitra | ভাস্কর মিত্র
5 months ago
Hello TREC-ToTers! 👋🏽 📆 Good news! We are extending the run submission deadline by 2 weeks. Please submit your runs by **September 10 (Wednesday)** and spread the word. More info:
trec-tot.github.io/guidelines
#TREC2025
#TRECToT
#TREC2025ToT
add a skeleton here at some point
1
1
2
reposted by
Maik Fröbe
Bhaskar Mitra | ভাস্কর মিত্র
5 months ago
Gentle reminder 📢 All run submissions for the Tip-of-the-Tongue (ToT) Track are due next week Wednesday (Aug 27). More info:
trec-tot.github.io/guidelines
#TREC2025
#TRECToT
#TREC2025ToT
add a skeleton here at some point
0
2
3
Here are some impressions from our ReNeuIR workshop on "Reaching Efficiency in Neural IR" that we had yesterday at
#SIGIR2025
.
6 months ago
1
8
1
reposted by
Maik Fröbe
Webis Group
6 months ago
Happy to share that our paper "The Viability of Crowdsourcing for RAG Evaluation" received the Best Paper Honourable Mention at
#SIGIR2025
! Very grateful to the community for recognizing our work on improving RAG evaluation. 📄
webis.de/publications...
2
27
11
Now
@fschlatt.bsky.social
presents "TITE: Token-Independent Text Encoder for Information Retrieval" at
#SIGIR2025
Paper:
webis.de/publications...
6 months ago
0
8
3
reposted by
Maik Fröbe
Ferdinand Schlatt
6 months ago
Want to know how to make bi-encoders more than 3x faster with a new backbone encoder model? Check out our talk on the Token-Independent Text Encoder (TITE)
#SIGIR2025
in the efficiency track. It pools vectors within the model to improve efficiency
dl.acm.org/doi/10.1145/...
0
10
5
To Eun Kim just presented the work on "Tip of the Tongue Query Elicitation for Simulated Evaluation" at
#SIGIR2025
. The approach will be used in the
#TREC2025
Tip-of-the-Tongue track, and we had some sweets at the poster :) The paper is available online:
dl.acm.org/doi/10.1145/...
6 months ago
0
12
3
Lukas Gienapp presents "The Viability of Crowdsourcing for RAG Evaluation" at
#SIGIR2025
The paper is available at:
webis.de/publications...
6 months ago
0
10
6
reposted by
Maik Fröbe
Ferdinand Schlatt
6 months ago
@mrparryparry.bsky.social
presenting our work on reproducing TREC DL 2019 judgements and the implications for evaluating modern ranking models on modern collections. Paper:
arxiv.org/abs/2502.20937
loading . . .
Variations in Relevance Judgments and the Shelf Life of Test Collections
The fundamental property of Cranfield-style evaluations, that system rankings are stable even when assessors disagree on individual relevance decisions, was validated on traditional test collections. ...
https://arxiv.org/abs/2502.20937
1
4
3
Here are some of the statistics that I found very interesting from the
#SIGIR2025
opening session. (Over 1000 attendees!)
6 months ago
1
5
1
reposted by
Maik Fröbe
Bhaskar Mitra | ভাস্কর মিত্র
6 months ago
Hello TREC-ToTers! We have released the test queries for the TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track. Please see the guidelines for more information:
trec-tot.github.io/guidelines
. Run submission deadline will tentatively be in August.
#TREC2025
#TRECToT
#TREC2025ToT
Please spread the word!
add a skeleton here at some point
0
3
4
reposted by
Maik Fröbe
Ferdinand Schlatt
6 months ago
Thank you Carlos for the shout-out of Lightning IR in the LSR tutorial at
#SIGIR2025
If you want to fine your own LSR models, check out our framework at
github.com/webis-de/lig...
0
7
5
reposted by
Maik Fröbe
IRRJ
7 months ago
Never seen our editor in chief, Djoerd Hiemstra, more happy than today, holding a copy of the first issue of
#irrj
0
14
6
Do not forget to participate in the
#TREC2025
Tip-of-the-Tongue (ToT) Track :) The corpus and baselines (with run files) are now available and easily accessible via the ir_datasets API and the HuggingFace Datasets API. More details are available at:
trec-tot.github.io/guidelines
7 months ago
0
11
7
The deadline for submissions to the ReNeuIR workshop at
#SIGIR2025
is extended to June 10 😸 Details:
reneuir.org
#ReNeuIr2025
#SIGIR25
loading . . .
ReNeuIR’25
Workshop on Reaching Efficiency in Neural Information Retrieval
https://reneuir.org
8 months ago
0
4
3
reposted by
Maik Fröbe
Bhaskar Mitra | ভাস্কর মিত্র
9 months ago
Hello TREC-ToTers! 👋🏽 Excited to announce the release of TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track guidelines:
trec-tot.github.io/guidelines
. We will release test queries in July and run submission deadline will be in August.
#TREC2025
#TRECToT
#TREC2025ToT
Please register to participate:
loading . . .
TREC 2025 Tip-of-the-Tongue (ToT) Track
Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.
https://trec-tot.github.io/guidelines
0
4
3
Today I had the pleasure to talk about child-safe search at
#ECIR2025
. We created an cranfield-style evaluation dataset to contrast relevance with harm in web search scenarios. Details:
webis.de/publications...
10 months ago
0
6
2
The Workshop on Open Web Search just finished
#WOWS2025
#ECIR2025
. It was a very cool experience with many interesting talks. Lets hope we can do it again next year at
#ECIR2026
in Delft :)
10 months ago
0
8
5
The Workshop on Open Web Search at
#ECIR2025
just starts with a keynote by
@claclarke.bsky.social
on Annotative Indexing.
#WOWS25
#WOWS2025
#ECIR25
10 months ago
0
10
5
reposted by
Maik Fröbe
Ferdinand Schlatt
10 months ago
Honored to receive the best short paper award and best paper honourable mention award at
#ECIR2025
. Thank you to all co-authors
@maik-froebe.bsky.social
,
@hscells.bsky.social
, Shengyao Zhuang,
@bevankoopman.bsky.social
, Guido Zuccon, Benno Stein,
@martin-potthast.com
,
@matthias-hagen.bsky.social
🥳
1
17
4
Now we have
@fschlatt.bsky.social
on the
#ECIR2025
stage predenting the research on the Set-Encoder. The paper is online at:
webis.de/publications...
10 months ago
0
9
4
An cool example of the Circle of Research at
#ECIR2025
(or was it the Circle of Life?): Yesterday, I saw Ronak presenting research to Ferdi, today it was the other way around :)
10 months ago
0
4
0
I was very happy to talk about corpus subsampling at
#ECIR2025
today. Please find the paper at
webis.de/publications...
And lat bur not least, here are some of my favorite impressions of the first day of ECIR :)
10 months ago
0
8
2
reposted by
Maik Fröbe
Webis Group
10 months ago
📢 Our paper "The Viability of Crowdsourcing for RAG Evaluation" has been accepted to
#SIGIR2025
! We compared how good humans and LLMs are at writing and judging RAG responses, assembling 1800+ responses across 3 styles, and 47K+ pairwise judgments in 7 quality dimensions. 🧵➡️
1
12
7
reposted by
Maik Fröbe
IRRJ
10 months ago
Paper accepted at
#sigir2025
, but you had a hard time keeping the page limit? Congratulations, you can now submit an extended version with 50% "new" material to
#irrj
!
https://irrj.org/second-call-for-papers
loading . . .
Call for Papers for Issue 2 | Information Retrieval Research
https://irrj.org/second-call-for-papers
0
4
4
Some impressions from
#WSDM2025
from two weeks ago :)
10 months ago
1
4
0
The panel at the
#LLM4Eval2025
workshop just start
#WSDM2025
10 months ago
0
6
0
reposted by
Maik Fröbe
Danny To Eun Kim
11 months ago
🚨New Breakthrough in Tip-of-the-Tongue (TOT) Retrieval Research! We address data limitations and offer a fresh evaluation method for these complex queries. Curious how TREC TOT track test queries are created? Check out this thread 🧵 and our paper 📄:
arxiv.org/abs/2502.17776
loading . . .
Tip of the Tongue Query Elicitation for Simulated Evaluation
Tip-of-the-tongue (TOT) search occurs when a user struggles to recall a specific identifier, such as a document title. While common, existing search systems often fail to effectively support TOT scena...
https://arxiv.org/abs/2502.17776
2
17
8
reposted by
Maik Fröbe
Webis Group
11 months ago
PAN 2025 Call for Participation: Shared Tasks on Authorship Analysis, Computational Ethics, and Originality We'd like to invite you to participate in the following shared tasks at PAN 2025 held in conjunction with the CLEF conference in Madrid, Spain. Find out more at
pan.webis.de/clef25/pan25...
loading . . .
https://pan.webis.de/clef25/pan25-web
1
9
7
reposted by
Maik Fröbe
IRRJ
11 months ago
At
#irrj
, everyone gets a chance to publish
#openaccess
. Our article processing charges are: zero dollar, zero euro, zero yuan, zero real, zero ruble, zero rand, zero yen, zero rupees, zero pound, zero rial, zero peso, ...
0
6
2
reposted by
Maik Fröbe
Arjen P. de Vries Timmers 🕊️
11 months ago
Deadline for
#ossym25
was extended to March 17h! the #ossym25 Open Search Symposium brings together the European and international Open Search community for the seventh time. 8-10 October 2025 Helsinki, Finland
https://indico.cern.ch/event/1477549/
loading . . .
OSSYM 2025 - 7th International Open Search Symposium
8 - 10 October 2025 Continuing the successful conference series from the past 6 years, the #ossym25 Open Search Symposium brings together the European and international Open Search community for the seventh time. The hybrid conference provides a forum to discuss and further develop the ideas and concepts of open web search and related topics in various formats including scientific talks, panels, workshops, demonstrations, student challenges and informal discussion spaces. Participants...
https://indico.cern.ch/event/1477549/
0
3
3
reposted by
Maik Fröbe
11 months ago
🚨 New Pre-Print! 🚨 Reviewer 2 has once again asked for DL’19, what can you say in rebuttal? To help, we have re-annotated DL’19. Work done with @maik_froebe.bsky.social,
@hscells.bsky.social
, @fschlatt1.bsky.social, Guglielmo Faggioli, Saber Zerhoudi,
@macavaney.bsky.social
, Eugene Yang 🧵
1
6
3
reposted by
Maik Fröbe
Webis Group
about 1 year ago
2nd International Workshop on Open Web Search: CfP We invite you to the
#ECIR2025
Workshop on Open Web Search
#wows2025
. Please consider to submit to the scientific track or the WOWS-Eval shared task to enrich the Open Web Index with relevance judgments. Details:
opensearchfoundation.org/wows2025
loading . . .
1st International Workshop on Open Web Search #wows2024 - 28 March 2024
Discuss ideas and approaches to open up the web search ecosystem!
https://opensearchfoundation.org/wows2025
0
13
4
reposted by
Maik Fröbe
arxiv cs.IR
about 1 year ago
Charles L. A. Clarke, Laura Dietz LLM-based relevance assessment still can't replace human relevance assessment
https://arxiv.org/abs/2412.17156
0
6
1
reposted by
Maik Fröbe
about 1 year ago
What a team of keynote speakers. I must confess seeing that Steve Robertson will be there is a thrill. One of the legends of information retrieval reflecting on the field.
#sigir2025
sigir2025.dei.unipd.it/keynote-spea...
loading . . .
SIGIR 2025, Padua, 13-18 July | Keynotes
The SIGIR 2025 keynotes are held by esteemed speakers: Robertson S., Gurevych I. and Frieder O., who will cover topics that range from AI in medical search and ecommendation to BM25 and probabilistic ...
https://sigir2025.dei.unipd.it/keynote-speakers.html
0
10
5
reposted by
Maik Fröbe
Arjen P. de Vries Timmers 🕊️
about 1 year ago
New preprint of WSDM demo by @maik_froebe @matthias and Ferdinand Schlatt Lightning IR: Straightforward Fine-tuning and Inference of Transformer-based Language Models for Information Retrieval
https://arxiv.org/abs/2411.04677
https://webis.de/lightning-ir/
loading . . .
Lightning IR: Straightforward Fine-tuning and Inference of Transformer-based Language Models for Information Retrieval
A wide range of transformer-based language models have been proposed for information retrieval tasks. However, including transformer-based models in retrieval pipelines is often complex and requires substantial engineering effort. In this paper, we introduce Lightning IR, an easy-to-use PyTorch Lightning-based framework for applying transformer-based language models in retrieval scenarios. Lightning IR provides a modular and extensible architecture that supports all stages of a retrieval pipeline: from fine-tuning and indexing to searching and re-ranking. Designed to be scalable and reproducible, Lightning IR is available as open-source: https://github.com/webis-de/lightning-ir.
https://arxiv.org/abs/2411.04677
0
9
5
reposted by
Maik Fröbe
Christopher Schröder
about 1 year ago
🐣 New release: small-text v2.0.0.dev1 With Small Language Models on the rise, the new version of small-text has been long overdue! Despite the generative AI hype, many real-world tasks still rely on supervised learning—which is reliant on labeled data.
#activelearning
#nlproc
#nlp
#llms
3
41
8
I can relate to that 😺😸
add a skeleton here at some point
about 1 year ago
0
1
0
The
#TREC2024
conference just started. Turns out that BM25 is turning 30 🥳
#TREC
#TREC24
about 1 year ago
1
28
9
reposted by
Maik Fröbe
Webis Group
over 1 year ago
Goodbye Washington! We had a fantastic week with interesting talks, discussions, and new ideas at
#SIGIR24
#SIGIR2024
. We hope to see you all again next year in Italy :)
https://x.com/webis_de/status/1815115279510208625/photo/1
0
3
1
reposted by
Maik Fröbe
Martin Potthast
about 1 year ago
Time for a starter pack on information retrieval:
go.bsky.app/MXPJoTn
add a skeleton here at some point
17
43
21
you reached the end!!
feeds!
log in