Joe Stacey
@joestacey.bsky.social
π€ 2528
π₯ 2059
π 143
NLP PhD student at Imperial College London and Apple AI/ML Scholar.
pinned post!
We have a fun new
#NLProc
paper on arXiv about improving the robustness of fine-tuned NLI models! Have a look :)
arxiv.org/abs/2505.20209
6 months ago
1
6
0
reposted by
Joe Stacey
Lisa Alazraki
3 months ago
We have released
#AgentCoMa
, an agentic reasoning benchmark where each task requires a mix of commonsense and math to be solved π§ LLM agents performing real-world tasks should be able to combine these different types of reasoning, but are they fit for the job? π€ π§΅β¬οΈ
1
4
2
Hereβs my review of the US after a few days here. Did I miss anything? π€ The good: - Americans are the most charming, friendly and hospitable people - itβs super fun how the country is split into states that all have different laws and stuff, with different vibes state to state
4 months ago
1
1
0
Any chance Keir Starmer can reshuffle himself in as foreign secretary, and shuffle in another prime minister who actually has some vague idea about what they want to achieve? ππ€¦ββοΈ
4 months ago
0
0
0
Finally the heatwave has ended, and the UK is once again a bearable place to be ππ If you have any UK-based collaborations, their productivity is about to increase like 10 fold
4 months ago
0
2
0
We have a fun new
#NLProc
paper on arXiv about improving the robustness of fine-tuned NLI models! Have a look :)
arxiv.org/abs/2505.20209
6 months ago
1
6
0
Should I use an LLM to help refine my paper writing for the ARR deadline? π€π€ It will improve the paper for sure, but probably also making the tone a whole lot more annoying
6 months ago
1
0
0
reposted by
Joe Stacey
Juan Diego Rodriguez
7 months ago
If you're at
#NAACL2025
and want to hear about similarity effects for property inheritance in LMs, please stop by! I will be presenting this work on Wednesday at the 11-12:30 poster session on Interpretability & analysis for language models (Hall 3).
aclanthology.org/2025.naacl-l...
add a skeleton here at some point
0
12
4
reposted by
Joe Stacey
Imperial NLP
7 months ago
Excited to share our ICLR and NAACL papers! Please come and say hi, we're super friendly :)
0
14
5
Wow, the old ITV Agatha Christieβs Poirot is brilliant. Some tv for 1989β¦ Gonna go binge watch the 13 seasons now π
7 months ago
0
1
0
I feel like the length of the ARR author rebuttals keep growing every cycle Is this a good thing for authors or reviewers that the responses can be so long? I feel like itβs a bit sub-optimal for both at the moment
7 months ago
3
4
0
reposted by
Joe Stacey
Nishant Balepur
8 months ago
Had a great time presenting my research on building more helpful QA systems
@imperialcollegeldn.bsky.social
! Thank you
@joestacey.bsky.social
for letting me invite myself π«Ά And loved visiting London+Edinburgh this week, hope to be back soon! π
0
6
2
Was fantastic to have you here at Imperial! Thanks for your excellent talk, and looking forward to following what you do next π
add a skeleton here at some point
8 months ago
0
4
0
reposted by
Joe Stacey
Lisa Alazraki
9 months ago
Do LLMs need rationales for learning from mistakes? π€ When LLMs learn from previous incorrect answers, they typically observe corrective feedback in the form of rationales explaining each mistake. In our new preprint, we find these rationales do not help, in fact they hurt performance! π§΅
1
21
12
reposted by
Joe Stacey
Marek Rei
8 months ago
Today was the launch event of the
@genaihub.bsky.social
. We announced the development of Nightingale AI, a foundation world model for health. It was great to be on the panel for GenAI in Healthcare, among such amazing experts.
www.genai.ac.uk
1
5
3
Thanks so much to everyone who has helped make this switch to BlueSky work. Honestly, making this switch was a pretty massive achievement, so thanks everyone for contributing β€οΈβ€οΈ
9 months ago
2
12
0
This paper is really cool. They decompose NLI (and defeasible NLI) hypotheses into atoms, and then use these atoms to measure the logical consistency of LLMs. E.g. for an entailment NLI example, each hypothesis atom should also be entailed by the premise. Very nice idea ππ
9 months ago
2
15
3
Iβm a week into my trip from Cairo to Riyadh, and wow what a place Egypt is! Honestly its been one of the funnest places Iβve travelled, and for sure I need to come back again Crossed into Aqaba (Jordan) yesterday, so now onto Saudi π
9 months ago
0
4
0
Iβm going away to do a bit of travelling, going overland from Cairo to Riyadh π I love travelling in the Middle East so it should be interesting Iβve got that feeling of nervous excitement I always get before a trip π¬π
10 months ago
0
5
0
Insanely jealous to everyone who has papers at
#NAACL
in Albuquerque! Albuquerque just sounds so exotic, and is such a cool place for a conference. No offence to Vienna, but Albuquerque sounds way more fun π
10 months ago
1
0
0
Feeling gooooood after submitting my
#ARR
reviews early π Time to enjoy the weekend! πΊ
10 months ago
0
2
0
I was super excited to read the ModernBERT paper! Love this interest in creating a better encoder model. "ModernBERT-base is the first encoder to beat DeBERTaV3-base since its release in 2021" π€―-
arxiv.org/pdf/2412.13663
Pretty amazing how successful DeBERTa has been!
10 months ago
0
16
0
Excited to start my
#ARR
#NLP
reviews! I'll try my best and see if I can get 100% of my reviews to be 'great' this round. If you didn't see it already, ARR publishes how many of your reviews are considered to be 'great':
stats.aclrollingreview.org
Join me for the challenge :)
loading . . .
ARR Dashboard
https://stats.aclrollingreview.org
10 months ago
1
12
3
At some point in life I realised I actually really love travelling by train. Kind of a strange hobby, but wow it is fun π Here are my top ten train journeys so far.
10 months ago
4
35
2
Imperial are hiring computing lecturers (including for AI/ML/NLP)! Here's a little thread about why you should consider applying :)
add a skeleton here at some point
11 months ago
1
4
1
Made it to northern Sweden (Kiruna) by train from London. Freezing cold with northern lights π Just over a week ago and I was in the crazy Miami heat for
#EMNLP2024
12 months ago
1
24
0
Okay genius idea to improve quality of
#nlp
#arr
reviews. Literally give gold stars to the best reviewers, visible on open review next to your anonymously ID during review process. Hereβs why it would work, and why would you should RT this fab idea:
12 months ago
3
27
6
This papers' findings about testing LLMs on NLI aligns with many of personal thoughts: 1) NLI remains a difficult task for LLMs 2) Having more few-shot examples is helpful (in my view, helping LLMs better understand class boundaries) 3) Incorrect predictions are often a result of ambiguous labels
12 months ago
1
27
3
Iβve seen some pretty amazing metros before (like Moscow), but wow Stockholm is wild. Never seen anything like it!
12 months ago
1
13
0
If anyone is getting annoyed with their BlueSky feed, try 'Popular with Friends' - you can add this from the 'Feeds' tab. I'm finding it works a bit better for me, and is more like what I had on Twitter. Thanks
@lasha.bsky.social
for suggesting!
12 months ago
1
2
0
reposted by
Joe Stacey
Pasquale Minervini
12 months ago
Starter pack for University of Edinburgh researchers done by the amazing
ramandutt4.bsky.social
-
go.bsky.app/KRNDkN7
loading . . .
University of Edinburgh Starter Pack
Join the conversation
http://go.bsky.app/KRNDkN7
9
35
10
Now I have like a gazillion new Bluesky followers, posting a link again to a blog post about my
#EMNLP2024
and EMNLP 2022 papers. Itβs a fun 10 minute read about our ideas on interpretable neural architectures. β€οΈ to my fantastic collaborators
www.marekrei.com/blog/creatin...
loading . . .
Creating Interpretable Models with Atomic Inference - Marek Rei
This is a guest post from Joe Stacey about our quest to create interpretable Natural Language Inference (NLI) models. In this post he will shareβ¦
https://www.marekrei.com/blog/creating-interpretable-models-with-atomic-inference/
12 months ago
0
31
2
Welcome to Bluesky to more of our NLP researchers at Imperial!! Looking forward to following everyone's work on here. To follow us all click 'follow all' in the starter pack below
go.bsky.app/Bv5thAb
add a skeleton here at some point
12 months ago
3
20
7
Just about to start my next big train journey, this time from London to Norway (Narvik). Just 7 countries to get the train through (UK, France, Belgium, Germany, Denmark, Sweden, Norway) π π should be epic
12 months ago
4
26
0
Cool thing about this thread now is you can see the likes per tip! Almost like voting on the best ones. So far tip #1 the clear winner Loving the BlueSky engagement π π
add a skeleton here at some point
12 months ago
0
7
0
You know itβs cold when little Hamish starts hugging the radiator β€οΈ
12 months ago
0
12
0
After going to NAACL, ACL and
#EMNLP2024
this year, here are a few tips Iβve picked up about attending
#NLP
conferences. Would love to hear any other tips if you have them! This proved very popular on another (more evil) social media platform, so sharing here also π My 10 tips:
12 months ago
14
84
18
βEntering Georgia and South Carolina, last chance to buy alcohol!β Had no idea, but i think alcohol sales are prohibited on Sunday in these states! The train is so exciting π
12 months ago
0
1
0
I love the Amtrak dining cars!! How pretty is this. Really good sit down breakfast, lunch and dinners. And the best bit is all the amazing people you meet and speak to at the meals.
12 months ago
1
17
1
Just boarded my train from Miami to New York post
#EMNLP2024
and super excited!! Amtrak trains are the fantastic, and Iβve got my own little room with two seats, a bed above, and toilet next to the bed. The toilet thing is a bit weird though if you have two to a room
12 months ago
3
25
1
Such a fantastic reaction to our paper today. so happy π Chocolates went down well too! Massive thanks to everyone for all your ideas and feedback
12 months ago
0
11
0
Excited to present our
#EMNLP2024
paper as a poster this morning at 10:30 (in the downstairs poster room)! It's cool work about creating inherently interpretable models, and (as always) I will have chocolate to give out π Paper is here:
aclanthology.org/2024.emnlp-m...
12 months ago
0
5
0
So excited to fly out to
#EMNLP2024
tomorrow! Would love to chat sometime π Iβm on the conference app so easy to message me there, or just come and say hi! Would love to hear about your research Hopefully I wonβt be too jet lagged π βοΈ
#NLP
about 1 year ago
0
3
0
reposted by
Joe Stacey
Juan Diego Rodriguez
about 1 year ago
How do language models organize concepts and their properties? Do they use taxonomies to infer new properties, or infer based on concept similarities? Apparently, both! π New paper with my fantastic collaborators
@amuuueller.bsky.social
and
@kanishka.bsky.social
4
109
28
#NLP
Bluesky really growing quick. Going to need a lot of effort early on to replace Twitter, but a very promising start! π
about 1 year ago
0
17
1
For the day I get back from
#EMNLP2024
Iβve booked big train trip from London to Narvik in Norway. Youβre probably wondering how that even works without getting on a boat. Well, it involves taking the train through quite a few countries ππ
about 1 year ago
0
1
0
Any tips people have in advance of
#EMNLP2024
for good poster presentations? Itβs such a small thing, but I always like when people acknowledge you when youβre waiting for a poster (when the presenters busy talking to someone else) π
about 1 year ago
3
5
0
I'm new to BlueSky, but excited to be here! π I've written up a little blog post about my EMNLP 2022 and
#EMNLP2024
papers about interpretable neural architectures in
#NLP
. Great way to learn about our work with minimal paper reading :) Let me know what you think!
www.marekrei.com/blog/creatin...
about 1 year ago
0
31
2
you reached the end!!
feeds!
log in