Max Bartolo
@maxbartolo.bsky.social
📤 294
📥 27
📝 15
Building robust LLMs @Cohere
reposted by
Max Bartolo
Lisa Alazraki
4 months ago
Thrilled to share our new preprint on Reinforcement Learning for Reverse Engineering (RLRE) 🚀 We demonstrate that human preferences can be reverse engineered effectively by pipelining LLMs to optimise upstream preambles via reinforcement learning 🧵⬇️
1
9
1
I'm excited to share the tech report for our
@cohere.com
@cohereforai.bsky.social
Command A and Command R7B models. We highlight our novel approach to model training including self-refinement algorithms and model merging techniques at scale. Read more below! ⬇️
6 months ago
1
11
7
I really enjoyed my MLST chat with Tim
@neuripsconf.bsky.social
about the research we've been doing on reasoning, robustness and human feedback. If you have an hour to spare and are interested in AI robustness, it may be worth a listen 🎧 Check it out at
youtu.be/DL7qwmWWk88?...
6 months ago
0
8
3
Check out
@lisaalaz.bsky.social
's internship work with us
@cohere.com
questioning the rationale behind rationales 🔥
add a skeleton here at some point
7 months ago
0
4
1
Super excited to see PRISM recognised as a
#NeurIPS2024
best paper. This was an incredible large-scale effort by
@hannahrosekirk.bsky.social
and fantastic collaborators. If you're interested in human feedback, check it out, there are 100+ pages of detailed insights! 🔥
add a skeleton here at some point
10 months ago
0
9
1
reposted by
Max Bartolo
Adina Williams
10 months ago
Our paper PRISM alignment won a best paper award at
#neurips2024
! All credits to
@hannahrosekirk.bsky.social
A.Whitefield, P.Röttger, A.M.Bean, K.Margatina, R.Mosquera-Gomez, J.Ciro,
@maxbartolo.bsky.social
H.He, B.Vidgen, S.Hale Catch Hannah tomorrow at
neurips.cc/virtual/2024/poster/97804
loading . . .
https://blog.neurips
2
67
9
reposted by
Max Bartolo
Tim Rocktäschel
10 months ago
Excited to reveal Genie 2, our most capable foundation world model that, given a single prompt image, can generate an endless variety of action-controllable, playable 3D worlds. Fantastic cross-team effort by the Open-Endedness Team and many other teams at Google DeepMind! 🧞
add a skeleton here at some point
3
94
21
Looking forward to
@neuripsconf.bsky.social
#NeurIPS
#NeurIPS2024
in Vancouver next week! ❄️ Reach out (or pop by the
@cohere.com
booth) if you want to chat about human feedback, robustness and reasoning, prompt optimisation, adversarial data, glitch tokens, evaluation, or anything else!
10 months ago
0
11
0
Sparks of multi-hop reasoning ✨
add a skeleton here at some point
10 months ago
0
9
2
Fun to see Douwe's Dynabench plot continue to inspire new groundbreaking benchmarking work!
add a skeleton here at some point
10 months ago
0
4
0
🚨 LLMs can learn to reason from procedural knowledge in pretraining data! 🚨 I particularly enjoy research where the evidence contradicts our initial hypothesis. If you're interested in LLM reasoning, check out the 60+ pages of in-depth work at
arxiv.org/abs/2411.12580
add a skeleton here at some point
10 months ago
4
67
8
reposted by
Max Bartolo
atla
10 months ago
We launched Judge Arena with
@huggingface.bsky.social
@clefourrier.bsky.social
- a platform that lets you easily compare models as judges side-by-side and vote for the best evaluation Check out the live leaderboard and start voting now 🤗
add a skeleton here at some point
0
10
4
you reached the end!!
feeds!
log in