Tom Schaul
@schaul.bsky.social
๐ค 3212
๐ฅ 291
๐ 28
RL researcher at DeepMind
https://schaul.site44.com/
๐ฑ๐บ
Where do some of Reinforcement Learning's great thinkers stand today? Find out! Keynotes of the RL Conference are online:
www.youtube.com/playlist?lis...
Wanting vs liking, Agent factories, Theoretical limit of LLMs, Pluralist value, RL teachers, Knowledge flywheels (guess who talked about which!)
about 1 month ago
1
74
24
reposted by
Tom Schaul
Aditi Mavalankar
3 months ago
On my way to
#ICML2025
to present our algorithm that strongly scales with inference compute, in both performance and sample diversity! ๐ Reach out if youโd like to chat more!
add a skeleton here at some point
0
8
2
Deadline to apply is this Wednesday!
add a skeleton here at some point
4 months ago
0
4
1
Ever thought of joining DeepMind's RL team? We're recruiting for a research engineering role in London:
job-boards.greenhouse.io/deepmind/job...
Please spread the word!
loading . . .
Research Engineer, Reinforcement Learning
London, UK
https://job-boards.greenhouse.io/deepmind/jobs/6688132
4 months ago
1
28
9
When faced with a challenge (like debugging) it helps to think back to examples of how you've overcome challenges in the past. Same for LLMs! The method we introduce in this paper is efficient because examples are chosen for their complementarity, leading to much steeper inference-time scaling! ๐งช
add a skeleton here at some point
6 months ago
0
18
5
Some extra motivation for those of you in RLC deadline mode: our line-up of keynote speakers -- as all accepted papers get a talk, they may attend yours!
@rl-conference.bsky.social
7 months ago
0
36
11
200 great visualisations: 200 facets and nuances of 1 planetary story.
add a skeleton here at some point
8 months ago
0
5
0
reposted by
Tom Schaul
Eugene Vinitsky ๐
8 months ago
Reposting David Silver's talk about how RL is the way to intelligence. No particular reason
www.youtube.com/watch?v=pkpJ...
loading . . .
David Silver - Towards Superhuman Intelligence - RLC 2024
YouTube video by Reinforcement Learning Conference
https://www.youtube.com/watch?v=pkpJMNjvgXw&ab_channel=ReinforcementLearningConference
0
71
7
reposted by
Tom Schaul
Reinforcement Learning Conference
9 months ago
Excited to announce the first RLC 2025 keynote speaker, a researcher who needs little introduction, whose textbook we've all read, and who keeps pushing the frontier on RL with human-level sample efficiency
0
51
4
Could language games (and playing many of them) be the renewable energy that Ilya was hinting at yesterday? They do address two core challenges of self-improvement -- let's discuss! My talk is today at 11:40am, West Meeting Room 220-222,
#NeurIPS2024
language-gamification.github.io/schedule/
add a skeleton here at some point
10 months ago
0
27
1
Don't get to talk enough about RL during
#neurips2024
? Then join us for more, tomorrow night at The Pearl!
add a skeleton here at some point
10 months ago
0
14
0
This year's (first-ever) RL conference was a breath of fresh air! And now that it's established, the next edition is likely to be even better: Consider sending your best and most original RL work there, and then join us in Edmonton next summer!
add a skeleton here at some point
10 months ago
0
19
3
Are there limits to what you can learn in a closed system? Do we need human feedback in training? Is scale all we need? Should we play language games? What even is "recursive self-improvement"? Thoughts about this and more here:
arxiv.org/abs/2411.16905
loading . . .
Boundless Socratic Learning with Language Games
An agent trained within a closed system can master any desired capability, as long as the following three conditions hold: (a) it receives sufficiently informative and aligned feedback, (b) its covera...
https://arxiv.org/abs/2411.16905
10 months ago
7
111
24
reposted by
Tom Schaul
Glen Berseth
10 months ago
RLC will be held at the Univ. of Alberta, Edmonton, in 2025. I'm happy to say that we now have the conference's website out: rl-conference.cc/index.html Looking forward to seeing you all there!
@rl-conference.bsky.social
#reinforcementlearning
2
60
21
Twitter-optional NeurIPS? Sounds like an appealing prospect!
add a skeleton here at some point
11 months ago
1
12
1
you reached the end!!
feeds!
log in