Tom Sherborne
@tomsherborne.bsky.social
📤 99
📥 41
📝 8
MTS @ Cohere on code. Views not my employer’s.
We are hiring
@cohere.com
for an Agent Infrastructure Engineer! If you want to work on building the next generation of agent models for
#RAG
,
#ToolUse
#Code
,
#Reasoning
and more then apply here. DM me if you have any Qs.
jobs.ashbyhq.com/cohere/3f797...
loading . . .
Member of Technical Staff, Agent Infrastructure Engineer
At Cohere, we have one of the highest compute-to-engineers ratios in the world. We do not delineate strongly between engineering and research: everyone contributes to writing production code and condu...
https://jobs.ashbyhq.com/cohere/3f797fee-430a-4fdd-8c06-acf7899de5b8
8 months ago
0
0
0
I’ll be at
@neuripsconf.bsky.social
all next week! Find me mostly at the
@cohere.com
booth / DM me to talk code / post-training / life at Cohere 🇨🇦
10 months ago
0
2
0
My PhD thesis "Modelling Cross-lingual Transfer For Semantic Parsing" is finally submitted! 🎉🎉🎉
over 1 year ago
0
2
1
TRAM is accepted to
#ICLR2024
as a Spotlight! See you in Vienna 🇦🇹! Thanks to
@nsaphra.bsky.social
, Pradeep Dasigi, Hao Peng and
@ai2.bsky.social
Vision experiments, more discussion and visuals coming soon to the camera ready!
add a skeleton here at some point
over 1 year ago
0
1
1
reposted by
Tom Sherborne
Clara Na
almost 2 years ago
Really excited about this one and had such a blast working with
@siree.sh
@abertsch.bsky.social
@davidthewid.bsky.social
@strubell.bsky.social
! Please read our paper and reach out with any questions, we'd love to chat! See y'all in Singapore :)
add a skeleton here at some point
1
8
3
🚨 new paper 🚨 Can we train for flat minima with less catastrophic OOD forgetting? We propose Trust Region Aware Minimization for smoothness in parameters+representations. TL;DR representations matter just as much!
arxiv.org/abs/2310.03646
w/
@nsaphra.bsky.social
Pradeep Dasigi + Hao Peng
almost 2 years ago
1
10
3
you reached the end!!
feeds!
log in