hardmaru
@hardmaru.bsky.social
📤 4433
📥 721
📝 184
Co-Founder & CEO, Sakana AI 🎏 →
@sakanaai.bsky.social
Visit →
https://sakana.ai/
Reproducing all of Jürgen Schmidhuber’s papers (1990-2025) using an AI coding assistant. Cool project by Yaroslav! It even reproduced the “World Models” paper by me and Schmidhuber (2018) using a toy environment, with a full VAE + RNN world model implementation. Project:
github.com/cybertronai/...
loading . . .
23 days ago
1
43
6
reposted by
hardmaru
Sakana AI
24 days ago
How do we make LLMs faster and lighter? Don’t force the GPU to adapt to sparsity. Reshape the sparsity to fit the GPU! Our latest work with NVIDIA introduces new CUDA kernels & data formats for faster inference and training of sparse transformer language models: Blog:
pub.sakana.ai/sparser-fast...
add a skeleton here at some point
0
25
2
Excited to share Sakana AI’s new
#ICML2026
paper in collaboration with NVIDIA: "Sparser, Faster, Lighter Transformer Language Models"
arxiv.org/abs/2603.23198
This work introduces new open-source GPU kernels and data formats for faster inference and training of sparse transformer LLMs: 🧵 Thread 👇
loading . . .
24 days ago
2
60
9
If GitHub were built in: Japan 🇯🇵 China 🇨🇳 North Korea 🇰🇵 The EU 🇪🇺
30 days ago
6
125
24
For the past few years, humans have been doing “prompt engineering” to coax the best performance out of different LLMs. In this work, we explored what happens if we train an AI to do that job instead. Link to our
#ICLR2026
paper:
arxiv.org/abs/2512.04388
Thread:
add a skeleton here at some point
about 1 month ago
2
37
12
reposted by
hardmaru
Sakana AI
about 1 month ago
Introducing our new work: “Learning to Orchestrate Agents in Natural Language with the Conductor” accepted at
#ICLR2026
arxiv.org/abs/2512.04388
What if we trained an AI not to solve problems directly, but to act as a manager that delegates tasks to a diverse team of other AIs? Thread:
2
38
7
reposted by
hardmaru
Sakana AI
about 1 month ago
日経クロステックの連載記事「R&Dを根底から変革、普及始まる『AI科学者』」にて、Sakana AIのResearch Scientist、Robert Langeへの取材記事が掲載されました。 当社のAIサイエンティストについて、テーマ探索から論文査読まで研究の全工程を自動的に遂行する仕組みと、その現状の到達点・限界について解説されています。 記事でも触れていただいているとおり、AIサイエンティストに関する論文は2026年3月に学術誌Natureに掲載されました。基盤となるAIモデルの性能向上に伴って生成される論文の質が改善されうることを実験的に示せた点は、本研究の重要な成果の一つです。
loading . . .
Sakana AIとGoogleのAI科学者、自律性に差 研究の種を生むのは人間
CraifはAI科学者で研究の時間を大幅に短縮し、NanoFrontierはAI科学者のプロセスを前提とした事業を立ち上げた。大手製薬などもAI科学者ツールの導入を始めている。AI科学者の全体像を整理する。
https://xtech.nikkei.com/atcl/nxt/column/18/03603/042600005/
1
3
1
Scaling up massive LLMs continues to yield incredible results. But to truly unlock their full potential, the next frontier is test-time compute and dynamic orchestration.
add a skeleton here at some point
about 1 month ago
2
36
3
reposted by
hardmaru
Sakana AI
about 1 month ago
What if instead of building one giant AI, we evolved a coordinator to orchestrate a diverse team of specialized AIs? 🐟 Excited to share our new
#ICLR2026
paper: “TRINITY: An Evolved LLM Coordinator”! Paper
arxiv.org/abs/2512.04695
OpenReview
openreview.net/forum?id=5Ha...
Fugu
sakana.ai/fugu-beta
3
56
11
We’ve been using Sakana Fugu internally for our own research and coding. Instead of relying on a single model, it dynamically orchestrates the best combination of open and closed models for any task. The future of AI is collective intelligence. Excited to open the beta API:
sakana.ai/fugu-beta
add a skeleton here at some point
about 1 month ago
1
42
6
reposted by
hardmaru
Sakana AI
about 1 month ago
Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes. 🐟 Fugu Mini: High-speed orchestration optimized for latency 🐡 Fugu Ultra: Full model pool utilization for deep complex reasoning Apply for the beta:
forms.gle/BtKkhc2CfLKk...
loading . . .
Sakana Fugu Beta Tester Application 🐟🐠
Thank you for your interest in joining the Sakana Fugu beta program! Please fill out the questionnaire below to apply. Application Deadline: May 5, 2026 (anywhere on Earth) Selected testers will rec...
https://forms.gle/BtKkhc2CfLKk1dvNA
0
4
1
reposted by
hardmaru
Sakana AI
about 1 month ago
We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system!
sakana.ai/fugu-beta
Fugu dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task, hits SOTA on SWE-Pro, GPQA-D, ALE-Bench!
3
17
4
Getting LLMs to simulate “true” randomness or generate diverse outputs is surprisingly difficult. We found a simple prompting trick that solves this by having the model generate and manipulate a random string. To be presented at
#ICLR2026
this week! Blog:
pub.sakana.ai/ssot
add a skeleton here at some point
about 1 month ago
3
44
8
reposted by
hardmaru
Sakana AI
about 1 month ago
Can LLMs flip coins in their heads? When prompted to "Flip a fair coin" 100 times, the heads to tails ratio drifts far from 50:50. LLMs can understand what the target probability should be, but generating outputs that faithfully follow a given distribution is a separate problem.
pub.sakana.ai/ssot
loading . . .
3
13
4
I am very proud of our team for releasing EDINET-Bench, and it is fantastic to see a Japanese financial dataset recognized at
#ICLR2026
this week. We need more diverse, non-English datasets to evaluate models in the real world. Paper:
openreview.net/forum?id=Dxn...
add a skeleton here at some point
about 1 month ago
1
18
2
Digital Ecosystems: Interactive Multi-Agent Neural Cellular Automata
pub.sakana.ai/digital-ecos...
add a skeleton here at some point
about 1 month ago
4
38
4
We are hiring Software Engineers in Tokyo to help us scale Sakana AI’s R&D efforts. If you are interested in building the data pipelines and full stack infrastructure needed to push the boundaries of automated scientific discovery, we would love to hear from you. 🗼🎌
sakana.ai/careers/#sof...
add a skeleton here at some point
about 2 months ago
1
10
3
A “Neural Computer” is built by adapting video generation architectures to train a World Model of an actual computer that can directly simulate a computer interface. Paper:
arxiv.org/abs/2604.06425
Code:
github.com/metauto-ai/N...
Cool work led by Mingchen Zhuge et al. from Schmidhuber’s lab!
loading . . .
about 2 months ago
2
75
14
I’m incredibly proud of The AI Scientist team for this milestone publication in Nature. We started this project to explore if foundation models could execute the entire research lifecycle. Seeing this work validated at this level is a special moment.
add a skeleton here at some point
2 months ago
2
32
2
Sakana AI 初の一般向けサービス Sakana Chat を公開しました🐟 Try Sakana Chat:
chat.sakana.ai
強力なWeb検索エージェントを備え、高速で信頼性の高い情報を引き出せます。 世界の高性能なオープンモデルには、開発元のバイアスが不可避的に内在しています。我々は独自の事後学習により、①これらのバイアスの除去、②日本の価値観の反映、③安全かつ文脈に即した適応を実現する技術を開発しました。 今回のリリースは、その技術実証の第一弾。国内で誰もが安心して使えるAIの選択肢の一つとして、ぜひお試しください!
add a skeleton here at some point
2 months ago
1
11
1
reposted by
hardmaru
Sakana AI
3 months ago
“When AI Discovers the Next Transformer” Full Interview on YouTube:
youtu.be/EInEmGaMRLc
Robert Lange (Sakana AI) joins Tim Scarfe (ML Street Talk) to discuss Shinka Evolve, a framework that combines LLMs with evolutionary algorithms to do open-ended program search.
1
15
2
Instead of forcing models to hold everything in an active context window, we can use hypernetworks to instantly compile documents and tasks directly into the model's weights. A step towards giving language models durable memory and fast adaptation. Blog:
pub.sakana.ai/doc-to-lora/
add a skeleton here at some point
3 months ago
2
103
18
reposted by
hardmaru
Sakana AI
4 months ago
「How Competition is Stifling AI Breakthroughs」 Sakana AI共同創業者 Llion Jones のTED AIトークが公開されました。目標を定めすぎないオープンエンドな研究がブレークスルーを生む理由、Transformerの成功が業界にもたらした状況、それを乗り越える次の構想と成果を語りました。
www.ted.com/talks/llion_...
loading . . .
How competition is stifling AI breakthroughs
Llion Jones cowrote "Attention Is All You Need," the seminal paper that introduced the transformer — the architecture that launched the generative AI revolution. Now he warns that the industry that gr...
https://www.ted.com/talks/llion_jones_how_competition_is_stifling_ai_breakthroughs
0
4
1
Our journey at Sakana AI is just getting started. We are looking for people to help us pioneer the next generation of AI—building from Japan to the world. Join us:
sakana.ai/careers
4 months ago
0
30
3
I founded Sakana AI after my time at Google, so it is incredibly meaningful to be able to partner with them now. It feels like a special connection to be working together again to advance the AI ecosystem in Japan.
sakana.ai/google#en
add a skeleton here at some point
4 months ago
3
49
2
reposted by
hardmaru
Sakana AI
4 months ago
Our work on The AI Scientist and ALE-Agent has already shown the power of these models. Now, we are scaling reliable AI in mission-critical sectors like finance and government to ensure the highest security and data sovereignty. Full details:
sakana.ai/google#en
loading . . .
Sakana AI
Sakana AI、Googleとの戦略的パートナーシップ締結を発表
https://sakana.ai/google#en
1
1
1
reposted by
hardmaru
Sakana AI
4 months ago
We are thrilled to announce a strategic partnership with Google! Google is also making a financial investment in Sakana AI to strengthen this collaboration. We are combining Google’s world-class products like Gemini and Gemma with our agile R&D to accelerate automated scientific discovery.
loading . . .
1
17
3
reposted by
hardmaru
Sakana AI
4 months ago
We just published an unofficial guide on what we look for when interviewing research candidates at Sakana AI. Written by Stefania Druga, Luke Darlow, and Llion Jones. The biggest differentiator? Understanding over implementation. Read it:
pub.sakana.ai/Unofficial_G...
1
23
2
reposted by
hardmaru
Sakana AI
4 months ago
RePo moves us toward models that intelligently curate their own working memory rather than passively accepting input order. Read the full breakdown on our website:
pub.sakana.ai/repo/
Paper:
arxiv.org/abs/2512.14391
loading . . .
RePo: Language Models with Context Re-Positioning
In-context learning is fundamental to modern Large Language Models (LLMs); however, prevailing architectures impose a rigid and fixed contextual structure by assigning linear or constant positional in...
https://arxiv.org/abs/2512.14391
0
18
4
reposted by
hardmaru
Sakana AI
4 months ago
Introducing RePo: Language Models with Context Re-Positioning Standard LLMs force a rigid linear structure on context, treating physical proximity as relevance. Cognitive Load Theory suggests this is inefficient—models waste capacity managing noise instead of reasoning.
arxiv.org/abs/2512.14391
loading . . .
1
56
12
reposted by
hardmaru
Sakana AI
5 months ago
2026 is just getting started 🚀✨ We are hiring. Join our team in Tokyo!
sakana.ai/careers
0
13
1
reposted by
hardmaru
Sakana AI
5 months ago
AI導入は「雇用不安小さい正社員制度が強みに」 日経ビジネスにて、Sakana AI CEO
@hardmaru.bsky.social
のインタビューが公開されました。企業へのAI実装が本格化する2026年における現状と課題、そして日本企業の組織文化がAI導入にとってポジティブに働く可能性について語りました。
business.nikkei.com/atcl/gen/19/...
【記事のハイライト】🧵
loading . . .
サカナAIのデビッド・ハCEO、AI導入は「雇用不安小さい正社員制度が強みに」
AIファーストを掲げる企業が増加する中、経営者はAIのリスクを正しく理解し、適切に導入を進める必要がある。国内最大級のユニコーンで、企業向けのAIソリューション開発を行うSakana AI(サカナAI、東京・港)のデビッド・ハ最高経営責任者(CEO)に、日本企業のAI導入における課題を聞いた。
https://business.nikkei.com/atcl/gen/19/00831/010800008/
1
3
1
reposted by
hardmaru
Sakana AI
5 months ago
Introducing DroPE: Extending Context by Dropping Positional Embeddings We found embeddings like RoPE aid training but bottleneck long-sequence generalization. Our solution’s simple: treat them as a temporary training scaffold, not a permanent necessity.
arxiv.org/abs/2512.12167
pub.sakana.ai/DroPE
loading . . .
2
117
28
One of my favorite findings: Positional embeddings are just training wheels. They help convergence but hurt long-context generalization. We found that if you simply delete them after pretraining and recalibrate for <1% of the original budget, you unlock massive context windows. Smarter, not harder.
add a skeleton here at some point
5 months ago
8
218
34
reposted by
hardmaru
Sakana AI
5 months ago
We are taking our technology far beyond competitive programming to unlock a new era of AI-driven discovery. We are hiring. Join our team in Tokyo.
sakana.ai/careers/#sof...
0
9
2
We’re hiring.
sakana.ai/careers/#sof...
add a skeleton here at some point
5 months ago
0
28
7
When agents compete for limited resources, intelligence reorganizes around survival, not elegance.
5 months ago
1
18
4
Survival of the fittest code! Our paper explores LLMs driving an evolutionary arms race in Core War, where assembly programs fight each other. We task LLMs with evolving "Warriors" in a virtual machine, producing chaotic, self-modifying code dynamics. Blog:
sakana.ai/drq
Paper:
pub.sakana.ai/drq/
add a skeleton here at some point
5 months ago
2
41
10
reposted by
hardmaru
Sakana AI
5 months ago
Introducing Digital Red Queen (DRQ): Adversarial Program Evolution in Core War with LLMs. In this work, we explore how LLMs can drive open-ended adversarial evolution of programs within the Core War environment. Blog
sakana.ai/drq
Website
pub.sakana.ai/drq/
ArXiv
arxiv.org/abs/2601.03335
Thread:
loading . . .
2
32
9
So proud of Team Sakana AI for pulling this off! We managed to get an agent to rank #1 in a difficult heuristic optimization contest. We leaned heavily into test-time inference using a mix of frontier models. The agent spent $1,300 to autonomously discover an algorithm that beat the human baseline.
add a skeleton here at some point
5 months ago
2
35
4
reposted by
hardmaru
Sakana AI
5 months ago
Our AI agent has achieved 1st place in a competitive optimization programming contest against over 800 human participants. Blog:
sakana.ai/ahc058
Thread:
2
20
4
Happy New Year! ⛩️
add a skeleton here at some point
5 months ago
0
12
0
Sakana AI’s office looks like this.
add a skeleton here at some point
5 months ago
0
2
1
Software Engineering as a profession will continue to fundamentally change in 2026. Humans will need to learn to co-adapt to this evolving “alien technology” which comes with no real manual, and figure out how to operate it. What a time to be alive ✨
twitter.com/karpathy/sta...
5 months ago
1
36
0
reposted by
hardmaru
Sakana AI
5 months ago
Merry Christmas! 🎄 Sakana AIでは、事業開発に関心がある方向けの「カジュアル面談窓口」をオープンしました!金融・防衛・インテリジェンス領域で、私たちがどのような開発に挑んでいるのか。中の人が直接お話しします。 募集職種: エンジニア、Project Manager、Product Manager 内容: 事業戦略、開発の裏側、チームの雰囲気など 最先端のAI開発を社会実装するプロセスに興味がある方、ぜひお気軽にご応募ください! 👉 応募フォームはこちら:
forms.gle/sW5wz23SLSvN...
👉 募集要項:
sakana.ai/careers/
0
3
1
I doubt that anything resembling genuine AGI is within reach of current AI tools—Terence Tao
mathstodon.xyz/@tao/1157223...
5 months ago
3
90
16
“iRobot Corp., the company that revolutionized robot vacuum cleaners in the early 2000s with its Roomba model, filed for bankruptcy and proposed handing over control to its main Chinese supplier.” 😥
www.bloomberg.com/news/article...
loading . . .
Robot Vacuum Roomba Maker Files for Bankruptcy After 35 Years
iRobot Corp., the company that revolutionized robot vacuum cleaners in the early 2000s with its Roomba model, filed for bankruptcy and proposed handing over control to its main Chinese supplier.
https://www.bloomberg.com/news/articles/2025-12-15/robot-vacuum-roomba-maker-files-for-bankruptcy-after-35-years
6 months ago
0
17
10
『日経ビジネス』のインタビュー記事が公開されました。 日本の組織構造は長年の知恵の結晶であり、無理にフラット化すべきではありません。特性の異なるAIを組み合わせ、既存の蓄積を代替するのではなく、人に寄り添う「コンパニオン」であるべきです。組織の強みを活かすAIの在り方を語りました。
add a skeleton here at some point
6 months ago
1
1
1
reposted by
hardmaru
Sakana AI
6 months ago
Sakana AIでは、🐟 Recruiterを募集しています! 先端AIの社会実装をミッションとするApplied Teamの一員として、EngineerやProject Managerなどの採用を担っていただきます。ダイレクトソーシングを軸に、候補者一人ひとりと直接的な関係を築くことで、Sakana AIの成長を加速させる重要な役割です。 JD:
sakana.ai/careers/#rec...
IT/テクノロジー業界でのリクルーティング経験を活かし、Sakana AIのチーム組成をともに担っていただける方のご応募をお待ちしております!
0
1
1
“Why AGI Will Not Happen” by Tim Dettmers.
timdettmers.com/2025/12/10/w...
This essay is worth reading. Discusses diminishing returns (and risks) of scaling. The contrast between West and East: “Winner takes all” approach of building the biggest thing vs a long-term focus on practicality.
6 months ago
4
54
18
Load more
feeds!
log in