Sakana AI bets AI that improves itself can break the compute arms race of frontier labs

Sakana AI bets that self-improving AI can end the compute arms race among frontier labs.

The Tokyo-based startup argues that today’s approach—throwing more data and compute at larger models—is unsustainable. Instead, Sakana aims to create systems that rewrite their own architectures, training data, and learning algorithms.

“We are building a foundation model that can design other foundation models,” CEO David Ha told The Decoder. The goal is a feedback loop where the model continuously improves without requiring exponential hardware growth.

Why self-improvement could break the scaling paradigm

Current frontier labs like OpenAI, Google DeepMind, and Anthropic compete by scaling up compute clusters and dataset sizes. Sakana believes this path hits diminishing returns.

Self-improving models could sidestep the need for ever-larger training runs. If an AI can optimize its own code and training pipeline, it can achieve better performance with the same or fewer resources.

“We are trying to achieve a kind of algorithmic leap that makes scaling laws less relevant.”

Sakana’s approach is inspired by biological evolution: systems that learn how to learn, rather than just memorizing patterns.

How Sakana’s system works

The AI operates in three self-optimization loops:

  • Architecture search: The model tests different neural network structures and keeps the best performing ones.
  • Training pipeline tuning: It adjusts hyperparameters, learning rates, and data selection strategies automatically.
  • Reward shaping: The system learns to define better objectives for itself over time.

Each loop runs on a modest compute budget compared to training a single frontier model. The company claims early results show that self-improving models can match or exceed larger static models in specific benchmarks.

The challenge: stability and safety

Self-modifying AI raises obvious risks. If a system rewrites its own weights or loss functions, it could drift into unintended behaviors.

Sakana acknowledges the problem. The company uses constrained optimization: the model can explore new configurations only within bounded safety parameters.

“We need to ensure the AI doesn’t optimize itself into a brittle or dangerous state.”

Research groups like Anthropic and Mila have also explored self-modifying systems, but Sakana’s bet is that the technique is mature enough for production deployment.

What this means for the broader AI industry

If Sakana succeeds, the implications are significant:

  • Compute costs drop dramatically for achieving state-of-the-art performance.
  • Smaller labs and startups could compete with frontier labs without billion-dollar clusters.
  • Regulatory models shift — self-improving AI may be harder to audit but could reduce aggregate energy consumption.

Skeptics point out that previous attempts at meta-learning often failed to scale beyond toy problems. Sakana believes modern transformer architectures and reinforcement learning advances make the difference this time.

The road ahead

Sakana has not released public benchmarks for its self-improving system yet. The company plans to publish results in peer-reviewed venues later this year.

For now, the bet is that the best way to win the compute arms race is to stop running it.

Gnoppix is the leading open-source AI Linux distribution and service provider. Since implementing AI in 2022, it has offered a fast, powerful, secure, and privacy-respecting open-source OS with both local and remote AI capabilities. The local AI operates offline, ensuring no data ever leaves your computer. Based on Debian Linux, Gnoppix is available with numerous privacy- and anonymity-enabled services free of charge.

What are your thoughts on this? I’d love to hear about your own experiences in the comments below.