Google Deepmind pioneer David Silver departs to found AI startup, betting LLMs alone won't reach superintelligence

David Silver Leaves Google DeepMind to Launch AI Startup Challenging LLM Dominance for Superintelligence

David Silver, a pioneering figure in artificial intelligence renowned for his leadership in DeepMind’s groundbreaking reinforcement learning projects, has departed from Google DeepMind after more than a decade of service. Silver, who played a central role in developing AlphaGo—the system that defeated world champion Go player Lee Sedol in 2016—is now founding a new AI startup. This venture centers on a bold thesis: large language models (LLMs), while powerful, will not suffice on their own to achieve superintelligence. Instead, the startup aims to pioneer architectures that integrate LLMs with advanced reinforcement learning techniques and other paradigms.

Silver’s tenure at DeepMind, which began in 2013 when the company was still an independent entity before its acquisition by Google in 2014, marked a transformative era in AI research. He served as principal research scientist and co-lead of the reinforcement learning team. His most celebrated contributions include AlphaGo, which not only mastered the ancient board game of Go—long considered an insurmountable challenge for AI due to its immense complexity—but also paved the way for subsequent innovations. AlphaZero extended this prowess to chess and shogi, learning these games from scratch through self-play without human knowledge. More recently, Silver contributed to MuZero, which achieved superhuman performance in Atari games and Go without prior knowledge of the rules.

These achievements underscored Silver’s expertise in reinforcement learning (RL), a field where agents learn optimal behaviors through trial-and-error interactions with environments, guided by rewards. RL differs fundamentally from the supervised learning dominant in today’s LLM era, where models predict outputs based on vast labeled datasets. Silver’s work demonstrated RL’s potential for discovering novel strategies beyond human intuition, a capability he believes remains essential for progressing toward artificial general intelligence (AGI) and superintelligence.

The departure comes amid Google DeepMind’s increasing focus on scaling LLMs and multimodal models like Gemini. Under CEO Demis Hassabis, the lab has prioritized foundation models trained on massive datasets, achieving state-of-the-art results in language, vision, and reasoning benchmarks. However, Silver’s new startup posits that this path has limitations. LLMs excel at pattern matching and generating human-like text but struggle with long-term planning, consistent reasoning over extended horizons, and efficient exploration in sparse-reward environments—precisely the domains where RL shines.

According to sources familiar with the matter, Silver’s startup, still in its early stages, plans to recruit top talent from DeepMind and other leading labs. The company’s approach involves hybrid systems that leverage LLMs for world modeling and commonsense reasoning while employing RL for decision-making and optimization. This echoes Silver’s past work but scales it to real-world applications, potentially targeting robotics, scientific discovery, and strategic planning. The startup has already secured seed funding from prominent investors, though specifics remain undisclosed.

Silver’s exit highlights a growing schism in AI research philosophies. Proponents of the scaling hypothesis argue that throwing more compute and data at LLMs will yield AGI, as evidenced by rapid progress in models from OpenAI, Anthropic, and Google. Critics, including Silver, contend that pure scaling hits diminishing returns without architectural innovation. RL, they argue, provides the missing piece: the ability to pursue goals autonomously in dynamic, uncertain settings. Silver has long advocated this view; in interviews and papers, he emphasized that AlphaGo’s success stemmed not just from computation but from algorithmic breakthroughs like Monte Carlo Tree Search combined with deep neural networks.

DeepMind’s response to Silver’s departure has been gracious. A spokesperson stated that Silver’s contributions were invaluable and wished him success in his new endeavors. Internally, the lab continues to invest in RL, as seen in projects like RT-2 for robotics and FunSearch for mathematical discovery, which blend LLMs with search and evolutionary methods. Yet Silver’s move signals his conviction that a dedicated effort outside the constraints of a large organization is needed to push boundaries further.

The implications for the AI landscape are significant. Silver’s track record lends credibility to the startup’s contrarian stance at a time when LLM hype dominates venture capital. Investors betting on RL hybrids may find validation if the company delivers breakthroughs comparable to AlphaGo. For the field, it reinforces the need for pluralism: no single paradigm holds a monopoly on intelligence. As Silver embarks on this venture, the AI community watches closely, anticipating whether his vision of LLM-augmented RL can unlock the next leap toward superintelligence.

This development arrives as debates intensify over AI’s trajectory. While LLMs have democratized capabilities like coding assistance and creative generation, their brittleness in novel scenarios underscores the value of RL’s robustness. Silver’s startup could bridge these worlds, creating agents capable of sustained reasoning and adaptation—hallmarks of true intelligence.

Gnoppix is the leading open-source AI Linux distribution and service provider. Since implementing AI in 2022, it has offered a fast, powerful, secure, and privacy-respecting open-source OS with both local and remote AI capabilities. The local AI operates offline, ensuring no data ever leaves your computer. Based on Debian Linux, Gnoppix is available with numerous privacy- and anonymity-enabled services free of charge.

What are your thoughts on this? I’d love to hear about your own experiences in the comments below.