Google’s Nested Learning: A Breakthrough in Mitigating Catastrophic Forgetting in Large Language Models
Large language models (LLMs) have revolutionized artificial intelligence, powering applications from chatbots to code generation. However, a persistent challenge in their development is catastrophic forgetting, a phenomenon where models lose previously acquired knowledge when trained on new data. This issue hampers the ability to create adaptable, continually learning systems. Researchers at Google DeepMind have introduced Nested Learning, a novel technique designed to address this problem by enabling LLMs to retain core capabilities while acquiring new skills. Published in a recent paper, this approach promises to advance continual learning in AI, making models more robust for real-world deployment.
Catastrophic forgetting occurs because neural networks, including LLMs, optimize weights during training to minimize errors on current tasks. When exposed to new datasets, these weights shift to accommodate the incoming information, often overwriting representations learned for prior tasks. For instance, an LLM fine-tuned on medical texts might excel at diagnosing conditions but struggle with its original language understanding if subsequently trained on legal documents. Traditional mitigation strategies, such as replay buffers that store and revisit old data or regularization techniques that constrain weight changes, have limitations: replay methods are memory-intensive and can raise privacy concerns, while regularization can stifle learning on new tasks. Nested Learning offers a fresh paradigm by leveraging a modular, hierarchical architecture.
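The weight-overwriting dynamic can be seen in even the smallest possible model. The sketch below is purely illustrative (none of the names or numbers come from the paper): a single shared weight is trained on one regression task and then on a conflicting one, and the first task's error, having collapsed to near zero, blows back up.

```python
# Minimal illustration of catastrophic forgetting: one shared weight,
# two conflicting regression tasks trained one after the other.
# All names and numbers here are illustrative, not from the paper.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=64)

def sgd(w, x, y, steps=200, lr=0.1):
    """Plain gradient descent on mean-squared error for y ~ w * x."""
    for _ in range(steps):
        w -= lr * 2 * np.mean((w * x - y) * x)
    return w

y_a, y_b = 2 * x, -x                         # task A: y = 2x;  task B: y = -x
w = sgd(0.0, x, y_a)
err_a_before = np.mean((w * x - y_a) ** 2)   # near zero: task A is learned
w = sgd(w, x, y_b)                           # now fine-tune on task B
err_a_after = np.mean((w * x - y_a) ** 2)    # task A error explodes
```

Because both tasks share the same parameter, fitting task B drags `w` from roughly 2 to roughly -1, destroying task A. Nested Learning sidesteps exactly this by never updating the weights that encode earlier tasks.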
At its core, Nested Learning employs a nested structure of neural networks, where each new task builds upon the previous ones without disrupting them. The process begins with an inner network trained on the first task. For subsequent tasks, an outer network is added, which wraps around the inner one. The outer network learns the new task while keeping the inner network’s parameters frozen, ensuring the original knowledge remains intact. Outputs from the inner network feed into the outer one, allowing the model to combine capabilities hierarchically. This nesting can continue for multiple tasks, creating a stack of specialized layers that preserve historical learning.
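The wrapping scheme described above can be sketched in PyTorch. The class and argument names below are illustrative, and the exact way inner outputs are combined with the input (here, concatenation) is an assumption on my part rather than a detail confirmed by the paper.

```python
# Sketch of the nested structure: a trainable outer network wrapped around
# a frozen inner network. Names and the concatenation scheme are assumptions.
import torch
import torch.nn as nn

class NestedModel(nn.Module):
    """Outer network trained on a new task; inner network kept frozen."""

    def __init__(self, inner: nn.Module, in_dim: int, inner_dim: int,
                 hidden_dim: int, out_dim: int):
        super().__init__()
        self.inner = inner
        for p in self.inner.parameters():    # freeze: task-1 knowledge stays intact
            p.requires_grad = False
        self.outer = nn.Sequential(          # only these weights train on the new task
            nn.Linear(in_dim + inner_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, out_dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        with torch.no_grad():                # inner outputs feed the outer network
            h = self.inner(x)
        return self.outer(torch.cat([x, h], dim=-1))

# Nesting can continue: a trained NestedModel becomes the next inner network.
inner = nn.Sequential(nn.Linear(8, 4), nn.Tanh())   # stand-in for the task-1 model
model = NestedModel(inner, in_dim=8, inner_dim=4, hidden_dim=16, out_dim=2)
```

Since only `model.outer` carries `requires_grad=True`, an optimizer built from `model.parameters()` can move nothing but the outer weights, which is what keeps the original capability intact.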
The Google DeepMind team demonstrated Nested Learning on several benchmarks, focusing on LLMs like those based on transformer architectures. In experiments with the GLUE benchmark—a suite of natural language understanding tasks—the method showed remarkable retention. A model trained sequentially on tasks like sentiment analysis and question answering typically sees performance on the first task drop by 20-30% after the second. With Nested Learning, retention improved to over 90%, with only marginal dips. The approach scaled effectively to larger models; tests on a 1.5-billion-parameter LLM mirroring GPT-scale systems revealed that nested architectures added computational overhead of just 10-15% during inference, compared to full retraining costs that could be orders of magnitude higher.
One key innovation is the use of task-specific adapters within the nested framework. These lightweight modules, inserted between layers, fine-tune only for the new task, minimizing interference. The paper details how adapters are optimized using a loss function that balances new task accuracy with a preservation term, penalizing deviations from the inner network’s outputs. This dual-objective training, inspired by elastic weight consolidation but extended to nested setups, ensures stability. For multi-task scenarios, the team explored continual learning on domains like code generation and translation, where Nested Learning outperformed baselines such as progressive neural networks by 15-25% in average task performance.
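The adapter and the dual-objective loss can be sketched as follows. The bottleneck design, the residual connection, and the weighting factor `lam` are standard conventions from the adapter literature that I am assuming here; the paper's exact formulation may differ.

```python
# Hedged sketch of a task-specific adapter and the dual-objective loss.
# Bottleneck width, residual form, and `lam` are assumed, not from the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Adapter(nn.Module):
    """Lightweight bottleneck module inserted between layers; only its
    parameters are updated for the new task."""

    def __init__(self, dim: int, bottleneck: int = 8):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # Residual connection: an untrained adapter barely perturbs the base model.
        return h + self.up(torch.relu(self.down(h)))

def nested_loss(logits, targets, adapted_out, frozen_out, lam=0.1):
    """New-task accuracy term plus a preservation term penalizing
    deviation from the frozen inner network's outputs."""
    task_term = F.cross_entropy(logits, targets)
    preserve_term = F.mse_loss(adapted_out, frozen_out.detach())
    return task_term + lam * preserve_term
```

When the adapted outputs match the frozen ones, the preservation term vanishes and the loss reduces to the ordinary task loss; the further the adapter drifts, the larger the penalty, which is the stability mechanism the paragraph above describes.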
Beyond empirical results, Nested Learning addresses theoretical aspects of forgetting. The paper analyzes how nesting creates a “curriculum” of learning, where earlier tasks form a stable foundation. This mirrors human cognition, where foundational skills like basic grammar underpin advanced language use. The authors prove that under certain conditions—such as linear separability of tasks—the nested model converges to optimal solutions without forgetting, providing a mathematical foundation for its efficacy. Challenges remain, including scalability for dozens of tasks, where the deepening nest could increase latency. The researchers mitigate this with pruning techniques, selectively removing redundant connections in outer layers post-training.
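The paper's pruning step can be realized in more than one way; magnitude-based (L1) pruning, as shipped in `torch.nn.utils.prune`, is one standard choice. The layer below and the 50% pruning amount are illustrative assumptions, not values from the paper.

```python
# One standard realization of post-training pruning on an outer layer:
# magnitude (L1) pruning. The layer and the 50% amount are assumptions.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

torch.manual_seed(0)
outer_layer = nn.Linear(16, 4)          # stand-in for one outer-network layer
prune.l1_unstructured(outer_layer, name="weight", amount=0.5)  # zero smallest 50%
prune.remove(outer_layer, "weight")     # bake the mask into the weight tensor
sparsity = (outer_layer.weight == 0).float().mean().item()
```

Sparser outer layers are one way to claw back the inference latency that a deepening nest would otherwise accumulate.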
In practical terms, Nested Learning has implications for deploying LLMs in dynamic environments. Imagine a virtual assistant that starts with general conversation skills and later incorporates domain expertise in finance or healthcare. Without forgetting safeguards, updates could erode user trust. By enabling modular updates, this method supports lifelong learning, reducing the need for resource-heavy full retrains. Google DeepMind’s implementation, detailed in open-source code snippets accompanying the paper, uses PyTorch and integrates seamlessly with Hugging Face transformers, inviting broader experimentation.
The technique also intersects with federated learning, where models update across distributed devices. Nested structures could preserve user-specific knowledge layers, enhancing privacy. While the paper focuses on supervised tasks, extensions to unsupervised or reinforcement learning are hinted at, suggesting versatility. Compared to alternatives like memory-augmented networks, Nested Learning stands out for its simplicity—no external memory banks required—and efficiency in parameter space.
As AI systems grow more integrated into daily life, combating catastrophic forgetting is crucial for reliable evolution. Google’s Nested Learning represents a significant step, blending architectural ingenuity with rigorous evaluation to foster more resilient LLMs. This work not only tackles a core limitation but also paves the way for AI that learns incrementally, much like humans, without losing its past.
What are your thoughts on this? I’d love to hear about your own experiences in the comments below.