Google DeepMind Enhances Gemini API with Advanced Multi-Tool Chaining and Context Circulation Capabilities
Google DeepMind has introduced significant upgrades to its Gemini API, focusing on multi-tool chaining and context circulation. These enhancements aim to empower developers to build more sophisticated AI agents capable of handling complex, multi-step tasks with greater reliability and efficiency. By enabling seamless integration of multiple tools and preserving conversation context across interactions, the updated API addresses key limitations in previous versions, paving the way for more robust agentic applications.
At the core of these improvements is multi-tool chaining, a feature that allows Gemini models to invoke multiple function tools in a single response. Previously, the API supported only single-tool calls per turn, which constrained workflows requiring sequential or parallel tool usage. Now, developers can define multiple tools within a single request, and the model intelligently decides the order and combination of calls needed to fulfill a query. For instance, an AI agent tasked with booking a flight might first query a calendar tool to check availability, then consult a flight search API, and finally interact with a payment processor, all within one cohesive response cycle.
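The flight-booking example above depends on declaring several tools up front so the model can choose and order the calls itself. A minimal sketch of two such function declarations follows; the names and schemas are purely illustrative placeholders, not part of the official API:

```python
# Hypothetical function declarations for a trip-planning agent.
# Names and parameter schemas are illustrative, not official.
check_calendar = {
    "name": "check_calendar",
    "description": "Return free dates in the user's calendar within a range.",
    "parameters": {
        "type": "object",
        "properties": {
            "start_date": {"type": "string", "description": "ISO date, e.g. 2025-06-01"},
            "end_date": {"type": "string", "description": "ISO date"},
        },
        "required": ["start_date", "end_date"],
    },
}

search_flights = {
    "name": "search_flights",
    "description": "Search flights between two cities on a given date.",
    "parameters": {
        "type": "object",
        "properties": {
            "origin": {"type": "string"},
            "destination": {"type": "string"},
            "date": {"type": "string"},
        },
        "required": ["origin", "destination", "date"],
    },
}
```

With both declarations passed in one request, the model can decide to call check_calendar first and feed its result into a search_flights call within the same response cycle.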
This capability is particularly valuable for agentic systems where tasks decompose into interdependent subtasks. DeepMind emphasizes that multi-tool chaining reduces latency and minimizes the need for iterative prompting loops, which often lead to context dilution or errors in long conversations. The API handles tool responses automatically, feeding outputs back into the model’s reasoning process for dynamic decision-making. Developers access this through updated parameters in the Gemini API endpoints, such as the tools array in the generateContent method, where each tool carries a function declaration with a name, description, and parameter schema.
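As a rough illustration of the tools array described here, a generateContent-style request body can be shaped as follows. The contents/tools/functionDeclarations layout mirrors the public Gemini REST API; the specific tool is a made-up placeholder:

```python
# Illustrative generateContent-style request body. The overall layout
# (contents / tools / functionDeclarations) follows the public Gemini
# REST API; the search_flights tool itself is hypothetical.
request_body = {
    "contents": [
        {"role": "user", "parts": [{"text": "Plan a trip to Paris"}]}
    ],
    "tools": [
        {
            "functionDeclarations": [
                {
                    "name": "search_flights",  # hypothetical tool
                    "description": "Search flights between two cities.",
                    "parameters": {
                        "type": "object",
                        "properties": {
                            "origin": {"type": "string"},
                            "destination": {"type": "string"},
                        },
                        "required": ["origin", "destination"],
                    },
                }
            ]
        }
    ],
}
```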
Complementing multi-tool chaining is context circulation, a mechanism designed to maintain full conversation history across tool invocations without truncation. In traditional setups, tool calls could disrupt context flow, forcing developers to manually manage state or risk losing critical prior information. Context circulation ensures that the entire chat history, including tool results, remains intact and available to the model in subsequent turns. This is achieved through an optimized token management system that circulates context efficiently, even in extended interactions.
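One way to picture context circulation is a single history list in which tool results sit alongside user and model turns, so every later turn sees them. The sketch below uses hypothetical helper names and a simplified message shape; role labels for tool results vary across SDK versions, so "user" here is only an assumption:

```python
# Minimal sketch of keeping tool results inside the conversation history
# so the model sees them on subsequent turns. Helper names are invented;
# the role label for tool results varies across SDK versions.
history = []

def add_user_turn(text):
    history.append({"role": "user", "parts": [{"text": text}]})

def add_model_tool_call(name, args):
    # The model's turn requesting a tool invocation.
    history.append({"role": "model", "parts": [{"functionCall": {"name": name, "args": args}}]})

def add_tool_result(name, result):
    # Tool output circulates back into the history as a functionResponse
    # part, rather than being truncated away between turns.
    history.append({"role": "user", "parts": [{"functionResponse": {"name": name, "response": result}}]})

add_user_turn("What's the weather in Paris?")
add_model_tool_call("get_weather", {"city": "Paris"})
add_tool_result("get_weather", {"temp_c": 18})
```

On the next generateContent call the full history, including the tool result, is sent back, which is what keeps the model's reasoning grounded across invocations.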
DeepMind reports substantial performance gains from these features. Internal benchmarks show improved success rates for multi-step tasks, with Gemini 1.5 Pro achieving up to 90 percent reliability in chained tool scenarios compared to 70 percent in prior iterations. The upgrades also improve safety alignment, as the model reasons more carefully over tool outputs before producing a final response, reducing the risk of hallucination. For example, in a simulated research agent workflow, the model chains a web search tool with a summarization function and a fact-checking tool, circulating context to produce accurate, grounded outputs.
Implementation is straightforward for existing Gemini API users. Developers update their SDKs to the latest versions (e.g., Google Generative AI SDK for Python or JavaScript) and specify toolConfig with chaining enabled. A sample code snippet illustrates this:
import google.generativeai as genai

# Select a model that supports function calling.
model = genai.GenerativeModel("gemini-1.5-pro")

# Function declarations defined elsewhere by the developer.
tools = [
    tool1_function_declaration,
    tool2_function_declaration,
]

response = model.generate_content(
    "Plan a trip to Paris",
    tools=tools,
    tool_config={"function_calling_strategy": "CHAIN"},
)
The API supports both automatic and manual tool orchestration modes. In automatic mode, the model selects and chains tools autonomously; manual mode allows developer-specified sequences for precise control. Tool responses are returned as structured JSON within the parts of the response object, facilitating easy parsing and continuation.
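To show how chained calls can be read out of the structured parts just described, here is a hedged sketch that walks a mock response in place of a live API call. The candidates/content/parts layout mirrors the Gemini response shape; the tool names and arguments are invented:

```python
# Mock response standing in for a live API result. The nested
# candidates/content/parts layout mirrors the Gemini response shape;
# the tool names and arguments below are invented for illustration.
mock_response = {
    "candidates": [{
        "content": {
            "role": "model",
            "parts": [
                {"functionCall": {"name": "check_inventory", "args": {"sku": "A1"}}},
                {"functionCall": {"name": "get_price", "args": {"sku": "A1"}}},
            ],
        }
    }]
}

def extract_function_calls(response):
    """Collect every functionCall part, in order, across all candidates."""
    calls = []
    for candidate in response["candidates"]:
        for part in candidate["content"]["parts"]:
            if "functionCall" in part:
                calls.append(part["functionCall"])
    return calls

calls = extract_function_calls(mock_response)
# Each extracted call carries the tool name plus structured JSON arguments,
# ready to dispatch to the matching local function.
```

In manual orchestration mode, a loop like this is where the developer would execute each call and feed the results back as functionResponse parts.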
These updates extend to the Gemini API’s Vertex AI integration on Google Cloud, where enterprise users benefit from scalable deployment options, including serverless endpoints and fine-tuning capabilities. DeepMind has also expanded tool compatibility, supporting REST APIs, custom functions, and third-party integrations like Google Search or Calendar.
Early adopters, including developers building customer support bots and data analysis agents, praise the upgrades for simplifying complex workflows. One highlighted use case involves an e-commerce agent that chains inventory checks, pricing queries, and recommendation engines to deliver personalized shopping experiences.
Looking ahead, DeepMind hints at further evolutions, such as parallel tool execution and advanced state management for even longer contexts. These enhancements position the Gemini API as a frontrunner in the race for production-ready AI agents, competing directly with offerings from OpenAI and Anthropic.
By streamlining multi-tool interactions and ensuring persistent context, Google DeepMind’s Gemini API upgrades represent a leap forward in practical AI development, enabling creators to deploy intelligent systems that handle real-world complexity with finesse.
Gnoppix is the leading open-source AI Linux distribution and service provider. Since implementing AI in 2022, it has offered a fast, powerful, secure, and privacy-respecting open-source OS with both local and remote AI capabilities. The local AI operates offline, ensuring no data ever leaves your computer. Based on Debian Linux, Gnoppix is available with numerous privacy- and anonymity-enabled services free of charge.
What are your thoughts on this? I’d love to hear about your own experiences in the comments below.