Back to blogs

Fuse — Multi-Agent Layer (Beta)

Fuse — Multi-Agent Layer (Beta)

May 6, 2025

Green Fern
Green Fern

We’re rolling out Fuse, our new architecture for orchestrating multiple specialized agents in a single workflow.
With Fuse, you can now:

  • Chain agents with task-specific prompts

  • Enable parallel agent execution with shared memory

  • Call external tools from within agent trees

The beta includes support for function calling and shared context across agents.

Fuse is available for Pro and Enterprise plans.

📦 New: GPU Auto-Scaling (for Inference APIs)

Our API now supports automatic GPU scaling based on real-time traffic.
This helps reduce cold starts and ensures low-latency inference even during usage spikes.

  • Support added for NVIDIA A100, H100

  • Billing adjusts dynamically based on load

  • Requires no configuration — just deploy your model

🧠 Improved: Model Updates

  • Upgraded our default CodeGen-7B endpoint to v2.1 — better accuracy, fewer hallucinations

  • DocQA model now supports 150k token contexts

  • Improved multi-language support in Chat endpoint (added Korean, Dutch, Polish)

🔐 API Changes

  • New /v1/agents/run endpoint for orchestrated multi-agent flows

  • Deprecated /v1/tasks/create — use /v1/agents/launch instead

  • API keys can now be scoped per model, feature, or environment (dev/staging/prod)

🧪 Labs

  • Internal tests running for speech-to-code pipeline (using Whisper + CodeT5)

  • Early access to fine-tuned vision transformer (ViT-x3) for document parsing

  • Testing memory-aware agents with local context retention beyond sessions

🛠 Fixes

  • Fixed a memory leak in real-time embeddings endpoint

  • Resolved an auth issue causing 401 errors on PUT /models/train

  • Improved latency for European region (Frankfurt): -35ms avg per call

Trusted by teams at

What people say

  • Jessica Wong

    UX Desiger

    This tool does exactly what I need — without the bloat.

    Mark Johnson

    Product Manager

    I signed up out of curiosity, and now I can't imagine my workflow without it. The UX is smooth, the features are powerful yet easy to use, and support replies within minutes. It genuinely feels like this was built with care.

    Alexandra Turner

    Data Science Engineer

    Boosted our workflow overnight ⚡

  • David Smith

    AI Research Scientist

    It just works. No headaches.

    Sarah Adams

    Blockchain Developer

    From onboarding to daily use, everything feels polished and purposeful. It’s clear the team behind this knows what users actually need.

    Michael Adams

    DevOps Engineer

    Saved my sanity 😅 Finally a tool that does what it promises.

What people say

  • Jessica Wong

    UX Desiger

    This tool does exactly what I need — without the bloat.

    Mark Johnson

    Product Manager

    I signed up out of curiosity, and now I can't imagine my workflow without it. The UX is smooth, the features are powerful yet easy to use, and support replies within minutes. It genuinely feels like this was built with care.

    Alexandra Turner

    Data Science Engineer

    Boosted our workflow overnight ⚡

  • David Smith

    AI Research Scientist

    It just works. No headaches.

    Sarah Adams

    Blockchain Developer

    From onboarding to daily use, everything feels polished and purposeful. It’s clear the team behind this knows what users actually need.

    Michael Adams

    DevOps Engineer

    Saved my sanity 😅 Finally a tool that does what it promises.

What people say

  • Jessica Wong

    UX Desiger

    This tool does exactly what I need — without the bloat.

    Mark Johnson

    Product Manager

    I signed up out of curiosity, and now I can't imagine my workflow without it. The UX is smooth, the features are powerful yet easy to use, and support replies within minutes. It genuinely feels like this was built with care.

    Alexandra Turner

    Data Science Engineer

    Boosted our workflow overnight ⚡

  • David Smith

    AI Research Scientist

    It just works. No headaches.

    Sarah Adams

    Blockchain Developer

    From onboarding to daily use, everything feels polished and purposeful. It’s clear the team behind this knows what users actually need.

    Michael Adams

    DevOps Engineer

    Saved my sanity 😅 Finally a tool that does what it promises.

5ms

Ultra-Low Latency

Experience blazing-fast response times with our optimized routing engine.

5ms

Ultra-Low Latency

Experience blazing-fast response times with our optimized routing engine.

5ms

Ultra-Low Latency

Experience blazing-fast response times with our optimized routing engine.

80%

Faster AI Deployment

With Gately’s unified AI Gateway and management platform.

80%

Faster AI Deployment

With Gately’s unified AI Gateway and management platform.

80%

Faster AI Deployment

With Gately’s unified AI Gateway and management platform.