SearchSwarm addresses a fundamental challenge in artificial intelligence: how to enable large language models (LLMs) to perform complex, long-horizon research tasks without exceeding their finite memory, or "context window." When an AI attempts to solve a massive, multi-step problem, it often runs out of space to track its own reasoning. This paper introduces "delegation intelligence," a method where a main agent acts as a project manager, breaking down large tasks into smaller, manageable pieces and assigning them to subagents. By receiving only summarized reports back from these subagents, the main agent can maintain focus on the overall goal without getting overwhelmed by excessive data.
The Power of Delegation
The core of the SearchSwarm approach is a specialized "harness"—a set of instructions and tools that guides the model on how to manage its workload. Instead of trying to do everything at once, the main agent uses a tool called call_sub_agent to dispatch specific research tasks. The harness ensures the main agent provides a "comprehensive brief" to the subagent, explaining not just what to do, but why it matters and what has already been discovered. This prevents the subagent from wasting time on redundant searches. Once the subagent finishes, it returns a condensed report with verified citations, allowing the main agent to synthesize the final answer while keeping its own memory clear for high-level decision-making.
Training for Intelligence
The researchers found that simply giving a model a delegation tool is not enough; the model must be trained to understand when and how to delegate. To achieve this, they created a dataset of high-quality "delegation trajectories." By using their harness to guide models through research tasks, they recorded successful examples of task decomposition and result integration. They then used this data to fine-tune the model, effectively teaching it to internalize the logic of a project manager. This training process ensures the model learns to prioritize its own limited "attention" for complex reasoning while offloading repetitive data gathering to subagents.
Performance and Impact
The resulting model, SearchSwarm-30B-A3B, demonstrates that a smaller, more efficient model can compete with much larger systems when equipped with the right delegation strategy. It achieved top-tier results on several research-heavy benchmarks, such as BrowseComp and GAIA, outperforming other models of a similar size. The researchers noted that without this specific training, models often fail to use delegation tools effectively, even when they are available. By open-sourcing their harness, training data, and model weights, the team aims to provide a foundation for future research into how AI agents can better coordinate to solve increasingly complex, real-world problems.
Comments (0)
to join the discussion
No comments yet
Be the first to share your thoughts!