Agent Bridge

Note

The agent_bridge() and sandbox_agent_bridge() functions described below are available only in the development version of Inspect. To install the development version from GitHub:

pip install git+https://github.com/UKGovernmentBEIS/inspect_ai

Note that a previous (and now deprecated) variation of the agent bridge is available in all versions of Inspect via the bridge() function.

Overview

While Inspect provides facilities for native agent development, you can also easily integrate agents created with third-party frameworks like LangChain, or use fully custom agents you have developed or ported from a research paper. You can also use CLI-based agents that run within sandboxes (e.g. Codex CLI).

Agents are bridged into Inspect such that their native model calling functions are routed through the current Inspect model provider. There are two types of agent bridges supported:

  1. Bridging to Python-based agents that run in the same process as Inspect via the agent_bridge() context manager.

  2. Bridging to agents that run in a sandbox via the sandbox_agent_bridge() context manager (these agents can be written in any language).

We’ll cover each of these configurations in turn below.

Agent Bridge

To bridge a Python based agent running in the same process as Inspect:

  1. Write your custom Python agent as normal using the OpenAI connector provided by your agent system, specifying “inspect” as the model name. Note that both the Completions API and Responses API are supported.

  2. Run your custom Python agent within the agent_bridge() context manager which redirects OpenAI calls to the current Inspect model provider.

For example, here we build an agent that uses the OpenAI SDK directly (imagine using your favourite agent framework in its place):

from openai import AsyncOpenAI
from inspect_ai.agent import (
    Agent, AgentState, agent, agent_bridge
)
from inspect_ai.model import messages_to_openai

@agent
def my_agent() -> Agent:
    async def execute(state: AgentState) -> AgentState:
        async with agent_bridge(state) as bridge:
            client = AsyncOpenAI()
            
            await client.chat.completions.create(
                model="inspect",
                messages=messages_to_openai(state.messages),
            )

            return bridge.state

    return execute
Key aspects of this example:

  1. Use the agent_bridge() context manager to redirect the OpenAI API to the Inspect model provider. Pass the state so that the bridge can automatically keep track of changes to messages and output based on model calls passing through the bridge.

  2. Use the OpenAI API with model="inspect", which enables Inspect to intercept the request and send it to the Inspect model being evaluated for the task.

  3. Convert the state.messages input into native OpenAI messages using the messages_to_openai() function.
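
Agents defined this way can be used anywhere a conventional Inspect agent can. For example, here is a minimal sketch of running my_agent() as the solver for a task (the task name and sample are illustrative):

from inspect_ai import Task, task
from inspect_ai.dataset import Sample

@task
def my_task() -> Task:
    return Task(
        # single illustrative sample; substitute your own dataset
        dataset=[Sample(input="What is the capital of France?")],
        solver=my_agent(),
    )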

The following examples further demonstrate how to integrate other agent frameworks with Inspect:

LangChain: Demonstrates using a native LangChain agent with Inspect to perform Q/A using the Tavily Search API.

Codex CLI: Demonstrates using the Codex CLI agent with Inspect to explore a Kali Linux system.

Sandbox Bridge

To bridge an agent running within a sandbox into Inspect:

  1. Configure your sandbox (e.g. via its Dockerfile) to contain the agent that you want to run. The agent should be configured to talk to the OpenAI API on localhost port 13131 (e.g. OPENAI_BASE_URL=http://localhost:13131/v1). Note that both the Completions API and Responses API are supported.

  2. Write a standard Inspect agent that uses the sandbox_agent_bridge() context manager and the sandbox().exec() method to invoke the custom agent.

The sandbox bridge works by running a proxy server inside the sandbox container which receives OpenAI API requests. This proxy server in turn relays requests to the current Inspect model provider.

For example, here we build an agent that runs a custom agent binary (passing it input on the command line and reading output from stdout):

from inspect_ai.agent import (
    Agent, AgentState, agent, sandbox_agent_bridge
)
from inspect_ai.model import user_prompt
from inspect_ai.util import sandbox

@agent
def my_agent() -> Agent:
    async def execute(state: AgentState) -> AgentState:
        async with sandbox_agent_bridge(state) as bridge:
            
            prompt = user_prompt(state.messages)
            
            result = await sandbox().exec(
                cmd=[
                    "/opt/my_agent",
                    "--prompt",
                    prompt.text
                ],
                env={"OPENAI_BASE_URL": f"http://localhost:{bridge.port}/v1"}
            )
            if not result.success:
                raise RuntimeError(f"Agent error: {result.stderr}")

            return bridge.state

    return execute
Key aspects of this example:

  1. Use the sandbox_agent_bridge() context manager to redirect the OpenAI API to the Inspect model provider. Pass the state so that the bridge can automatically keep track of changes to messages and output based on model calls passing through the bridge.

  2. Extract the last user message from the message history with user_prompt().

  3. Run the agent, using a CLI argument for input and stdout for output (other agents may use more sophisticated encoding schemes for messages in and out).

  4. Redirect the OpenAI API to talk to a proxy server that communicates back to the current Inspect model provider. Note that we read the port to listen on from the bridge yielded by the context manager.

The Codex CLI example provides a more in-depth demonstration of running custom agents in sandboxes.

Models

As demonstrated above, communication with Inspect models is done by using the OpenAI API with model="inspect". You can use the same technique to interface with other Inspect models. To do this, prefix the fully qualified model name with "inspect/".

For example, in a LangChain agent, you would do this to utilise the Inspect interface to Gemini:

from langchain_openai import ChatOpenAI

model = ChatOpenAI(model="inspect/google/gemini-1.5-pro")
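
The same prefix works when calling the OpenAI SDK directly from within an agent_bridge(). For example, a minimal sketch (the model name and prompt are illustrative):

from openai import AsyncOpenAI

client = AsyncOpenAI()

completion = await client.chat.completions.create(
    model="inspect/anthropic/claude-3-5-haiku-latest",  # illustrative model name
    messages=[{"role": "user", "content": "Say hello."}],
)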

Transcript

Custom agents run through a bridge still get most of the benefit of the Inspect transcript and log viewer. All model calls are captured and produce the same transcript output as when using conventional agents.

If you want to use additional features of Inspect transcripts (e.g. spans, markdown output, etc.) you can still import and use the transcript() function as normal. For example:

from inspect_ai.log import transcript

transcript().info("custom *markdown* content")
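
Recent versions of Inspect also support grouping transcript events into spans. A minimal sketch, assuming the span() context manager exported from inspect_ai.util (the span name is illustrative):

from inspect_ai.log import transcript
from inspect_ai.util import span

async def planning_phase() -> None:
    # events recorded inside the context manager are grouped
    # under a "planning" span in the transcript (assumed API)
    async with span("planning"):
        transcript().info("starting planning phase")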