inspect_ai.agent
The inspect_ai.agent module is available only in the development version of Inspect. To install the development version from GitHub:
pip install git+https://github.com/UKGovernmentBEIS/inspect_ai
Agents
react
Extensible ReAct agent based on the paper ReAct: Synergizing Reasoning and Acting in Language Models.
Provide a name
and description
for the agent if you plan on using it in a multi-agent system (this is so other agents can clearly identify its name and purpose). These fields are not required when using react() as a top-level solver.
The agent runs a tool use loop until the model submits an answer using the submit()
tool. Use instructions
to tailor the agent’s system message (the default instructions
provides a basic ReAct prompt).
Use the attempts
option to enable additional submissions if the initial submission(s) are incorrect (by default, no additional attempts are permitted).
By default, the model will be urged to continue if it fails to call a tool. Customise this behavior using the on_continue
option.
@agent
def react(
*,
str | None = None,
name: str | None = None,
description: str | AgentPrompt | None = AgentPrompt(),
prompt: list[Tool] | None = None,
tools: str | Model | Agent | None = None,
model: int | AgentAttempts = 1,
attempts: = AgentSubmit(),
submit: AgentSubmit str | AgentContinue | None = None,
on_continue: -> Agent )
name
str | None-
Agent name (required when using with handoff() or as_tool())
description
str | None-
Agent description (required when using with handoff() or as_tool())
prompt
str | AgentPrompt | None-
Prompt for agent. Includes agent-specific contextual
instructions
as well as an optionalassistant_prompt
andhandoff_prompt
(for agents that use handoffs). both are provided by default but can be removed or customized). Passstr
to specify the instructions and use the defaults for handoff and prompt messages. tools
list[Tool] | None-
Tools available for the agent.
model
str | Model | Agent | None-
Model to use for agent (defaults to currently evaluated model).
attempts
int | AgentAttempts-
Configure agent to make multiple attempts.
submit
AgentSubmit-
Configure submit tool used by agent.
on_continue
str | AgentContinue | None-
Message to play back to the model to urge it to continue. Optionally, can also be an async function to call to determine whether the loop should continue (executed on every turn) and what message to play back.
bridge
Bridge an external agent into an Inspect Agent.
See documentation at https://inspect.aisi.org.uk/agent-bridge.html
@agent
def bridge(agent: Callable[[dict[str, Any]], Awaitable[dict[str, Any]]]) -> Agent
agent
Callable[[dict[str, Any]], Awaitable[dict[str, Any]]]-
Callable which takes a sample
dict
and returns a resultdict
.
human
Human agent for agentic tasks that run in a Linux environment.
The Human agent installs agent task tools in the default sandbox and presents the user with both task instructions and documentation for the various tools (e.g. task submit
, task start
, task stop
task instructions
, etc.). A human agent panel is displayed with instructions for logging in to the sandbox.
If the user is running in VS Code with the Inspect extension, they will also be presented with links to login to the sandbox using a VS Code Window or Terminal.
@agent
def human(
bool | str = True,
answer: bool = False,
intermediate_scoring: bool = True,
record_session: -> Agent )
answer
bool | str-
Is an explicit answer required for this task or is it scored based on files in the container? Pass a
str
with a regex to validate that the answer matches the expected format. intermediate_scoring
bool-
Allow the human agent to check their score while working.
record_session
bool-
Record all user commands and outputs in the sandbox bash session.
Execution
handoff
Create a tool that enables models to handoff to agents.
def handoff(
agent: Agent,str | None = None,
description: | None = None,
input_filter: MessageFilter | None = None,
output_filter: MessageFilter str | None = None,
tool_name: **agent_kwargs: Any,
-> Tool )
agent
Agent-
Agent to hand off to.
description
str | None-
Handoff tool description (defaults to agent description)
input_filter
MessageFilter | None-
Filter to modify the message history before calling the tool. Use the built-in
remove_tools
filter to remove all tool calls or alternatively specify a custom MessageFilter function. output_filter
MessageFilter | None-
Filter to modify the message history after calling the tool. Use the built-in
last_message
filter to return only the last message or alternatively specify a custom MessageFilter function. tool_name
str | None-
Alternate tool name (defaults to
transfer_to_{agent_name}
) **agent_kwargs
Any-
Arguments to curry to Agent function (arguments provided here will not be presented to the model as part of the tool interface).
run
Run an agent.
The input messages(s) will be copied prior to running so are not modified in place.
async def run(
input: str | list[ChatMessage] | AgentState, **agent_kwargs: Any
agent: Agent, -> AgentState )
agent
Agent-
Agent to run.
input
str | list[ChatMessage] | AgentState-
Agent input (string, list of messages, or an AgentState).
**agent_kwargs
Any-
Additional arguments to pass to agent.
as_tool
Convert an agent to a tool.
By default the model will see all of the agent’s arguments as tool arguments (save for state
which is converted to an input
arguments of type str
). Provide optional agent_kwargs
to mask out agent parameters with default values (these parameters will not be presented to the model as part of the tool interface)
@tool
def as_tool(agent: Agent, description: str | None = None, **agent_kwargs: Any) -> Tool
agent
Agent-
Agent to convert.
description
str | None-
Tool description (defaults to agent description)
**agent_kwargs
Any-
Arguments to curry to Agent function (arguments provided here will not be presented to the model as part of the tool interface).
as_solver
Convert an agent to a solver.
Note that agents used as solvers will only receive their first parameter (state
). Any other parameters must provide appropriate defaults or be explicitly specified in agent_kwargs
def as_solver(agent: Agent, **agent_kwargs: Any) -> Solver
agent
Agent-
Agent to convert.
**agent_kwargs
Any-
Arguments to curry to Agent function (required if the agent has parameters without default values).
Filters
remove_tools
Remove tool calls from messages.
Removes all instances of ChatMessageTool as well as the tool_calls
field from ChatMessageAssistant.
async def remove_tools(messages: list[ChatMessage]) -> list[ChatMessage]
messages
list[ChatMessage]-
Messages to remove tool calls from.
last_message
Remove all but the last message.
async def last_message(messages: list[ChatMessage]) -> list[ChatMessage]
messages
list[ChatMessage]-
Target messages.
MessageFilter
Filter messages sent to or received from agent handoffs.
= Callable[[list[ChatMessage]], Awaitable[list[ChatMessage]]] MessageFilter
Protocol
Agent
Agents perform tasks and participate in conversations.
Agents are similar to tools however they are participants in conversation history and can optionally append messages and model output to the current conversation state.
You can give the model a tool that enables handoff to your agent using the handoff() function.
You can create a simple tool (that receives a string as input) from an agent using as_tool().
class Agent(Protocol):
async def __call__(
self,
state: AgentState,*args: Any,
**kwargs: Any,
-> AgentState )
state
AgentState-
Agent state (conversation history and last model output)
*args
Any-
Arguments for the agent.
**kwargs
Any-
Keyword arguments for the agent.
AgentState
Agent state.
class AgentState
Attributes
messages
list[ChatMessage]-
Conversation history.
output
ModelOutput-
Model output.
agent
Decorator for registering agents.
def agent(
| None = None,
func: Callable[P, Agent] *,
str | None = None,
name: str | None = None,
description: -> Callable[P, Agent] | Callable[[Callable[P, Agent]], Callable[P, Agent]] )
func
Callable[P, Agent] | None-
Agent function
name
str | None-
Optional name for agent. If the decorator has no name argument then the name of the agent creation function will be used as the name of the agent.
description
str | None-
Description for the agent when used as an ordinary tool or handoff tool.
agent_with
Agent with modifications to name and/or description
This function modifies the passed agent in place and returns it. If you want to create multiple variations of a single agent using agent_with() you should create the underlying agent multiple times.
def agent_with(
agent: Agent,*,
str | None = None,
name: str | None = None,
description: -> Agent )
agent
Agent-
Agent instance to modify.
name
str | None-
Agent name (optional).
description
str | None-
Agent description (optional).
Types
AgentPrompt
Prompt for agent.
class AgentPrompt(NamedTuple)
Attributes
instructions
str | None-
Agent-specific contextual instructions.
handoff_prompt
str | None-
Prompt used when there are additional handoff agents active.
assistant_prompt
str | None-
Prompt for assistant (covers tool use, submit tool, CoT, etc.).
AgentAttempts
Configure a react agent to make multiple attempts.
Submissions are evaluated using the task’s main scorer, with value of 1.0 indicating a correct answer. Scorer values are converted to float (e.g. “C” becomes 1.0) using the standard value_to_float() function. Provide an alternate conversion scheme as required via score_value
.
class AgentAttempts(NamedTuple)
Attributes
attempts
int-
Maximum number of attempts.
incorrect_message
str | Callable[[AgentState, list[Score]], Awaitable[str]]-
User message reply for an incorrect submission from the model. Alternatively, an async function which returns a message.
score_value
ValueToFloat-
Function used to extract float from scores (defaults to standard value_to_float())
AgentContinue
Function called to determine whether the agent should continue.
Returns True
to continue (with no additional messages inserted), return False
to stop. Returns str
to continue with an additional custom user message inserted.
= Callable[[AgentState], Awaitable[bool | str]] AgentContinue: TypeAlias
AgentSubmit
Configure the submit tool of a react agent.
class AgentSubmit(NamedTuple)
Attributes
name
str-
Name for submit tool.
description
str-
Description of submit tool.