Agent

Package: flyte.ai.agents.agent

A flyte-native tool-use agent harness.

Parameters

name: Stable agent identifier (used for logs and event payloads). instructions: Base system prompt. Skills and a tool catalog summary are appended automatically. model: Model identifier passed to call_llm. Defaults to "claude-haiku-4-5". tools: Sequence (or {name: tool} mapping) of tools the agent may call. Each entry can be a plain callable, a @flyte.trace helper, an @env.task :class:~flyte.TaskTemplate, a :class:~flyte.remote._task.LazyEntity, or a pre-built :class:AgentTool. mcp_servers: Optional remote / stdio MCP servers whose tools should be loaded into the agent’s tool registry on first use. See :class:MCPServerSpec. skills: Extra context appended to the system prompt. Each entry is either a string or a :class:pathlib.Path pointing to a local text file. max_turns: Maximum number of LLM ↔ tool turns before the loop gives up. call_llm: Optional async callback (model, system, messages, tools) -> LLMMessage. Defaults to :func:_default_call_llm (uses litellm). approval_callback: Optional async callback (tool, args) -> bool invoked when a tool with requires_approval=True is about to run. Defaults to a HITL prompt via flyteplugins-hitl. parallel_tool_calls: When True (default) tool calls returned in a single assistant message are executed concurrently. Set to False to force strict sequential execution (useful when tool side-effects must be ordered). Ignored in code mode. code_mode: When True the agent runs in code mode: instead of emitting JSON tool calls, the LLM writes a small Python program each turn that is executed in the Monty sandbox (flyte.sandbox.orchestrate_local) with the tools exposed as plain functions. The value of the program’s last expression becomes the observation for the next turn; the loop ends when the LLM replies with plain text (no code block). This unlocks generated control flow (loops, flyte_map fan-out, intermediate aggregation) while still dispatching @env.task tools durably on-cluster. Tools with a call_handler run through that handler in code mode as well. Requires pydantic-monty in the runtime image. Note: per-tool HITL approval is not enforced in code mode, since tools are invoked from inside the sandbox rather than as discrete approved calls.

Parameters

        
    
class Agent(
    name: str,
    instructions: str,
    model: str,
    tools: Sequence[Any] | Mapping[str, Any],
    mcp_servers: Sequence[MCPServerSpec],
    skills: Sequence[str | pathlib.Path],
    max_turns: int,
    call_llm: LLMCallable,
    approval_callback: ApprovalCallback,
    parallel_tool_calls: bool,
    code_mode: bool,
)

Parameter	Type	Description
`name`	`str`
`instructions`	`str`
`model`	`str`
`tools`	`Sequence[Any] \| Mapping[str, Any]`
`mcp_servers`	`Sequence[MCPServerSpec]`
`skills`	`Sequence[str \| pathlib.Path]`
`max_turns`	`int`
`call_llm`	`LLMCallable`
`approval_callback`	`ApprovalCallback`
`parallel_tool_calls`	`bool`
`code_mode`	`bool`

Properties

Property	Type	Description
`system_prompt`	`str`	The fully-rendered system prompt, including skills + tool catalog.

Methods

Method	Description
`add_tool()`	Register an additional tool after construction.
`approval_callback()`	Default HITL approval: ask the user via `flyteplugins-hitl`.
`call_llm()`	Default LLM callback that uses ``litellm.
`run()`	Drive the LLM ↔ tool loop until the assistant returns a final reply.
`tool_descriptions()`	Conform to :class:`~flyte.

add_tool()

        
    
def add_tool(
    obj: Any,
    name: str | None,
) -> AgentTool

Useful when tools need access to runtime objects (e.g. an HTTP client created inside a task).

Parameter	Type	Description
`obj`	`Any`
`name`	`str \| None`

approval_callback()

        
    
def approval_callback(
    tool: AgentTool,
    args: *args,
) -> bool

Default HITL approval: ask the user via flyteplugins-hitl.

Returns True if the user approves the tool call. When the plugin is not installed (or the agent is running outside a Flyte task context), this falls back to denying the call so that the agent can recover by trying a different approach.

Parameter	Type	Description
`tool`	`AgentTool`
`args`	`*args`

call_llm()

        
    
def call_llm(
    model: str,
    system: str,
    messages: list[dict[str, Any]],
    tools: list[dict[str, Any]] | None,
) -> LLMMessage

Default LLM callback that uses litellm.acompletion with tool calling.

Compatible with any provider that litellm supports (OpenAI, Anthropic, Gemini, Bedrock, local OpenAI-compatible servers, …).

Parameter	Type	Description
`model`	`str`
`system`	`str`
`messages`	`list[dict[str, Any]]`
`tools`	`list[dict[str, Any]] \| None`

run()

This method can be called both synchronously or asynchronously.

Default invocation is sync and will block. To call it asynchronously, use the function .aio() on the method name itself, e.g.,: result = await <Agent instance>.run.aio().

        
    
def run(
    message: str,
    memory: list[dict[str, Any]] | MemoryStore | None,
) -> AgentResult

Drive the LLM ↔ tool loop until the assistant returns a final reply.

Implements the :class:~flyte.ai.agents.protocol.AgentProtocol so instances can be plugged directly into :class:~flyte.ai.chat.AgentChatAppEnvironment.

The agent is decoupled from any persistent state: memory is passed in per call rather than attached to the agent. memory may be:

None: a stateless, single-shot conversation.
a list[dict]: prior messages to prepend (e.g. a chat history). The returned :class:AgentResult carries no memory in this case.
a :class:MemoryStore: its transcript is prepended, the in-flight transcript is appended back to it, and it is returned on :attr:AgentResult.memory. Persistence is the caller’s responsibility: call memory.save() (or .save.aio()) after run to write the updated transcript back to its keyed remote path.

Call synchronously via run(...); in async contexts use run.aio(...).

Parameter	Type	Description
`message`	`str`
`memory`	`list[dict[str, Any]] \| MemoryStore \| None`

tool_descriptions()

        
    
def tool_descriptions()

Conform to :class:~flyte.ai.agents.protocol.Agent.

MCP tools loaded lazily are only listed after the first :meth:run call.