Vercel AI SDK provider for Sanity Content Agent API.
npm install content-agent ai
ContentAgent extends the AI SDK's ToolLoopAgent — it inherits .generate(), .stream(), telemetry, and every other capability of the AI SDK's first-class agent abstraction. Construction is the only thing that differs: pass a provider and threadId instead of a model, and ContentAgent resolves the model for you.
import { createContentAgent } from "content-agent";
import { ContentAgent } from "content-agent/agent";
const provider = createContentAgent({
organizationId: "your-org-id",
token: "your-sanity-token",
});
const agent = new ContentAgent({
provider,
threadId: "my-thread",
});
const { text } = await agent.generate({
prompt: "What blog posts do I have?",
});
provider is reusable — create it once per organization and instantiate as many ContentAgents as you need.
Why a class instead of
streamText({ model, ... })?The AI SDK 6 explicitly invites SDK authors to ship their own
Agentimplementations: "TheAgentis now an interface instead of a class, which allows developers to implement their own agent abstractions for specific needs."ContentAgentis that implementation for the Sanity Content Agent — it preconfiguresstopWhen,prepareStep, andtoolChoicefor thefinal_answerreply pipeline so you don't have to.
Migrating from
streamText({ model: provider.agent(threadId) })? Nothing breaks — the lower-level API is still supported and works exactly as before. Switch toContentAgentto pick up the safe defaults that prevent empty responses when the model burns its step budget on client tool calls. See How replies work below.
const result = await agent.stream({
prompt: "Summarize my latest content",
});
for await (const text of result.textStream) {
process.stdout.write(text);
}
Pass AI SDK tools to the constructor — schemas are forwarded to the agent, and execution happens locally on the client:
import { ContentAgent } from "content-agent/agent";
import { tool } from "ai";
import { z } from "zod";
const agent = new ContentAgent({
provider,
threadId: "my-thread",
tools: {
submit_for_review: tool({
description: "Submit a document for editorial review.",
inputSchema: z.object({
documentId: z.string().describe("The Sanity document _id"),
email: z.string().email().describe("Reviewer email"),
note: z.string().optional().describe("Note for the reviewer"),
}),
execute: async ({ documentId, email, note }) => {
const res = await fetch("https://workflows.example.com/api/review", {
method: "POST",
body: JSON.stringify({ documentId, email, note }),
});
return res.json();
},
}),
},
});
const result = await agent.stream({
prompt: "Find all draft blog posts and submit them for review.",
});
Every reply is delivered through a server-side final_answer tool call. The agent streams final_answer.message to the wire as text-delta chunks (parsed incrementally from the tool-call args) and includes structured suggestions for clients that render reply buttons. There's no separate "streaming mode" — every conversation, every client follows the same path.
ContentAgent adjusts the loop control primitives to make this safe by default:
stopWhen defaults to "stop on the first step that contains a final_answer tool call," plus a stepCountIs(20) safety cap. A final_answer accompanied by other tool calls in the same step is the canonical act-and-answer pattern — the loop terminates without an extra round-trip.prepareStep is composed with a safety net that, on the cap step, forces toolChoice { type: 'tool', toolName: 'final_answer' } if the model has been calling tools without ever converging on final_answer. This guarantees a reply on the wire even in pathological cases where the model would otherwise burn its step budget on client-tool round-trips.final_answer naturally on any single turn, the agent runner retries that turn with forced tool_choice. Anthropic's API enforces forced tool choice strictly; the recovery is guaranteed to produce a final_answer chunk.If you pass your own stopWhen or prepareStep, they take precedence — ContentAgent doesn't reach into them.
Each application key uniquely identifies a deployed Sanity Studio workspace. Since multiple studios can share the same project ID and dataset, the application key disambiguates which one to target.
List available applications for your organization, then target one by its key:
const apps = await provider.applications();
const app = apps.find((app) => app.title === "My Studio");
const agent = new ContentAgent({
provider,
threadId: "my-thread",
application: { key: app.key },
});
You can also set a default application on the provider so every agent and .prompt() call inherits it:
const provider = createContentAgent({
organizationId: "your-org-id",
token: "your-token",
application: resolvedApp,
});
Pass a config object to control agent behavior:
const agent = new ContentAgent({
provider,
threadId: "my-thread",
config: {
instruction: "You are a helpful content assistant for our marketing team.",
userMessageContext: {
"slack-channel": "#marketing",
"user-role": "editor",
},
capabilities: {
read: true,
write: false,
},
filter: {
read: '_type in ["post", "author", "category"]',
write: '_type == "post"',
},
},
});
| Field | Type | Description |
|---|---|---|
instruction |
string |
Custom instruction included in the system prompt |
userMessageContext |
Record<string, string> |
Key-value pairs appended to each user message as XML tags |
capabilities |
object |
Controls what operations the agent can perform (read, write) |
filter |
object |
GROQ boolean expressions for document access control |
perspectives |
object |
Perspective locking for read and write operations |
Each capability can be true, false, or a preset object:
// Read-only
config: { capabilities: { read: true, write: false } }
// Minimal write access
config: { capabilities: { read: true, write: { preset: "minimal" } } }
For stateless single-turn requests, use the provider's .prompt() method directly — no thread, no agent class:
const { text } = await provider.prompt(
{
application: { key: "projectId.datasetName" },
config: { capabilities: { read: true, write: false } },
instructions: "Be concise",
},
{ message: "List my 5 most recent posts" },
);
createStudioAgent makes the provider work inside Sanity Studio with no token or manual configuration. It derives the API host, organization ID, and application from the Studio context:
import { useMemo } from "react";
import { createStudioAgent } from "content-agent";
import { ContentAgent } from "content-agent/agent";
import { useClient, useWorkspace } from "sanity";
function MyComponent() {
const client = useClient({ apiVersion: "2024-01-01" });
const { name: workspace } = useWorkspace();
const provider = useMemo(
() => createStudioAgent(client, workspace),
[client, workspace],
);
const agent = useMemo(
() => new ContentAgent({ provider, threadId: "my-thread" }),
[provider],
);
// Use agent.stream() / agent.generate() with useChat, etc.
}
The factory call is synchronous — async setup (org ID fetch, application resolution) runs lazily on the first API call and is cached at the module level. Wrap in useMemo so the provider and agent keep stable identities across renders.
Since there is no token in a Studio context, credentials: 'include' is set on all fetch calls automatically so the browser sends session cookies. The base URL is derived from the client's project-scoped getUrl method.
useChatFor a streaming chat UI, pair with DirectChatTransport and useChat from the AI SDK:
import { useMemo, useState } from "react";
import { useClient, useWorkspace } from "sanity";
import { useChat } from "@ai-sdk/react";
import { DirectChatTransport } from "ai";
import { createStudioAgent } from "content-agent";
import { ContentAgent } from "content-agent/agent";
function ChatTool() {
const client = useClient({ apiVersion: "2024-01-01" });
const { name: workspace } = useWorkspace();
const [threadId] = useState(() => `thread-${crypto.randomUUID()}`);
const transport = useMemo(() => {
const provider = createStudioAgent(client, workspace);
const agent = new ContentAgent({ provider, threadId });
return new DirectChatTransport({ agent });
}, [client, workspace, threadId]);
const { messages, sendMessage, status } = useChat({ transport });
// Render messages, input form, etc.
}
If you need more control than ContentAgent provides, the underlying primitives are still exported:
provider.agent(threadId, settings) — returns a raw LanguageModelV3 you can pass to streamText/generateText/ToolLoopAgent directlyfromClient(client, workspace) — derives baseURL and application from a Sanity clientprovider.resolveApplication(client, workspace) — fetches applications and matches by resource ID + workspace nameMIT