Agent Context and Caching

Shared agent context pipeline, workspace business profiles, compact schema prompts, provider-aware caching, and runtime loop controls.

Problem

The current agent stack works, but it is too prompt-string heavy and too expensive for long-lived use:

Chat injects every entity type schema into the prompt on every turn.
Entity-scoped chat trusts client-supplied entity context instead of reloading the latest server truth.
Chat, workflow, extraction, heartbeat, and inbox each assemble context differently.
Stable instructions and tool definitions are not provider-cache aware.
Long conversations have no provider-level context management strategy.
Shared context and memories can grow without meaningful prompt budgeting.
Agents are not given a structured tenant-authored business profile, so OCI-specific knowledge depends on whatever happens to exist in records or prior conversations.

This leaves quality, latency, and cost on the table.

Goals

Create a shared context pipeline that all agent entry points can reuse.
Keep stable context rich, but compact enough to avoid wasting prompt budget.
Add tenant-level business profile support so OCI and future clients can author durable business context once and inject it everywhere.
Add provider-aware caching and context management for Anthropic-backed agents.
Improve prompts and tool instructions around retrieval-first behavior, freshness, citations, and value framing.
Keep the implementation platform-safe: no product-specific slugs in platform code.

Design

Shared Context Layers

Prompts should be assembled from distinct layers with different stability and cost profiles:

Stable cached context
- Base system prompt
- Tenant-authored workspace business profile
- Compact data-type summary
- Stable shared-context guidance (corrections, lessons, routing)
Dynamic session context
- Current date/time
- Current user and role
- Active entity context
- Recent bounded memories
Execution-local context
- Workflow node instructions
- Extraction field instructions
- Heartbeat attention context

Workspace Business Profile

Add a tenant setting key, agent_context, with a structured schema:

{
  summary: string;
  industry?: string;
  businessModel?: string;
  customers?: string[];
  valueDrivers?: string[];
  coreProcesses?: string[];
  keySystems?: string[];
  successMetrics?: string[];
  terminology?: string[];
  constraints?: string[];
  currentPriorities?: string[];
  differentiators?: string[];
}

This becomes the durable place to encode what OCI actually does, how it makes money, which processes matter, which systems it relies on, and how success should be judged.

Prompt Compaction

Replace raw full-schema injection with a compact type summary:

show type name, slug, description, active fields, and relation labels
cap fields/relations per type
highlight the focused type when a chat is scoped to a record
instruct agents to use listEntityTypes({ detailed: true }) when they need the full schema

Entity context should also be compacted:

summarize field values as named bullets
cap field count and value length
avoid dumping large JSON blobs into the system prompt

Runtime Improvements

Enhance the shared runtime to support:

deterministic tool ordering
Anthropic cache control on stable system/tool definitions
Anthropic context management for long chats
pass-through toolChoice and prepareStep so callers can tighten agent loops when needed

Chat should no longer trust client-supplied entity title/type/content as system-prompt input. If an entity ID is provided, the route should reload the latest tenant-scoped entity context from the server and only inject that verified version.

Acceptance Criteria

Rollout Notes

agent_context is additive and backward-compatible. Workspaces without it continue to run with existing prompts.
Anthropic caching/context-management should only be applied when the resolved model ID is Claude-family.
Tool ordering must remain deterministic so prompt caching gets reuse from identical tool definitions.

Feature Backlog

All planned work for the Sprinter Platform, organized by priority and category.

Agent Ops and Telemetry

Admin authoring for workspace business context, runtime telemetry surfacing, and prompt regression coverage for agent quality.