AI Agent Trucking V1 — Implementation Plan

Companion to: docs/proposals/ai-agent-trucking-v1.md. The proposal explains what and why; this plan tracks how and in what order.

A live execution copy of this plan also exists locally at ~/.cursor/plans/ai_agent_trucking_v1_*.plan.md for in-IDE todo tracking. The version here in the docs repo is the canonical, reviewable copy.

Field	Value
Status	Decisions locked — code not started, kickoff deferred behind `feature/tenant-roles-permissions` and Anthropic key acquisition
Owner	Scott Asher
Repos touched	`attunelogic-api`, `attunelogic-service`, `attunelogic-docs`
Branches	`feature/ai-agent-trucking-v1` (already created in api + service, per `44-release-branch-policy`)
Last updated	2026-05-16

Locked decisions live in the proposal doc's "Locked decisions" section. This plan file inherits them — if a phase task and the proposal disagree, the proposal wins.

Pre-build decisions (snapshot from 2026-05-16 review)

Mirrored from the proposal for quick reference while building. Authoritative copy lives in the proposal doc.

Build Phase 1+2 now; defer Phase 3 until Anthropic key + data-policy sign-off.
Deploy posture: AI_AGENT_GLOBAL_ENABLED=true in beta only; false in alpha/main.
Internal tenant cost handling: no auto-disable ceiling; alert only, at $10/day and $50/month per tenant. External pilot ceilings remain $25/month and $200/day.
AgentSession retention: 90-day TTL on startedAt (Mongo TTL index).
Dry-run default: per-tenant, no global default. Internal tenants start with dryRun = true and are flipped to false deliberately.
Names, paths, schemas, env vars, routes: locked as proposed — see proposal for the full list.
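
The 90-day AgentSession retention above could be expressed as a MongoDB TTL index along these lines. This is a minimal sketch, not the locked schema: it assumes the Node MongoDB driver's `createIndex(keys, options)` method, and the helper name `ensureAgentSessionTtl` and the `collection` handle are hypothetical.

```javascript
// 90 days expressed in seconds, the unit MongoDB TTL indexes expect.
const AGENT_SESSION_TTL_SECONDS = 90 * 24 * 60 * 60;

// Hypothetical helper: `collection` is assumed to be a MongoDB driver
// Collection (or any object exposing a compatible createIndex method).
async function ensureAgentSessionTtl(collection) {
  // Documents become eligible for deletion about 90 days after startedAt.
  return collection.createIndex(
    { startedAt: 1 },
    { expireAfterSeconds: AGENT_SESSION_TTL_SECONDS }
  );
}
```

A TTL index only applies when `startedAt` is stored as a BSON Date, and the background TTL task runs periodically, so expired sessions may linger briefly past the 90-day mark.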

Overview

Add an in-app AI assistant on the service web that lets trucking customers create multi-leg Jobs from natural language. The agent runs as an orchestrator on the API using Anthropic Claude with tool-use, resolves entities through scoped read-only tools, and commits via the existing handleExtractedJobCreate pipeline so created records flow through current validation, tenancy, and audit. Created jobs are flagged as AI-originated and surfaced in the drawer with a 5-minute Undo window.
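
The orchestrator pattern described above can be sketched as a bounded tool-use loop. This is an illustration under assumptions, not the planned implementation: the response shape (`stop_reason`, `tool_use` and `tool_result` content blocks) follows the Anthropic Messages API tool-use format, while `runAgentLoop`, the injected `callModel` function, the `tools` map, and the cap value of 8 are all hypothetical names and numbers.

```javascript
// Hypothetical cap; the real value is a proposal-level decision.
const MAX_ITERATIONS = 8;

// callModel(messages) resolves to a Messages-API-shaped response:
//   { stop_reason, content: [{ type: "text" | "tool_use", ... }] }
// tools maps a tool name to an async function taking the tool input.
async function runAgentLoop(callModel, tools, userText, maxIterations = MAX_ITERATIONS) {
  const messages = [{ role: "user", content: userText }];

  for (let i = 0; i < maxIterations; i++) {
    const response = await callModel(messages);
    messages.push({ role: "assistant", content: response.content });

    // Anything other than a tool request ends the loop.
    if (response.stop_reason !== "tool_use") {
      return { status: "done", iterations: i + 1, messages };
    }

    // Run every requested tool and send all results back in one user turn.
    const results = [];
    for (const block of response.content) {
      if (block.type !== "tool_use") continue;
      const tool = Object.hasOwn(tools, block.name) ? tools[block.name] : undefined;
      let content;
      let isError = false;
      try {
        if (!tool) throw new Error(`unknown tool: ${block.name}`);
        content = JSON.stringify(await tool(block.input));
      } catch (err) {
        // Report the failure to the model instead of crashing the loop.
        content = String(err.message);
        isError = true;
      }
      results.push({ type: "tool_result", tool_use_id: block.id, content, is_error: isError });
    }
    messages.push({ role: "user", content: results });
  }

  // Cap reached without a final answer: stop rather than keep spending tokens.
  return { status: "iteration_cap", iterations: maxIterations, messages };
}
```

Injecting `callModel` keeps the loop independent of any SDK, so the same function can run against a scripted stub in tests and against a real client later.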

For the full architectural context, decisions, kill-switch hierarchy, no-address PII model, and pre-release safety checklist, see the proposal doc.

Build phases

We build in 3 phases. Phases 1 and 2 do not require an Anthropic API key or the data-policy review — only Phase 3 does. This means we can land all the safety infrastructure, gates, and scaffolding while the policy review is in flight.

Phase 1 — Infrastructure & kill switches (no LLM, no Anthropic key needed)

Goal: a fully gated, observable, killable system before a single token is spent.

Ship value: every kill switch in place and verifiable. The platform can guarantee "AI is off" before any AI exists.
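
As a rough illustration of what "AI is off" can mean in code, the following fail-closed gate checks a global switch and then a per-tenant switch. It is a sketch under assumptions: `AI_AGENT_GLOBAL_ENABLED` and `featureFlagOverrides` are named in this plan, but the `isAgentAllowed` function, the `aiAgent` override key, and the two-level ordering are hypothetical; the authoritative kill-switch hierarchy lives in the proposal doc.

```javascript
// Returns { allowed, reason }. Anything not explicitly enabled is treated as off.
function isAgentAllowed(env, tenantConfig) {
  // Global switch: only the exact string "true" enables the agent,
  // so a missing or misspelled value leaves it disabled.
  if (env.AI_AGENT_GLOBAL_ENABLED !== "true") {
    return { allowed: false, reason: "global_disabled" };
  }

  // Per-tenant switch. The `aiAgent` key is a hypothetical flag name.
  const overrides = (tenantConfig && tenantConfig.featureFlagOverrides) || {};
  if (overrides.aiAgent !== true) {
    return { allowed: false, reason: "tenant_disabled" };
  }

  return { allowed: true, reason: "enabled" };
}
```

Returning a reason alongside the boolean makes each denial observable in logs and in tests, which is what lets the switches be verified before any model call exists.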

Phase 2 — Tools & scaffolding with stubbed LLM (still no real Anthropic call)

Ship value: entire system testable, killable, observable, and reviewable without any real LLM call.
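
One way to exercise the system without a real LLM call is a stub that replays scripted responses in order. The sketch below is an assumption about how such a stub might look, not the planned design: `createStubModel` and its `callModel` interface are hypothetical, and the scripted objects are assumed to mimic whatever response shape the orchestrator consumes.

```javascript
// Builds a stand-in for the model call that replays canned responses in order.
function createStubModel(scriptedResponses) {
  const calls = []; // snapshot of the messages passed on each call
  let index = 0;

  async function callModel(messages) {
    calls.push(messages.slice());
    // Running past the script is a test bug, so fail loudly.
    if (index >= scriptedResponses.length) {
      throw new Error("stub model: script exhausted");
    }
    return scriptedResponses[index++];
  }

  return { callModel, calls };
}
```

Because the stub records every call, a test can assert both on what the agent did and on exactly what it would have sent to the model.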

Phase 3 — Wire Anthropic (requires data-policy decision + API key)

api_deps_env — Add @anthropic-ai/sdk and ANTHROPIC_API_KEY / AI_AGENT_DEFAULT_MODEL to config/keys.js, config/index.js, and .env.example.
api_anthropic_wrapper — Create src/services/ai/anthropic.js (client singleton, default model, token usage helper). Fail-closed when ANTHROPIC_API_KEY missing → 503.
api_agent_orchestrator — Build src/services/ai/agent/{index.js,systemPrompt.js} implementing the Claude tool-use loop with iteration cap.
Cost estimation against pricing.js table; enforce per-tenant monthly $ ceiling and platform 24h ceiling; circuit breaker flips L2 on threshold breach.
Wire health endpoint to report anthropicReachable, lastSuccessfulCallAt, real error rates, MTD cost.
Internal smoke test in dry-run mode against a test tenant, then an internal employee tenant with real writes, then 1 friendly external pilot tenant.
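
The cost-estimation and ceiling tasks above might look roughly like the following. This is a sketch under stated assumptions: the `usage` object mirrors the `input_tokens` / `output_tokens` fields returned by the Anthropic Messages API, but the `PRICING` table, its model name, and its rates are placeholders rather than real prices, and `estimateCostUsd` and `checkCeilings` are hypothetical function names.

```javascript
// Placeholder table: the model name and USD-per-million-token rates are
// illustrative only and do not reflect real pricing.
const PRICING = {
  "example-model": { inputPerMTok: 3, outputPerMTok: 15 },
};

// usage mirrors the Messages API usage object: { input_tokens, output_tokens }.
function estimateCostUsd(model, usage, pricing = PRICING) {
  const rates = pricing[model];
  // Unknown model: fail closed instead of silently costing $0.
  if (!rates) throw new Error(`no pricing entry for model: ${model}`);
  return (
    (usage.input_tokens / 1e6) * rates.inputPerMTok +
    (usage.output_tokens / 1e6) * rates.outputPerMTok
  );
}

// Returns the name of the ceiling the next call would breach, or null.
// spend:  { tenantMonthToDateUsd, platformLast24hUsd }
// limits: { tenantMonthlyUsd, platform24hUsd }
function checkCeilings(spend, limits, nextCallUsd) {
  if (spend.tenantMonthToDateUsd + nextCallUsd > limits.tenantMonthlyUsd) {
    return "tenant_monthly";
  }
  if (spend.platformLast24hUsd + nextCallUsd > limits.platform24hUsd) {
    return "platform_24h";
  }
  return null;
}
```

Checking the projected spend before the call, rather than the recorded spend after it, means a ceiling is never exceeded by the call that trips it. A non-null result is the kind of signal a circuit breaker could act on.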

Pre-release & rollout

prerelease_checklist — Run the full pre-release safety checklist (kill-switch hierarchy verified end-to-end, dry-run mode tested, circuit breaker unit-tested, panic button verified in alpha, runbook published in attunelogic-docs, load test against rate limits, cost ceiling triggers tested, all logs verified PII-free, no-address tests green) before flipping any production tenant on. Full checklist lives in the proposal doc.
prerelease_runbook — Add docs/operations/ai-agent-runbook.md covering the kill-switch hierarchy, panic button procedure, circuit breaker recovery, common failure modes, on-call escalation, and how to interpret AI Activity widget signals.
rollout — Branch feature/ai-agent-trucking-v1 in both repos; enable flag per pilot tenant via Config.featureFlagOverrides; promote feature → beta → alpha → main. Pilot stages: internal alpha (dry-run) → internal beta → 1 friendly external → 3-5 → GA.

Cross-references

Proposal (architecture, decisions, safety story): docs/proposals/ai-agent-trucking-v1.md
Operational runbook (to be written in Phase 1): docs/operations/ai-agent-runbook.md
Local Cursor plan (live todos, do not commit): ~/.cursor/plans/ai_agent_trucking_v1_*.plan.md
Existing extraction pipeline: attunelogic-api/docs/JOB_EXTRACTION_API.md
Branch policy: 44-release-branch-policy (in attunelogic-api repo)
