Engineering Multi-Model AI Workflows
Book 2 of the Full-Stack AI Engineering Series. The application layer. An agent is not a clever prompt. It is a governed actor with a contract, a boundary, and a record.
A single clever prompt delights on Monday and embarrasses you on Thursday. The day a stakeholder asks what your agents do, what they are allowed to touch, and what one of them did last Tuesday at three in the afternoon, the team discovers the answer was never built. It was only hoped for.
An agent that demos well and fails in production fails silently, in language fluent enough to be believed. An institution that cannot review what its agents decided, before the decision takes effect, has surrendered control of its own liability without noticing. The problem was never the prompt. The problem is orchestration.
AgentMesh is the catalog of record and the orchestration engine: it turns a pile of prompts into governed agents that plan, act, verify, and submit to human review. Each agent declares a contract, runs inside a boundary, and leaves a record. Orchestration, not cleverness, is the discipline that scales.
Every registered agent satisfies four obligations before AgentMesh admits it: a contract, a boundary, a record, and a review. An agent does not exist at Nebula until it has declared all four.
Planner, executor, verifier, generator. The planner decomposes, the executor acts under contract, the verifier judges truth, the generator gives it voice and may never add what was not verified.
Research anchorA small, inspectable, machine-readable declaration of domain, powers, data scope, and tools. Not documentation. The declared boundary the system holds the agent to.
Workflows as activity-on-vertex graphs: nodes are agents, edges are dependencies, parallelism is read off the structure rather than declared.
Research anchorThree review tiers mapped to payment size and regulatory impact. Escalate on a whisper. Downgrade on a verdict.
Research anchorReviewer feedback as first-class, typed data. An edit is data about one answer. A reason is data about a pattern.
Data-only versus action-capable separation, least privilege, PII minimization, and red-teaming at four seams, behind a pre-registration safety review.
No background in regulation, finance, or enterprise architecture is required; Nebula Financial supplies the regulated pressure. What is assumed is a mindset: comfortable calling a model from code, reading a schema, and reasoning about a contract. Governance is not the tax on building agents. It is the thing that lets you build the thousandth one.
An agent that cannot say "I do not know" will eventually say something false with total confidence.Prompt Systems & Agent Orchestration
Three books, one fictional regulated fintech, Nebula Financial, and three systems that are not three products but three faces of one platform, each owning a layer of the stack.
The book is in draft. Leave a name and a working email, and you get one note when Book 2 publishes. No list, no noise, no second message.
One note when Prompt Systems & Agent Orchestration publishes. Until then, the Enterprise Playbook is out now.
An agent is not a product. It is a workload that has not yet met its platform.