Diego Baldi — AI orchestration, governance, agent reliability

Diego Baldi

CTO focused on AI orchestration, governance, and agent reliability.

Notes on building AI systems that survive production — agent reliability, orchestration, and the kind of boring governance that keeps things working when the model changes underneath them.

Recent writing All posts →

May 25 · 2026
Hello, world
A short note on what this site is, who it's for, and the thesis the rest of the writing will defend.

meta essay

Projects All projects →

AI Control Plane In development

A governance & observability runtime for enterprise agents — policy enforcement, traceable tool calls, and human-in-the-loop checkpoints.

Python · FastAPI · LangGraph · Postgres

Eval harness for retrieval-grounded agents Planned

Open benchmark scoring agent answers against a frozen corpus, with adversarial paraphrase and recency probes.

Python · DuckDB

Trace replay CLI Planned

Re-runs a stored agent session against a different model and diffs the divergence. Debugging aid, eval pipeline component.

Python · TypeScript

/now

Currently scaffolding the AI Control Plane and reading deeply on agent reliability engineering. See what I'm focused on this month →