Log · AI Factory

From the blog.

Engineering notes on building AI you can trust — and the factory that produces it.

AI Factory~8 min

Less code is not better design

Why AI agents default to the option that passes the tests — not the one that survives production. And the mechanism we use to catch it before the merge.

Read the post
AI Factory~7 min
The 4 layers of an AI factory

An agent ignores advice and obeys what is deterministic. The 4 layers — with real configs — for building with agents without breaking production. Paste it into Claude Code and it configures itself.

Trust engineering~5 min
A claim is not a chunk

Retrieving the right passage is not verifying the fact. The unit of truth is the atomic claim bound to its literal source.

Trust engineering~7 min
Belief, doubt and uncertainty

A confidence number hides what matters most: how much evidence supports it, how much contradicts it, and how much we simply do not know.

Product~4 min
The traffic light, not the percentage

A 73% confidence score tells nobody what to do. A clear verdict, with the evidence one click away, does.

Trust engineering~6 min
Not every source carries the same weight

An ERP export, a Slack message and an LLM answer do not deserve the same initial trust. How we assign priors per source.

AI Factory~5 min
The factory verifies itself

We build a truth engine with AI agents. That forced us to apply, internally, the same discipline we demand from AI output.

Product~4 min
When two sources contradict each other

Jira says the 10th; Slack says the 12th. Most systems pick one silently. We flag it — before you decide on the wrong data.

Get every note in your inbox.

Engineering notes on trustworthy AI. No noise.

We use analytics (including heatmaps) to improve the site. You decide.