Log · AI Factory

From the blog.

Engineering notes on building AI you can trust — and the factory that produces it.

Less code is not better design

Why AI agents default to the option that passes the tests — not the one that survives production. And the mechanism we use to catch it before the merge.

Read the post

AI Factory~7 min

The 4 layers of an AI factory

An agent ignores advice and obeys what is deterministic. The 4 layers — with real configs — for building with agents without breaking production. Paste it into Claude Code and it configures itself.

Trust engineering~5 min

A claim is not a chunk

Retrieving the right passage is not verifying the fact. The unit of truth is the atomic claim bound to its literal source.

Trust engineering~7 min

Belief, doubt and uncertainty

A confidence number hides what matters most: how much evidence supports it, how much contradicts it, and how much we simply do not know.

Product~4 min

The traffic light, not the percentage

A 73% confidence score tells nobody what to do. A clear verdict, with the evidence one click away, does.

Trust engineering~6 min

Not every source carries the same weight

An ERP export, a Slack message and an LLM answer do not deserve the same initial trust. How we assign priors per source.

AI Factory~5 min

The factory verifies itself

We build a truth engine with AI agents. That forced us to apply, internally, the same discipline we demand from AI output.

Product~4 min

When two sources contradict each other

Jira says the 10th; Slack says the 12th. Most systems pick one silently. We flag it — before you decide on the wrong data.

Get every note in your inbox.

Engineering notes on trustworthy AI. No noise.