Field notes on code, debt and judgment in the LLM era.
Essays from the team behind CodeNSM — first a long, honest walk through the problems of the AI coding era, then the methodology we built in response: what production telemetry reveals about codebases, why the old debt instruments stopped working, and how we run pre-registered science on the results.
The Problem — a 30-part walk through the AI coding era
Written to be read in order. Start at Part 1 →
Methodology & Product — how CodeNSM answers it
Pre-registered software telemetry science: why we wrote 51 falsifiable hypotheses before looking at the data
Most vendor 'research' is marketing that went looking for a p-value. We did the opposite: registered 51 falsifiable hypotheses about production codebases in code, before the data arrived — and built the anonymization so the database itself can be shared for verification.
Engineering philosophyRule engines where we know the physics, LLMs only for the last mile
The fashionable way to build CodeNSM would be to pour telemetry into a large model and ask it what it thinks. We refused. Here is why every archetype, state and score in CodeNSM is deterministic — and where the LLM genuinely belongs.
MethodologyMeet your code employees: the CodeNSM archetype taxonomy
Every function in a production codebase is doing a job — routing, record-keeping, diplomacy, security, cleanup. Name the job and suddenly 'is this function healthy?' becomes answerable, because you can compare it to its peers instead of to the whole company.
DiligenceReading a codebase like a balance sheet
Acquirers diligence revenue to two decimal places and diligence the codebase with a two-hour architecture call. Here is the split that actually matters — unique engineering IP versus vanilla glue — and how runtime evidence turns 'trust me' into a schedule of assets.
Agencies & studiosProof over promises: attaching living quality evidence to agency deliverables
Every agency says 'we write clean code.' None of them can prove it a month after handover, when the invoice is paid and the codebase is a stranger's problem. Living telemetry turns the handover from a zip file into a warranty.
Engineering leadershipWalking the floor at scale: what tech leads actually monitor, and why your dashboards miss it
Great tech leads do a daily gemba walk through the codebase — noticing fragile corners, drifting boundaries, load-bearing hacks. Commit counts and velocity charts capture none of it. Here is what the walk actually consists of, and what it takes to keep walking at 40 engineers.
Founder lensThe non-technical founder's blind spot
You can read your P&L, your funnel, your churn cohort — and you cannot read the one asset everything else depends on. What a great tech lead sees daily, why you structurally can't, and how much of that judgment layer is now automatable.
LLM-era debtThe tech-debt crisis nobody codes anymore
Cunningham's debt metaphor assumed the borrower knew they were borrowing. In 2026, most new code is drafted by models, reviewed by people who never wrote the line, and shipped at a volume no review process was designed for. The metaphor needs an upgrade — and so does the measurement.