All Posts

How LLMs Generate Tokens in Production

May 26, 2026

A walkthrough of the path from prompt text to generated tokens, and why production LLM serving is really about schedu...
Omotenashi, A Week of Noticing in Tokyo

May 04, 2026

I spent a week in Tokyo for the ClickHouse offsite and I couldn’t stop noticing the small design choices that made li...
LLM Benchmarks Are Flatlined. Task Horizons Are Not.

March 02, 2026

The headline accuracy numbers on standard benchmarks have stagnated. MMLU, TruthfulQA, HellaSwag: the top models have...
Ditch grep and Speed up Claude Code with LSPs

March 01, 2026

Grep is a text search. Code is not text. It’s a graph of symbols, types, and call chains. That gap is where Claude wa...
AI B*llsh*tting

November 04, 2025

Spotting AI Lies: How to Know When Your LLM is BS-ing
Edit Survival

October 26, 2025

Edit Survival - Quality metrics for AI coding agents
How To Write A Coding Agent In 169 Lines Of Python

October 15, 2025

Writing a minimal coding agent from scratch with no hidden magic. Just prompts, tool calls and a loop.