Agents produce motion by default. You have to design for shipped.
Left alone, a team of agents will generate enormous activity: drafts, plans, reviews, re-drafts. Activity feels like progress and is not. The single most important thing we did was learn to measure shipped outcomes, not entries in a log. If a week produces a hundred updates and three of them touched the real product, the number that matters is three.