The problem
What was broken before AI
As AI coding tools spread through an engineering organization, the hard problem shifts from “can the model write code?” to “can the team trust, measure, and improve the way agents work?” Pull requests may have thin descriptions. Test failures may repeat without anyone capturing the lesson. Teams may not know where AI agents are helping or getting stuck. Products may also be hard for agents to install if the integration flow assumes a human is clicking through every step.
What changed
What the use case made possible
Brian’s team started treating AI coding as an engineering system with feedback loops. A PR-description hook uses Claude Code to assess whether a pull request description is good enough for review. Telemetry pipelines help the team understand AI tool usage and behavior. Flaky-test agents capture failures, fix issues where possible, and remember what happened. Intercom also redesigned install flows so agents can add the product to a codebase more easily.
Why this matters
Why this use case is worth studying
Brian’s workflow is valuable because it moves past the novelty of AI-generated code. The bigger opportunity is building the infrastructure that makes agent work visible, reviewable, and improvable. Intercom is not just asking developers to use AI more. It is shaping the surrounding process so the team can see what agents are doing, improve weak spots, and make their own product easier for agents to adopt.
Use this when
When this pattern applies
Use this pattern when AI coding has moved beyond individual experimentation and the team now needs trust, measurement, and repeatability. It works best when agent output affects shared engineering workflows like pull requests, tests, docs, setup, or product integrations.

