The cost of the second pilot
Most AI initiatives die in the gap between the first demo and the second customer.
- AI adoption
- Pilots
- Go-to-market
- Operating
Every CTO we have spoken to in the last twelve months has done at least one AI pilot. About half of them have shipped something to a design partner. Almost none of them have a second customer on the same product.
This is the gap. It is not a technical gap and it is not a marketing gap. It is the gap between one thing working with heavy support and the same thing working without it. It is where most AI initiatives quietly die.
The first pilot is cheap on purpose
Every reasonable team starts the same way. Pick a friendly customer. Hand-tune the prompts for their data. Manually fix the edge cases as they appear. Ship something that demos well, in their environment, with their data, with the founding engineer on the Slack channel.
That pilot is cheap by design. It externalizes the unsolved engineering problems onto goodwill — the customer tolerates the bugs because they got the system early; the engineer tolerates the manual ops because they are validating the product.
The first pilot is supposed to be cheap. It is not supposed to be the product.
The second customer is where the bill comes due
The second customer has different data, different edge cases, different expectations, and crucially, no installed goodwill. The shortcuts that made the first pilot work do not work twice. The engineering team discovers, in order:
- The eval set is the first customer's eval set. It was never general.
- The prompts are tuned to one schema, one domain vocabulary, one regulatory regime.
- The retrieval was working because the first customer's corpus was small enough to fit in context.
- The judge model agrees with the first customer because the rubric was written for the first customer's preferences.
- The cost economics that worked in pilot do not work at the second customer's volume.
- The error modes that the first customer absorbed silently are immediately visible to the second.
This is the bill. And it is much larger than the original pilot, because turning the pilot into a product is the actual product work. The pilot was the trailer.
How to budget for it
The teams that get through the gap budget for it explicitly. The shape that works:
Phase one — Design partner. 4-6 weeks. One customer, one workflow, maximum manual support. Goal: prove the AI capability is real, not the product.
Phase two — Productization. 6-10 weeks. Goal: take everything that was manual in phase one and make it automatic, eval'd, monitored, and cheap. No new features. New ones in this phase are the kiss of death.
Phase three — Second customer. 4-6 weeks. Goal: ship the productized system to a customer whose data and edge cases were not used to build it. Find where it breaks. Fix the systemic causes, not the symptoms.
Teams that skip phase two — and most do — ship phase one to phase three and learn the bill exists the expensive way.
What this means for vendor selection
If you are buying AI software in 2026, the most useful filter is: does this vendor have a second customer on this exact product? Not on related products. Not on customizations. Not on a marketing case study that names a logo without saying what was shipped. The same product.
A surprising number of vendors fail this filter. They have customer A on a heavily-customized v1 and customer B on a different heavily-customized v1, and what they are selling you is a third v1. That is not a product. That is consulting.
The vendors who pass this filter are the ones who paid the second-pilot bill. They are easier to spot than they sound — the case studies are specific, the integrations are versioned, the eval scorecards are real, and the support process does not require a founding engineer.
And for builders
The takeaway is uncomfortable: the second customer costs more than the first, not less. The economics of AI products are inverted from the economics of traditional SaaS, where each marginal customer is cheaper than the last. Until your system genuinely generalizes — which usually happens around customer four or five — every new customer is teaching you something expensive about what you did not actually generalize.
Budget for it. Tell your board about it. Do not promise the second pilot will be like the first. It is the part where the work happens.
Stuck between the first design partner and the second customer? Book a call — this is exactly the gap we are usually hired to help close.
More insights.
Work with usThe AI product engineer is the new full-stack engineer
Senior product engineering used to mean comfort across the front-end, the back-end, and the database. The bar has moved. The teams shipping consequential software in 2026 demand a fourth competency — turning probabilistic models into reliable product behavior — and the engineers who have it are the highest-leverage hires on the org chart.
industryWhy "build vs. buy" just flipped
For two decades, "buy the SaaS" was the right answer for almost any internal-tool question. The economics that made that true are no longer true. We walk through what changed, what it means for vendor selection, and which categories are most exposed.