
Product Thinking in the AI Era

Why traditional product frameworks break down when applied to AI-native products, and what to use instead.

Product Strategy · AI · Frameworks · Systems Thinking

Traditional product management assumes deterministic software. You define a feature, engineers build it, QA tests it, and it ships. The output is predictable given the input. AI-native products break this assumption at every layer, and the frameworks built for deterministic software were not designed to handle that non-determinism.

The user story format collapses first. "As a user, I want the AI to summarize my documents so that I save time reading" is technically valid but operationally useless. It tells you nothing about what a good summary looks like, how to measure it, or when the feature is done. AI features need outcome specifications, not capability descriptions. Replace the user story with an evaluation rubric: what does excellent look like, what does acceptable look like, what is a failure, and how do you measure each.
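To make this concrete, here is one way such an evaluation rubric might be sketched in code. The level names, criteria, and the summarization example are illustrative assumptions, not a standard; the point is that each level is defined by observable criteria and a measurable check.

```python
from dataclasses import dataclass

@dataclass
class RubricLevel:
    name: str      # e.g. "excellent", "acceptable", "failure"
    criteria: str  # observable definition of the level
    measure: str   # how you would measure it in an eval run

# Hypothetical rubric for a document-summarization feature.
SUMMARY_RUBRIC = [
    RubricLevel("excellent",
                "Covers every key claim, fabricates nothing, under 150 words",
                "human spot-check pass rate >= 0.95"),
    RubricLevel("acceptable",
                "Covers most key claims, fabricates nothing",
                "pass rate >= 0.80"),
    RubricLevel("failure",
                "Misses a key claim or fabricates a fact",
                "any fabrication is an automatic failure"),
]

def grade(checks: dict) -> str:
    """Map per-criterion check results to a rubric level (sketch)."""
    if checks.get("fabrication"):
        return "failure"
    if checks.get("all_claims") and checks.get("length_ok"):
        return "excellent"
    return "acceptable"
```

A rubric like this replaces "done when the feature ships" with "done when outputs grade at the level we committed to."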

MVP thinking breaks next. A minimum viable AI product is a contradiction. An AI product that performs poorly in early users' hands does not generate the "validate the assumption, iterate" feedback loop that traditional MVPs depend on. It generates distrust that is extremely difficult to recover from. AI products need a minimum viable quality threshold before any user sees them. Build the evaluation framework before you build the product.
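A minimum viable quality threshold can be expressed as a release gate over offline eval results. The threshold values below are illustrative assumptions; the structure, not the numbers, is the point: the gate runs before any user sees the product.

```python
# Hypothetical release gate: the feature ships only if the eval set
# clears a minimum quality bar. Thresholds are placeholders.
MIN_COMPLETION = 0.85   # share of cases graded acceptable or better
MAX_FAILURES = 0.02     # hard cap on rubric-level failures

def ready_to_ship(grades: list[str]) -> bool:
    """grades: one rubric level per eval case, e.g. ["excellent", "failure", ...]."""
    if not grades:
        return False  # no evidence means no launch
    n = len(grades)
    completion = sum(g in ("excellent", "acceptable") for g in grades) / n
    failure_rate = grades.count("failure") / n
    return completion >= MIN_COMPLETION and failure_rate <= MAX_FAILURES
```

Note the empty-list case returns False: absence of evaluation data is itself a failing state, which is the "build the evaluation framework before the product" rule in code.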

The backlog structure needs to change. AI product backlogs have three distinct categories that should never be mixed: capability work (what the model can do), quality work (how reliably it does it), and trust work (how users understand and recover from its failures). Most teams conflate these, which is why AI roadmaps consistently over-promise and under-deliver on quality. Keep these backlogs separate and staff them separately.

Metrics require a new vocabulary. Traditional product metrics (activation, retention, conversion) are necessary but not sufficient for AI products. Add model-level metrics: task completion rate, intervention rate (how often users correct the AI), confidence calibration (does the model's expressed confidence match its actual accuracy), and drift detection (are outputs degrading over time as the world changes). These are not engineering metrics. They are product health indicators that belong on every product dashboard.
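Two of these metrics can be sketched directly. The log schema (`user_corrected`) is a hypothetical example, and the calibration measure below is a simple expected-calibration-error-style approximation that buckets predictions by confidence decile, not the only way to compute it.

```python
def intervention_rate(sessions: list[dict]) -> float:
    """Share of AI outputs the user corrected or overrode (hypothetical schema)."""
    return sum(s["user_corrected"] for s in sessions) / len(sessions)

def calibration_gap(preds: list[tuple[float, bool]]) -> float:
    """Weighted mean gap between expressed confidence and actual accuracy,
    bucketed by confidence decile (an ECE-style sketch).
    preds: (model confidence in [0, 1], was the output correct) pairs."""
    buckets: dict[int, list[tuple[float, bool]]] = {}
    for conf, correct in preds:
        buckets.setdefault(min(int(conf * 10), 9), []).append((conf, correct))
    total, gap = len(preds), 0.0
    for items in buckets.values():
        avg_conf = sum(c for c, _ in items) / len(items)
        accuracy = sum(ok for _, ok in items) / len(items)
        gap += abs(avg_conf - accuracy) * len(items) / total
    return gap
```

A rising intervention rate with flat retention is exactly the kind of signal traditional dashboards miss: users are staying but trusting the system less.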

The AI product manager's core skill is evaluation design. Not prompt engineering, not model selection, not pipeline architecture. Evaluation design. If you can precisely specify what a good output looks like across the full distribution of inputs your product will receive, you can hire engineers to build the system that achieves it. If you cannot, you are guessing.

The mental model shift required is from building features to building systems. Traditional product work is additive: add this feature, then that one. AI product work is systemic: change this component and unpredictable things happen elsewhere. Product managers who master AI will be the ones who develop strong systems intuition: the ability to reason about feedback loops, emergent behaviours, and second-order effects before they appear in production.