A conversation with OpenAI’s CPO Kevin Weil, Anthropic’s CPO Mike Krieger, and Sarah Guo

Click to expand the mind map for a detailed view.

Key Takeaways

AI Product Development is Unpredictable: AI capabilities evolve rapidly, making product planning stochastic and emergent.
Customer-Centric AI: Focus on solving real customer problems, not just implementing AI for the sake of it.
Enterprise vs. Consumer AI: Enterprise AI requires understanding buyers’ goals, not just end-user satisfaction.
60% Rule: AI products can be valuable even if they’re only 60% accurate, as long as they’re designed for human-in-the-loop workflows.
Evals are Critical: Writing effective evaluations (evals) is a core skill for AI product managers to measure and improve model performance.
Prototyping with AI: Use AI tools to rapidly prototype and test product ideas before committing to full development.
Proactive AI: Future AI will be more proactive, anticipating user needs and acting without explicit prompts.
Asynchronous AI: AI will handle longer, more complex tasks over extended periods, freeing users to focus on other work.
Model Personality Matters: AI models will develop distinct personalities, influencing user preferences and interactions.
Adapting to AI: Users adapt quickly to AI, but product teams must design for intuitive adoption and education.

Detailed Summary

1. AI Product Development Challenges

Unpredictable Capabilities: AI models evolve rapidly, making it hard to plan product roadmaps. Capabilities emerge unpredictably during model training.
Stochastic Nature: Product managers must adapt to the stochastic nature of AI, where model performance can vary widely (e.g., 60% vs. 99% accuracy).
Collaboration with Research: Close collaboration between product teams and research teams is essential to co-design and fine-tune models.

2. Enterprise vs. Consumer AI

Enterprise AI: Requires understanding the buyer’s goals, not just end-user satisfaction. Enterprise customers often need advanced notice (e.g., 60 days) before product changes.
Consumer AI: Focuses on user experience and rapid adoption. Consumer products can iterate faster but require intuitive design for mass adoption.

3. The 60% Rule

Human-in-the-Loop: AI products can be valuable even if they’re only 60% accurate, as long as they’re designed to involve humans in the workflow (e.g., GitHub Copilot).
Graceful Failure: Design for scenarios where the AI fails, ensuring users can easily correct or override the model’s output.

4. Evals and Model Performance

Core Skill for PMs: Writing effective evaluations (evals) is critical for measuring and improving AI model performance.
Automating Evals: AI models like Claude can help write and grade evals, making the process more efficient.
Iterative Improvement: Use evals to iteratively improve model performance, especially for enterprise use cases.

5. Prototyping with AI

Rapid Prototyping: Use AI tools to quickly prototype and test product ideas. For example, prompting Claude to generate UI comparisons before designers start in Figma.
Faster Iteration: AI enables faster iteration and experimentation, reducing the time from idea to prototype.

6. Future of AI: Proactivity and Asynchrony

Proactive AI: Future AI will anticipate user needs, offering proactive suggestions (e.g., meeting prep, draft presentations).
Asynchronous AI: AI will handle longer, more complex tasks over extended periods, allowing users to focus on other work while the AI reasons and researches.

7. Model Personality and User Interaction

Distinct Personalities: AI models will develop distinct personalities, influencing user preferences and interactions.
Customization vs. Consistency: Balance customization (e.g., personalized AI) with consistency (e.g., a unified brand personality).

8. Educating Users

In-Product Education: Use AI to educate users within the product (e.g., Claude explaining how to use a feature).
Enterprise Adoption: Leverage power users within organizations to evangelize and teach others how to use AI tools.

Conversational Insights

“AI is not intelligence-limited; it’s eval-limited.” – Models can do more than we realize, but we need better evaluations to unlock their potential.
“60% accuracy can still be valuable if you design for human-in-the-loop workflows.” – AI doesn’t need to be perfect to be useful.
“The future of AI is proactive and asynchronous.” – AI will anticipate needs and handle tasks over longer time horizons.
“Evals are the new core skill for product managers.” – Writing and refining evaluations is critical for building effective AI products.
“AI product development is stochastic and emergent.” – Capabilities emerge unpredictably, requiring flexible product planning.
“Enterprise AI is about the buyer, not just the end user.” – Understanding the buyer’s goals is key to success in enterprise AI.
“Prototyping with AI is underused.” – Use AI tools to rapidly prototype and test ideas before committing to full development.
“Model personality is a product role.” – The personality of an AI model influences user preferences and interactions.
“Users adapt to AI faster than we expect.” – People quickly normalize AI capabilities, even if they initially seem magical.
“AI will automate drudgery, freeing humans for creativity.” – Focus on automating repetitive tasks to allow users to focus on higher-value work.

Software Tools

GitHub Copilot: AI-powered coding assistant.
Claude: AI model by Anthropic, used for prototyping and evaluations.
Figma: Design tool for UI prototyping.
OpenAI GPT Models: Used for consumer, enterprise, and developer products.
Anthropic’s O1: Advanced reasoning model for complex tasks.

People Mentioned

Speakers

Kevin: AI product leader, working on consumer and enterprise AI products.
Mike: Former Instagram founder, now working on AI products at Anthropic.

Other Individuals

Andrej Karpathy: AI researcher, mentioned in the context of evaluating model performance.
Howard Schultz: Former CEO of Starbucks, referenced in an enterprise AI example.

Companies Mentioned

OpenAI: Known for GPT models and consumer/enterprise AI products.
Anthropic: Creator of Claude, focused on AI research and product development.
GitHub: Developer of GitHub Copilot, an AI-powered coding assistant.
Starbucks: Example of an enterprise customer using AI for operational efficiency.
Instagram: Referenced as a consumer product with rapid user adoption.

Key Takeaways

Detailed Summary

1. AI Product Development Challenges

2. Enterprise vs. Consumer AI

3. The 60% Rule

4. Evals and Model Performance

5. Prototyping with AI

6. Future of AI: Proactivity and Asynchrony

7. Model Personality and User Interaction

8. Educating Users

Conversational Insights

Software Tools

People Mentioned

Speakers

Other Individuals

Companies Mentioned

Related Posts

Future of AI Co-Pilots for Product Managers

Product Management as we know today isn’t going to exist in 5 years

AI Leaders Reveal the Next Wave of AI Breakthroughs (At FII Miami 2025)