Section 06: Validation Experiments & Hypotheses
Lean experiments to de-risk PromptVault assumptions. Test problem-solution fit, pricing, and channels in 8 weeks for under $5K.
Hypothesis Framework
10 testable hypotheses structured as We believe... We will know... Prioritized by risk.
#1: Problem Existence 🔴 Critical
We believe that AI engineers & prompt engineers
Will actively seek prompt organization tools
If they manage 50+ prompts across models & teams
We will know this is true when 60%+ of surveyed users confirm as top-3 pain & 5%+ landing page signup
Success Metrics: >60% confirm | Next: Solution tests if pass; pivot if fail.
#2: Solution Fit 🔴 Critical
We believe that prompt engineers
Will adopt versioning & multi-model testing
If we deliver Git-like control + side-by-side results in <5 min
We will know this is true when 70%+ rate Wizard-of-Oz output "useful/very useful"
Success Metrics: >70% useful | Next: Pricing tests.
#3: Willingness to Pay 🔴 Critical
We believe that AI practitioners
Will pay $19/mo for Pro features
If we save 10+ hrs/mo on prompt chaos & testing
We will know this is true when 10+ pre-orders at $19 & 50%+ post-trial conversion
Success Metrics: 10+ pre-orders | Next: Channel tests.
#4: Version Control Value 🟡 High
We believe that team prompt engineers
Will use Git-like versioning daily
If diffs & reverts prevent "lost good prompt" issues
We will know this is true when 40%+ fake door clicks on versioning demo
#5: Multi-Model Testing 🟡 High
We believe that AI engineers
Will run 5+ model comparisons/week
If one-click side-by-side with analytics
We will know this is true when 60%+ Wizard users request repeat tests
#6: Team Collaboration 🟡 High
We believe that 10-100 person AI teams
Will upgrade to Team plan
If shared libs reduce duplication by 50%
We will know this is true when 30%+ interviewees cite team chaos
#7: Pro Pricing Sweet Spot 🟢 Medium
We believe that individual practitioners
Will pay $19/mo over $9
If unlimited features justify premium
We will know this is true when Van Westendorp shows $15-25 optimal
#8: Channel Efficacy 🟢 Medium
We believe that Reddit/Twitter AI communities
Will drive < $5 CAC signups
If targeted "Prompt Chaos?" posts
We will know this is true when LinkedIn/Reddit < $5 CAC vs Google >$10
#9: Retention Signal 🟢 Medium
We believe that Pro users
Will return weekly for tests
If analytics show ROI
We will know this is true when 30%+ week 2 repeat in cohort
#10: VS Code Extension Appeal 🟢 Medium
We believe that developers
Will install extension pre-launch
If inline prompt save from chats
We will know this is true when 20%+ landing clicks on extension CTA
Experiment Catalog
12 low-cost experiments (total ~$4K, 8 weeks). Detailed setups below.
Experiment Prioritization Matrix
8-Week Validation Sprint
Wk 1-2: Problem Valid.
| D1-3: | Launch landing + recruit |
| D4-14: | 20 interviews + 1K visitors |
Wk 3-4: Solution Valid.
| D15-21: | Analyze + Wizard setup |
| D19-28: | 15 deliveries + feedback |
Wk 5-6: Pricing/Channels
| D29-35: | Pricing survey + pre-orders |
| D36-42: | Channel ads + fake doors |
Wk 7-8: Decide
| D43-52: | Synthesis + Go/No-Go |
| D53-56: | MVP plan or pivot |
Minimum Success Criteria
Go: All Must | Cond. Go: 80% | No-Go: <70%
Pivot Triggers & Contingencies
Trigger #1: No Problem (<40%)
Pivot: General AI note-taker or dev-only tool. Action: Re-interview for adjacents.
Trigger #2: Low Fit (<50% useful)
Pivot: Simplify to solo organizer. Action: Feature heatmap from feedback.
Trigger #3: Low WTP (<$10)
Pivot: Freemium-only or enterprise. Action: Segment to teams ($49/u).
Trigger #4: High CAC (>$10)
Pivot: Product-led (ext + viral). Action: Community/SEO focus.
Experiment Documentation Template
Owner: Founder. Track in Notion/Airtable.
Total Cost: ~$4K | Timeline: 8 weeks | Decision-Ready for MVP Build.