002 · Own product · 2025
Git-style prompt versioning with branches, eval runs, and deploy-on-merge. Brought 800+ prompts under version control across 9 active engineering teams.
Timeline
6 weeks
Role
Product / engineering
Prompts versioned
800+
Active teams
9 teams
The problem
The pattern is everywhere: prompts live in spreadsheets, Notion docs, or worse — hardcoded in application config. When something breaks in production, there's no diff to show what changed. When a prompt update regresses a hot path, the team has no baseline to roll back to.
The result: every prompt deploy is a manual, anxiety-producing process. Engineers copy-paste from a doc into production code during a deployment. There's no eval run before the change goes live. There's no history.
The approach
WizPrompt gives prompts the same lifecycle discipline as code. Every prompt lives in a versioned branch. Changes go through a PR process with side-by-side eval runs before merging.
Deploy-on-merge means the production prompt is always the merged version — no manual copy-paste, no drift between the source-of-truth doc and what's actually running. Integrates via SDK so existing pipelines don't need to change their calling convention.
What we learned
Getting prompts under version control is straightforward engineering. The hard part is writing evals that are actually predictive. Most teams don't have golden test sets — they have vague feelings about whether the model is "doing the right thing."
WizPrompt ships with eval scaffolding that makes it easy to define golden input/output pairs and correctness criteria. The first time a team runs a structured eval on a prompt they've been using in production for months, they usually find something surprising. That's the product doing its job.
Multi-model comparison also surfaces a second insight: prompt quality is model-specific. A prompt tuned for Claude often degrades on GPT-4o. The side-by-side view makes this visible before it becomes a production incident.
Result
Across 9 active engineering teams, WizPrompt brought all production prompts under version control with full diff history and pre-deploy eval coverage. Zero manual copy-paste deploys since rollout.
The product is live at wizprompt.pro. Multi-provider routing and shared eval libraries are shipping in Q3 2026.
Stack
We take a small number of client engagements each year. Real problems only.