Engineering Guide

The Engineering Guide to Prompt Version Control

Why hardcoding prompts is technical debt, and how to build a robust CI/CD pipeline for your AI features.


Use git for code. Use PromptOps for prompts.

If you are building an LLM-powered application today, you have likely faced the "Prompt Management Problem". It starts innocently enough: you have a single prompt in your code.

const prompt = `
You are a helpful assistant.
User: ${userInput}
`;

But then, requirements change. Marketing wants the tone to be "friendlier". Product wants to test GPT-4 vs GPT-3.5. Engineering needs to fix a hallucination edge case. Suddenly, your codebase looks like this:

// utils/prompts.ts
export const getBriefingPrompt = (plan: string, version = 'v2') => {
  if (plan === 'enterprise') {
    return version === 'v3' ? ENTERPRISE_PROMPT_V3 : ENTERPRISE_PROMPT_V2;
  }
  // ... 50 more lines of logic
}

This is spaghetti code. Worse, it couples your data (the prompts) to your code's lifecycle (deployment cycles). Every prompt change now requires a pull request, a code review, a CI build, and a deployment. This implementation guide walks through migrating from hardcoded strings to a robust prompt version control system.


The Three Stages of Prompt Maturity

Just as software deployment evolved from FTP uploads to Docker containers, prompt engineering is undergoing a similar shift.

Stage 1: Hardcoded Strings (The "FTP" Era)

  • Storage: String constants in source code.
  • Versioning: Relying on git history of the file.
  • Deployment: Full application redeploy.
  • Pros: Zero infrastructure, easy to start.
  • Cons: Slow iteration, no A/B testing, non-technical stakeholders are blocked.

Stage 2: Database / CMS (The "WordPress" Era)

  • Storage: SQL/NoSQL database column.
  • Versioning: Often missing, or single "latest" field.
  • Deployment: API call fetches data.
  • Pros: Decoupled from deploy.
  • Cons: "It worked on my machine" bugs. Production breaks because someone edited the prompt in the database without testing.

Stage 3: Infrastructure as Code (The "PromptOps" Era)

  • Storage: Immutable, versioned artifacts with unique hashes.
  • Versioning: SemVer or Hash-based (e.g., v1.0.4 or sha:8a2f).
  • Deployment: Promotion pipelines (Dev → Staging → Prod).
  • Pros: Instant rollback, type safety, separate lifecycle.

Architecture Comparison

graph TD
  A[Hardcoded] -->|Deploy| B(Production)
  C[Database] -->|Edit| D(Production)
  E[PromptOps] -->|Version| F(Dev)
  F -->|Test| G(Staging)
  G -->|Promote| H(Production)

Implementing Prompt Version Control

To implement Stage 3, you need a system that enforces immutability. You cannot simply "edit" a prompt; you must "fork" it into a new version.
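The fork-on-write rule can be sketched with a minimal in-memory registry. The `PromptVersion` shape and `forkVersion` helper below are hypothetical illustrations, not a specific library's API:

```typescript
// Append-only registry: "editing" a prompt always creates a new version.
interface PromptVersion {
  promptId: string;
  versionNumber: number;
  systemPrompt: string;
}

const registry: PromptVersion[] = [];

function forkVersion(promptId: string, systemPrompt: string): PromptVersion {
  const existing = registry.filter(v => v.promptId === promptId);
  const next: PromptVersion = {
    promptId,
    versionNumber: existing.length + 1, // never overwrite, always append
    systemPrompt,
  };
  registry.push(next);
  return next;
}

const v1 = forkVersion('onboarding-email', 'You are a helpful assistant.');
const v2 = forkVersion('onboarding-email', 'You are a friendlier assistant.');
// v1 is untouched; v2 is a new immutable artifact
```

Because old versions are never mutated, any environment can keep pointing at a known-good artifact while a new one is tested.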

1. The Data Model

A robust schema for prompt versioning needs to track not just the text, but the configuration.

model PromptVersion {
  id            String   @id @default(uuid())
  promptId      String
  versionNumber Int
  
  // The "Code"
  systemPrompt  String
  userTemplate  String   // e.g. "Hello {{name}}"
  
  // The "Config"
  model         String   // e.g. "gpt-4"
  temperature   Float    // e.g. 0.7
  
  createdAt     DateTime @default(now())
}
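The `userTemplate` field above uses `{{name}}`-style placeholders. A minimal renderer might look like this (a hypothetical helper; a production system should also validate that no required variables are missing):

```typescript
// Substitute {{placeholder}} tokens in a userTemplate string.
// Unknown placeholders are left intact rather than silently dropped.
function renderTemplate(template: string, vars: Record<string, string>): string {
  return template.replace(/\{\{(\w+)\}\}/g, (match, key) =>
    key in vars ? vars[key] : match
  );
}

const out = renderTemplate('Hello {{name}}, welcome to {{plan}}!', {
  name: 'Ada',
  plan: 'enterprise',
});
// out === 'Hello Ada, welcome to enterprise!'
```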

2. The Resolution Logic

Your application should never ask for "the prompt". It should ask for "the active prompt for this environment".

This prevents the classic "production breakage" scenario, where a developer changes a prompt to test a new idea and unintentionally affects live users.

// ❌ Bad: Fetching by ID
const prompt = await db.prompts.findUnique({ where: { id: '123' } });

// ✅ Good: Resolving by Context
const prompt = await promptOps.get('onboarding-email', {
  environment: process.env.NODE_ENV, // 'production'
  tags: ['marketing', 'v2']
});
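Under the hood, context-based resolution is just a pointer lookup: each environment points at exactly one version per prompt. A sketch, with invented names like `activePointers`:

```typescript
// Each environment holds its own pointer to an immutable version number.
type Env = 'development' | 'staging' | 'production';

const activePointers: Record<string, Partial<Record<Env, number>>> = {
  'onboarding-email': { development: 3, staging: 2, production: 2 },
};

function resolveVersion(promptId: string, env: Env): number {
  const pointer = activePointers[promptId]?.[env];
  if (pointer === undefined) {
    throw new Error(`No active version of ${promptId} in ${env}`);
  }
  return pointer;
}
```

Promoting a version from staging to production is then a one-field update of the pointer table, with no change to the version artifacts themselves.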

Rollbacks and Reliability

One of the biggest advantages of treating prompts as infrastructure is Instant Rollbacks.

In a traditional code-based workflow, if a prompt causes a regression (e.g., the bot starts being rude), you have to:

  1. Revert the git commit.
  2. Wait for CI tests to pass.
  3. Wait for the build container.
  4. Wait for deployment.

This can take 15-30 minutes. With a PromptOps approach, you simply update the active_version pointer in your registry. The rollback takes milliseconds.
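A rollback in this model is a pointer flip, not a redeploy. The registry shape below is a hypothetical sketch of that idea:

```typescript
// Rollback = repoint the active version to the previous one.
const active: Record<string, number> = { 'onboarding-email': 5 };
const history: Record<string, number[]> = { 'onboarding-email': [1, 2, 3, 4, 5] };

function rollback(promptId: string): number {
  const versions = history[promptId];
  const current = active[promptId];
  const idx = versions.indexOf(current);
  if (idx <= 0) throw new Error(`Nothing to roll back to for ${promptId}`);
  active[promptId] = versions[idx - 1]; // O(1) pointer flip, no build step
  return active[promptId];
}
```

Because every historical version is still stored as an immutable artifact, rolling forward again is just as cheap.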

Testing Strategies (CI/CD)

Once prompts are versioned, you can attach test suites to them. This is often called "Evaluation" or "Evals".

  • Deterministic Tests: Assert that the output contains specific JSON keys.
  • Semantic Tests: Use an LLM to grade the output (e.g., "Is this response polite?").
  • Regression Tests: Compare the new version's output against a "Golden Dataset" of previous inputs.
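The deterministic case is the simplest to automate. A minimal check that a model reply parses as JSON and contains the required keys (the reply string here is a stand-in for an actual LLM response):

```typescript
// Deterministic eval: does the output parse as JSON and contain the keys?
function checkJsonKeys(output: string, requiredKeys: string[]): boolean {
  try {
    const parsed = JSON.parse(output);
    return requiredKeys.every(key => key in parsed);
  } catch {
    return false; // not valid JSON at all
  }
}

const reply = '{"subject": "Welcome!", "body": "Hi Ada"}';
// checkJsonKeys(reply, ['subject', 'body']) passes;
// a reply missing a key, or non-JSON text, fails the eval.
```

Attaching a suite of such checks to each prompt version lets CI block a promotion the same way failing unit tests block a merge.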

Ready to upgrade your infrastructure?

PromptOps provides this entire architecture out of the box. Secure, type-safe, and built for high-scale production teams.

Join the Community

Connect with AI engineers building the future of prompt infrastructure.


Questions? Reach us at support@thepromptspace.com

Built by ThePromptSpace