How Stripe Ships 1,300 AI-Written Pull Requests Every Week with ‘Minions’

Imagine a world where a massive army of digital robots writes thousands of computer code updates every single week. This is no longer science fiction; it is exactly how Stripe, the global payments giant, manages its massive infrastructure. By using a specialized system called “Minions,” Stripe has turned AI coding from a simple experiment into a production powerhouse. Let us explore the technical architecture that allows them to maintain high quality while moving at lightning speed.

Stripe’s codebase is remarkably complex, consisting of hundreds of millions of lines of code. Interestingly, their backend is written in Ruby—specifically with Sorbet for type-checking—which is a unique environment for modern Large Language Models (LLMs) to navigate. Furthermore, they use many internal libraries that general AI models like ChatGPT or Claude were never trained on. To bridge this gap, Stripe built the Minions framework to handle “one-shot” tasks, meaning the AI can complete a task in one go without constant human supervision.

The core secret behind Stripe’s success is a hybrid approach called a “Deterministic-Agentic Workflow.” In simple terms, this means they combine the creative reasoning of an AI (the Agentic part) with rigid, unchangeable rules (the Deterministic part). In many standard AI tools, the AI is allowed to decide everything, which often leads to mistakes. At Stripe, however, the system is the boss. The system dictates a strict path, and the AI is merely a worker following that path. If the AI’s code fails a test, the system forces it to go back and fix it before a human ever sees the work.

To make this work, Stripe uses “Blueprints.” A Blueprint is essentially a roadmap for a specific coding task. It contains different types of “nodes” or steps. Some steps are “Agent Nodes,” where the AI uses its brain to solve a problem, like writing a new feature. Other steps are “Deterministic Nodes,” which are automated checks that never change, such as linting (checking for grammar errors in code) or running unit tests. By interleaving these steps, Stripe ensures that the AI cannot skip important safety checks.

A critical part of this architecture is the “ToolShed.” This is an internal server based on the Model Context Protocol (MCP). The ToolShed provides the Minions with over 500 specialized tools, such as the ability to search internal documentation, check build statuses, or look at previous bug reports. However, giving an AI 500 tools at once would be overwhelming. Therefore, the Minions workflow includes a “Context Curation” step. This step automatically picks only the most relevant 10 to 15 tools the AI needs for a specific task, keeping the AI focused and efficient.

Security is another major concern when letting AI write code. Stripe handles this by using “DevBoxes.” Every time a Minion starts a task, the system spins up a fresh, isolated AWS EC2 instance. This follows the “Cattle, not Pets” philosophy in DevOps. These instances are temporary and disposable; they are created for one job and destroyed immediately afterward. This ensures that even if an AI makes a catastrophic error, it happens in a safe, isolated box where it cannot damage the main system.

The typical workflow follows a pattern often called the “PIV Loop”: Planning, Implementation, and Validation. First, an Agentic node creates a plan based on a GitHub issue. Second, a human or a secondary system validates that plan. Third, the implementation node writes the actual code in a fresh context window to keep things clean. Finally, the validation node runs a battery of tests—Stripe has over 3 million tests in their suite—to ensure everything works perfectly. Only after all these hurdles are cleared does a human engineer perform a final code review.

While Stripe’s “Minions” system is private, other companies are following similar paths. Shopify, for example, has open-sourced a tool called “Roast,” which allows developers to build these same types of structured AI workflows using Ruby. Whether you are a professional engineer or just starting to learn about IT, understanding how to control AI with deterministic gates is the most important skill in the new era of software development. By building systems that supervise the AI, we can achieve levels of productivity that were previously impossible.

If you are interested in exploring this yourself, I recommend starting with smaller, open-source frameworks to understand the logic of state machines in AI. Learning how to connect an LLM to a terminal or a testing suite via MCP is a fantastic project for any aspiring developer. The future of coding is not just about writing lines of text; it is about building the systems that manage the robots who do the heavy lifting for us.