Open-Source Infrastructure Is Eating the Cloud: The Post-Vendor-Lock-In Era
Something broke in the relationship between enterprises and their cloud vendors. Not all at once—more like a slow fracture that finally gave way in 2025.
When IBM closed its $6.4 billion acquisition of HashiCorp in February 2025, it set off a chain reaction nobody fully anticipated. Terraform users—many already rattled by the 2023 license switch from MPL 2.0 to BSL—suddenly found themselves locked into tooling owned by a company with a mixed track record on acquisitions. The free tier vanished. Pricing went opaque. And according to a Spacelift survey from Q4 2024, 38% of Terraform users were already evaluating or actively migrating to OpenTofu.
That number has only grown since. By April 2026, OpenTofu had reached roughly 12% adoption among IaC practitioners, with another 27% of teams planning to evaluate or expand its use. These aren’t hobbyists running side projects. We’re talking about production workloads at scale.
The infrastructure layer is being rewritten. And for the first time in a decade, the rewrite favors openness over convenience.
The License Change That Launched a Thousand Forks
Let’s rewind. In August 2023, HashiCorp switched Terraform’s license from the permissive Mozilla Public License 2.0 to the Business Source License. The stated reason: preventing cloud providers from offering competing managed services built on HashiCorp’s code without contributing back.
The community response was immediate and brutal. Within weeks, the Linux Foundation announced OpenTofu—a fully open fork. IBM, Oracle, Spacelift, Gruntwork, and dozens of smaller players pledged support. The project hit 18,000 GitHub stars in its first month. Contributors poured in faster than Terraform had accumulated in five years.
What made OpenTofu different from the usual angry fork? Governance. The Linux Foundation backing meant enterprise legal teams could sign off without months of license review. No more agonizing over whether a permissive license might change tomorrow—the foundation structure makes unilateral rug-pulls structurally impossible. The project adopted a public RFC process, transparent roadmap, and commitment to backward compatibility. By version 1.8, OpenTofu had achieved full compatibility with Terraform 1.5 while adding features Terraform lacked: state encryption, OCI registry support, dynamic provider registration.
The release cadence tells the story. OpenTofu ships faster than Terraform ever did under HashiCorp. The 1.12 release cycle introduced client-side state encryption that Terraform still doesn’t offer—a feature enterprise security teams had requested for years. When your open-source fork outpaces the original on enterprise features, something structural has shifted.
Harness, Scalr, env0, and Spacelift all shipped native OpenTofu support. The migration tooling matured to the point where 90% of existing Terraform code runs on OpenTofu with zero modifications. The common adoption blocker, as Scalr’s documentation notes, is “perceived migration effort, not technical risk.”
Data Pipelines: Where the Money Talks Loudest
The IaC story gets all the headlines, but the quieter revolution is happening in data integration. Airbyte—an open-source ELT platform launched in 2020—now offers over 600 connectors and has become the default choice for teams that want control over their data pipelines.
The economics are stark. Fivetran’s pricing model charges per Monthly Active Row (MAR). At enterprise scale, that translates to roughly $800–$1,200 per million MAR. For a company processing 100 million rows monthly, that’s $25,000–$40,000 a month—$300K–$480K annually—just for data movement. Self-hosted Airbyte? Infrastructure costs of maybe $2,000–$5,000/month for the same volume.
But I’d argue the cost savings aren’t even the primary driver. Control is.
A mid-size fintech I spoke with discovered their SaaS data integration tool was silently sampling API responses during peak loads—reducing data freshness from the promised 5 minutes to over 15 minutes without any alert. They couldn’t debug it because the connector code was proprietary. After migrating to Airbyte, their engineering team wrote a custom connector in three weeks that achieved 90-second freshness. That kind of customization simply doesn’t exist in closed-source tooling.
The data sovereignty angle matters too. Under GDPR, CCPA, and an expanding web of privacy regulations, routing sensitive customer data through a third-party SaaS provider creates compliance surface area that self-hosted solutions eliminate entirely. Financial services and healthcare organizations have been leading this charge—not because they love running infrastructure, but because their regulators increasingly demand it.
The Elephant: Vendor Lock-In Was Never About APIs
Here’s what most cloud migration guides won’t tell you: modern vendor lock-in doesn’t look like proprietary APIs anymore. The hyperscalers learned that lesson. AWS, Azure, and GCP all offer relatively standard interfaces for compute, storage, and networking. The lock-in moved downstream.
It’s in data gravity. Moving a 500TB data lake out of S3 costs real money—AWS charges $0.09/GB for egress in most regions. That’s $45,000 just for the transfer, before you account for the engineering time, the revalidation, the downstream service rewiring. For petabyte-scale operations, egress costs alone can reach six figures.
It’s in managed service coupling. You can run Kubernetes anywhere, sure. But if your entire observability stack depends on CloudWatch, your ML pipelines run on SageMaker, and your event bus is EventBridge, “migrating Kubernetes” means rebuilding half your architecture.
It’s in billing complexity. GCP’s BigQuery pricing uses a black-box algorithm that makes monthly costs swing 30–40% with no change in query patterns. AWS reserved instance pricing requires a finance team to optimize. Azure adjusts virtual network pricing seemingly every quarter. The cognitive overhead of understanding your bill becomes its own switching cost—you’ve invested so much in learning one provider’s pricing model that starting over feels irrational.
Netflix, despite being AWS’s poster child, maintains internal tooling that would allow partial cloud migration as a hedge. Shopify runs a multi-cloud strategy explicitly to avoid single-provider dependency. Airbnb built its own orchestration layer that abstracts away cloud-specific APIs. These aren’t edge cases—they’re sophisticated organizations that did the math on lock-in risk and decided the insurance premium was worth paying.
The pattern repeats at smaller scale too. A Series B startup I know chose OpenTofu from day one specifically because their CTO had lived through a Terraform Cloud pricing surprise at a previous company. “I’m not building on someone else’s pricing whims again,” she told me. That sentiment—burned once, never again—is spreading fast.
The Honest Cost of Going Open
I want to be clear-eyed about this. Romanticizing open-source infrastructure is dangerous.
A 1,200-resource enterprise migration from Terraform Cloud to self-hosted OpenTofu doesn’t take two weeks. The IaC code migration might—that part is genuinely smooth. But the surrounding ecosystem takes months. CI/CD pipelines need reconfiguring. Policy-as-code tools like Sentinel don’t have direct OpenTofu equivalents (you’ll need OPA or custom solutions). Cost estimation integrations break. Security scanning workflows need rebuilding.
One DevOps lead at a mid-market SaaS company told me their team saved $40K/month on Fivetran licensing after moving to Airbyte—but hired 1.5 additional SREs to handle the operational burden. Self-hosting means owning database management, monitoring, backup, upgrades, and security patches yourself. For the first two years, they actually spent more.
Community support is real but unpredictable. OpenTofu’s Slack is responsive for common issues. Complex edge cases might wait days. Terraform’s enterprise support SLA was 4 hours. When production is burning, that gap matters.
And there’s the accountability shift. When Fivetran has a data breach, you have a vendor to hold responsible—contractually and legally. When your self-hosted Airbyte instance leaks data, the board looks at you.
The Hybrid Reality
The market isn’t moving to “pure open-source everything.” It’s moving to open-core with commercial wrappers.
Airbyte’s business model is instructive: the open-source version is feature-complete. The cloud-hosted version adds auto-scaling, multi-region deployment, and 24/7 support. Most enterprises choose the hosted version—not because the open-source edition lacks features, but because they want someone else to handle 3 AM pages. Revenue reportedly crossed $40M in 2025, with 70% from cloud hosting and 25% from implementation consulting.
OpenTofu is following a similar trajectory. The core remains fully open under the Linux Foundation. Commercial platforms (Spacelift, Scalr, env0) provide the management layer, policy engines, and enterprise integrations that large organizations need. The model works because it separates the infrastructure standard (open, community-governed) from the operational convenience (commercial, SLA-backed).
Grafana Labs proved this model at scale—$300M+ ARR while keeping Grafana, Loki, and Tempo fully open-source. Temporal, Materialize, and Neon are running the same playbook. The pattern is clear: open-source the core to build trust and adoption, monetize the operational layer.
Regulation Is Coming—And It Favors Openness
The EU Data Act’s core provisions took effect in September 2025. By September 2026, enhanced interoperability requirements for cloud services become mandatory. Cloud providers operating in the EU must offer standardized switching mechanisms, fair contractual terms for migration, and cannot charge egress fees that function as de facto exit penalties.
This isn’t theoretical anymore. It’s law. And it structurally advantages open-source infrastructure because portability-by-design is already baked into projects like OpenTofu and Airbyte—they don’t need to retrofit it.
The US is moving slower but in the same direction. The FTC has been examining cloud market concentration since 2023. The UK’s Competition and Markets Authority cleared the IBM-HashiCorp deal but flagged cloud infrastructure market dynamics for ongoing review.
For CTOs, the calculation is changing. Open-source infrastructure isn’t just a cost optimization play—it’s becoming a compliance strategy. When regulators ask “can you switch providers within 30 days?” you want to answer yes without crossing your fingers.
The AI Wild Card
Here’s where I’m genuinely uncertain about what comes next.
Infrastructure is starting to embed AI—intelligent scheduling, predictive autoscaling, automated cost optimization. These capabilities require massive training data (telemetry from millions of deployments) and significant compute for model training. That’s a resource advantage the hyperscalers have and open-source communities don’t.
The next generation of lock-in might not be in code at all. It might be in model weights. When AWS offers an AI-powered infrastructure optimizer trained on data from millions of customer workloads, can an open-source equivalent match it? The code can be open, but the training data and compute required to build competitive models create a new kind of moat.
Maybe the response will be federated learning approaches—open-source infrastructure projects pooling anonymized telemetry to train shared models. Maybe it’ll be specialized models that work well enough on smaller datasets. Or maybe AI-powered infrastructure management becomes the new premium tier that keeps enterprises paying hyperscaler prices despite having open-source alternatives for everything else.
What Actually Changed
The shift isn’t really about OpenTofu versus Terraform, or Airbyte versus Fivetran. Those are symptoms.
What changed is the question CTOs ask. Five years ago, the default was “why would we build this ourselves?” Today it’s “why can’t we own this ourselves?” That inversion—from assumed dependency to assumed sovereignty—reshapes every infrastructure decision downstream.
Open-source infrastructure won’t “eat” the cloud. The hyperscalers aren’t going anywhere. But the relationship has fundamentally rebalanced. Vendors now operate under the constant awareness that their customers have viable alternatives. Pricing can’t drift unchecked. License changes have consequences. Lock-in strategies get scrutinized.
The post-vendor-lock-in era doesn’t mean zero vendor relationships. It means every vendor relationship includes an exit plan. It means infrastructure standards are governed by communities, not corporations. It means the code that runs your business belongs to you—even if you pay someone else to operate it.
That’s not a revolution. It’s a correction. And it was a long time coming.
Two years from now, we’ll look back at 2025–2026 as the inflection point. Not because any single tool won or lost, but because the default assumption flipped. Infrastructure sovereignty stopped being a luxury for FAANG-scale engineering teams and became table stakes for any serious technical organization. The tools caught up to the ambition. The governance models matured. The business models proved sustainable.
Who controls the tools you build the future with? That question used to have an obvious answer: whoever sold you the license. Now the answer is being written in commits, RFCs, and foundation charters. It’s messier. It’s slower. But it’s yours.



