Datadog is everywhere. Every observability conversation ends up there. But here’s the thing: most teams don’t need to spend $10k/month to see their logs and traces.
I’ve watched engineering teams get sticker shock when their Datadog bill jumps from $2k to $15k after adding a few microservices. The pricing model is complex—you’re paying per host, per custom metric, per GB of logs, per APM trace. What starts as a reasonable $500/month pilot quickly spirals when you actually use the product.
Look, Datadog is powerful. But it’s optimized for massive enterprises with dedicated observability budgets. If you’re a 20-person startup or a mid-size team trying to keep infrastructure costs under control, you need something different.
Why Teams Are Leaving Datadog
The pricing pain is real. A typical mid-size team running 50 hosts with moderate logging and APM can easily hit $8k-12k monthly. Scale that to 100+ hosts and you’re looking at $20k-50k. For many companies, that’s a full engineer’s salary going to monitoring.
The billing surprises are worse than the base cost. You add a few custom metrics to track business KPIs, suddenly you’re paying an extra $2k/month. You increase log retention from 15 to 30 days, there’s another $3k. The pricing calculator requires a PhD to understand, and your actual bill rarely matches the estimate.
Beyond cost, there’s the vendor lock-in problem. Datadog’s proprietary agent and data formats make it hard to migrate once you’re deep in. You’re essentially betting your entire observability stack on one vendor’s roadmap and pricing decisions. When they decide to sunset a feature or change pricing models (as they’ve done multiple times), you have limited options.
The complexity is another issue. Datadog tries to do everything—infrastructure monitoring, APM, logs, security, RUM, synthetics, CI/CD observability. Most teams only need 2-3 of these features but end up paying for the whole suite. The UI has gotten bloated over the years. Finding the right dashboard or setting up a simple alert now requires navigating through multiple nested menus.
Performance at scale is also a concern. I’ve seen teams with high-cardinality metrics hit query timeouts or slow dashboard loads. Datadog’s architecture handles standard use cases well, but push it with millions of unique metric combinations and you’ll feel the constraints.
Bottom line: if you’re spending more than 5% of your infrastructure budget on observability, or if your team spends more time managing Datadog than using it, you should explore alternatives. The switching cost is real but the long-term savings and flexibility make it worthwhile.
1. SigNoz: OpenTelemetry-Native and Cost-Effective
SigNoz is the open-source alternative that’s actually production-ready. Built from the ground up around OpenTelemetry, it gives you unified observability without vendor lock-in.
The architecture is clean: metrics, traces, and logs all flow through OpenTelemetry collectors. Everything is stored in ClickHouse, which means fast queries and lower storage costs. The UI is modern and responsive—you won’t miss Datadog’s interface.
What makes SigNoz compelling is the deployment flexibility. Self-host it on your own infrastructure for free, or use SigNoz Cloud starting at $199/month for the managed service. The cloud tier includes automatic scaling, managed upgrades, and dedicated support. For teams that don’t want to run another stateful service, the cloud option is a no-brainer.
Strengths:
- Native OpenTelemetry support means no vendor lock-in
- ClickHouse backend delivers fast queries at scale
- Self-hosted option gives complete cost control
- Active open-source community (10k+ GitHub stars)
- Clean, modern UI that doesn’t feel like an afterthought
Pricing:
Self-hosted is completely free. SigNoz Cloud starts at $199/month for up to 1M spans and 10GB logs. That’s roughly 10x cheaper than equivalent Datadog usage. They also offer transparent pricing calculators so you won’t get surprise bills.
Best for:
Teams that want to avoid vendor lock-in and are already using or planning to use OpenTelemetry. Perfect for engineering-led companies that value open standards and transparent pricing.
2. Grafana Cloud: The Open-Source Ecosystem Play
Grafana Cloud bundles the entire Grafana observability stack—Grafana for dashboards, Prometheus for metrics, Loki for logs, Tempo for traces, and Pyroscope for profiling. If you’re already using any of these tools, Grafana Cloud is the natural evolution.
The power here is integration depth. Everything talks to everything else natively. Correlating a spike in error logs with a deployment or a trace with high latency is seamless. The query language is consistent across telemetry types, so your team learns one tool instead of five.
Grafana’s approach is different from Datadog’s all-in-one platform. Instead of building everything themselves, they acquired best-of-breed tools and integrated them. Prometheus is the industry standard for metrics. Loki is proven for log aggregation. Tempo handles traces efficiently. You get mature, battle-tested components instead of a monolithic platform.
Strengths:
- Best-in-class visualization and dashboarding
- Deep ecosystem with thousands of plugins and integrations
- Can mix self-hosted and cloud components flexibly
- Strong community support (Grafana has 65k+ GitHub stars)
- Unified query experience across metrics, logs, and traces
Pricing:
Free tier includes 10k metrics, 50GB logs, 50GB traces. Paid plans start around $50/month and scale based on usage. The key advantage: you only pay for what you use, not per-host or per-service.
Best for:
Teams already invested in the Prometheus/Grafana ecosystem. Also great for platform teams that want flexibility to self-host some components while using managed services for others.
3. New Relic: The All-In-One APM Giant
New Relic has been doing APM since before Datadog existed. They’ve pivoted hard into modern observability with generous free tiers and simplified pricing.
The platform covers everything: APM, infrastructure monitoring, logs, distributed tracing, real user monitoring, and synthetic checks. Unlike Datadog’s fragmented pricing, New Relic uses a simple model: you get 100GB of data ingest per month free, then pay per GB after that.
Where New Relic shines is application performance monitoring. The transaction tracing is mature and detailed. Error tracking and alerting are robust. The mobile and browser RUM tools are particularly strong if you’re monitoring user-facing applications.
Strengths:
- Generous free tier (100GB/month) great for small to mid-size teams
- Mature APM capabilities with deep language support
- Simple, predictable pricing model (no per-host charges)
- Strong AI-assisted anomaly detection
- Excellent mobile and browser monitoring
Pricing:
Free up to 100GB data ingest and 1 full platform user. Standard tier starts at $99/month per user with 100GB included, then $0.30-0.50 per additional GB. For a typical team, this works out significantly cheaper than Datadog.
Best for:
Teams that need comprehensive APM and don’t want to cobble together multiple tools. The free tier makes it perfect for startups and small teams who want enterprise features without the enterprise price tag.
4. Better Stack: Modern UX for Modern Teams
Better Stack (formerly Logtail) is the new kid on the block, and they’re focused on developer experience first. The UI is fast, clean, and actually enjoyable to use. No nested menus or cryptic query languages.
The platform combines logging, uptime monitoring, and incident management. What sets it apart is how these features work together. When an alert fires, you immediately see relevant logs. When you’re debugging an incident, the timeline shows deployments, config changes, and metric anomalies all in one view.
Better Stack is opinionated about workflows. They don’t try to support every possible use case. Instead, they optimize for the 80% of observability tasks that teams do every day: checking logs, investigating incidents, and keeping services up.
Strengths:
- Fastest, cleanest UI among all these alternatives
- Affordable pricing that scales predictably
- Integrated incident management (no need for separate PagerDuty)
- SQL-like query language that’s actually readable
- Great for teams tired of complex, enterprise-focused tools
Pricing:
Starts at $20/month for 5GB logs with 15-day retention. Team plan at $100/month includes 30GB and advanced features. Enterprise plans start around $200/month. Far more transparent than Datadog’s pricing calculator.
Best for:
Startups and scale-ups that want a modern observability tool without the complexity. Perfect if you’re a 5-50 person engineering team that values developer experience and doesn’t need every possible feature.
5. OpenObserve: The Lightweight Challenger
OpenObserve is purpose-built for one thing: handling massive log volumes at a fraction of the cost. Built on ClickHouse and designed for efficiency, it claims 140x lower storage costs than Elasticsearch-based solutions.
The architecture is deliberately simple. No complex data pipelines, no separate components for ingestion and querying. Everything runs in a single binary. You can deploy it on a single server or scale horizontally as needed.
OpenObserve supports OpenTelemetry for logs, metrics, and traces. The query interface is straightforward—you can use SQL or a simplified syntax. Dashboards are functional if not fancy. This isn’t trying to be Datadog; it’s trying to be the most cost-effective way to store and query telemetry data.
Strengths:
- Exceptionally low storage costs (claimed 140x cheaper than Elastic)
- Single binary deployment (no complex setup)
- Built-in OpenTelemetry support
- Fast queries even with billions of records
- Self-hosted or cloud options available
Pricing:
Self-hosted is free and open-source. OpenObserve Cloud starts at $49/month for 200GB ingestion with 30-day retention. That’s about 5-10x cheaper than equivalent capacity on Datadog or Splunk.
Best for:
Teams with high log volumes that need cost-effective long-term retention. Perfect for platform teams supporting hundreds of services where log storage costs are eating the budget.
Quick Comparison
Here’s how these alternatives stack up:
| Tool | Pricing Start | Open Source | Best For |
|---|---|---|---|
| SigNoz |
Free (self-hosted) / $199/mo (cloud) | Yes | Teams wanting OpenTelemetry-native |
| Grafana Cloud | Free tier → $50+/mo | Yes | Teams already using Prometheus/Loki |
| New Relic | Free 100GB/mo → $99+/mo | No | Teams needing all-in-one APM |
| Better Stack | $20/mo → $200+/mo | No | Startups wanting simple UX |
| OpenObserve | Free (self-hosted) / $49/mo (cloud) | Yes | Cost-conscious teams with high log volume |
My Recommendation: Start With SigNoz or Grafana Cloud
If I were setting up observability for a new project today, I’d start with SigNoz Cloud. The OpenTelemetry-native approach means you’re not locked in, the pricing is transparent, and the feature set covers 90% of what teams actually need. The $199/month starting point is reasonable, and you can always self-host later if you want more control.
What makes SigNoz particularly compelling right now is momentum. The project is actively developed, the community is engaged, and they’re shipping features quickly. The team behind it has real observability experience and they’re solving actual pain points, not just building feature parity with Datadog.
For teams already running Prometheus or using Grafana for dashboards, Grafana Cloud is the obvious choice. You’re already familiar with the tools, and the cloud offering removes the operational burden of running Prometheus, Loki, and Tempo yourself. The migration path is straightforward since you’re likely already sending metrics to Prometheus—just point them at Grafana Cloud instead.
If you need battle-tested APM with minimal setup, New Relic’s free tier is generous enough to get started and prove value before spending a dollar. The simplified pricing model (per GB instead of per everything) makes budgeting much easier. New Relic has also improved significantly in the last two years—the UI redesign and the focus on developer experience show they’re listening to feedback.
Better Stack is ideal if you’re a small team (under 20 engineers) that values polish and simplicity. You’ll get up and running faster than any other option here. The integrated incident management is a bonus—one less tool to manage and one less vendor to pay.
OpenObserve is the specialist pick. If log storage costs are killing you or you need to retain logs for compliance, it’s unmatched. But for general observability, the other options are more complete. Consider OpenObserve as part of your stack rather than your entire observability solution.
Making the Switch
Here’s what you shouldn’t do: don’t stay on Datadog out of inertia. The switching cost is real but manageable, especially if you use OpenTelemetry. Most teams I’ve talked to recoup their migration investment within 3-6 months just from reduced monitoring bills.
Start small. Pick one service or one team and migrate them first. Prove out the alternative, understand the gaps, train your team on the new tools. Then roll it out gradually. You don’t need to do a big-bang migration.
The observability market is more competitive than ever. You have real choices that deliver 80% of Datadog’s capabilities at 20% of the cost. The tools above are production-ready, actively maintained, and backed by real companies or strong open-source communities.
Your specific choice depends on your situation. Evaluate based on your team size, technical sophistication, budget constraints, and existing tooling. Try the free tiers or self-hosted versions. Run a proof of concept for a month.
Pick the one that aligns with your team size, budget, and technical preferences. Your CFO will thank you, and your engineers will appreciate the simplified tooling and transparent pricing.


