Datadog Review 2026: The Full-Stack Observability Platform That Does Everything (But Charges Accordingly)
Datadog is the observability platform that engineering teams either swear by or swear at. This review cuts through the sales deck, assessing infrastructure monitoring, APM, log management, security, and the Bits AI assistant.
The Honest Take
Datadog is the observability platform that engineering teams either swear by or swear at — often within the same sprint. It is genuinely one of the most comprehensive monitoring, logging, and security platforms available today, and it is also one of the most expensive ways to gain visibility into your infrastructure. The question is not whether Datadog works. It does. The question is whether it works for *your* organisation at *your* scale and *your* budget — and whether you have the operational maturity to extract full value from it.
This review cuts through the sales deck. We have assessed Datadog across infrastructure monitoring, application performance management, log management, security, and its AI-powered capabilities including the Bits AI assistant. Here is what you actually need to know.
What Datadog Does
Launched in 2010, Datadog grew from a straightforward infrastructure metrics tool into a sprawling platform covering virtually every observability surface. In 2026, the product spans:
- Infrastructure monitoring — real-time metrics from servers, containers, Kubernetes, cloud services, and network devices
- APM and distributed tracing — end-to-end request tracing across microservices, with flame graphs and service dependency maps
- Log management — centralised ingestion, parsing, and search for logs across your entire estate
- Synthetic monitoring — scripted tests simulating user journeys to catch issues before real users do
- Real User Monitoring (RUM) — frontend performance tracking with session replay
- Cloud SIEM and security — threat detection, compliance monitoring, and cloud security posture management
- Database monitoring — query-level insights across PostgreSQL, MySQL, MongoDB, and others
- Bits AI — the platform's AI assistant for natural language querying, automated incident summaries, and anomaly detection
The breadth is genuinely impressive. For organisations that want a single pane of glass across their entire technology estate, Datadog is one of the few platforms that can credibly deliver it.
Infrastructure Monitoring: Still Best-in-Class
Datadog's infrastructure monitoring remains the product's bedrock. The agent-based collection is lightweight, the integrations library covers over 700 technologies, and the dashboarding experience is fast and flexible.
What sets it apart in 2026 is the contextual correlation. When a server spikes in CPU, Datadog does not just show you the spike — it surfaces the correlated application traces, the relevant log lines, and any recent deployments that might explain the change. This correlation across signal types dramatically reduces mean time to resolution.
Kubernetes support is particularly strong. The Kubernetes Explorer gives a visual, hierarchical view of your cluster health, pod-level metrics, and resource consumption — useful both for platform engineers managing the infrastructure and for developers trying to understand why their service is slow.
The host map and service map features remain genuinely useful for organisations running complex distributed systems. You can spot anomalies visually before you see them in alerting.
APM and Distributed Tracing
Datadog's APM is among the best in the market. Automatic instrumentation via the Datadog tracer libraries means you can get traces flowing with minimal code changes — important for teams that cannot afford weeks of instrumentation work.
The flame graph visualisation is clean and informative. The service catalogue feature, introduced a couple of years back and significantly improved in 2026, gives teams a centralised registry of all services with ownership, SLOs, and incident history baked in. This matters enormously as organisations scale beyond a handful of microservices.
Continuous Profiler is worth calling out specifically. It adds always-on code-level profiling that correlates directly with traces, so when you see a slow trace you can drill into the exact lines of code consuming CPU or memory. This is powerful — and the performance overhead in production is genuinely low.
The main friction: APM pricing is per host, and if you have a sprawling microservices architecture across many containers, costs escalate fast.
Log Management
Datadog Logs is functional and well-integrated with the rest of the platform. Log ingestion, parsing with Grok patterns, and search are all solid. The real value comes from pivot-ability: you can go from a log line straight to the correlated trace, metric, or RUM session.
However, log management is where Datadog's pricing model bites hardest. You pay per ingested gigabyte, and then again for retention beyond 15 days. For log-heavy applications — particularly those in regulated industries generating compliance logs — this can make Datadog logs prohibitively expensive. Many organisations end up routing only a subset of logs to Datadog and archiving the rest to S3, which somewhat undermines the unified observability promise.
The Sensitive Data Scanner is a useful addition for compliance-conscious teams, automatically detecting and redacting PII in log streams.
Security: CSPM, SIEM, and ASM
Datadog's security suite has matured significantly. Cloud Security Posture Management (CSPM) checks your cloud configuration against CIS benchmarks and custom compliance frameworks. Cloud Workload Security (CWS) provides runtime threat detection at the kernel level. Application Security Management (ASM) integrates into APM to detect and block attacks like SQL injection and SSRF in real time.
Cloud SIEM sits on top of the logs pipeline and offers detection rules (both out-of-the-box and custom) for identifying threats across your environment. The integration with the rest of the Datadog platform means you can correlate a security signal with infrastructure metrics and traces in a way that standalone SIEMs cannot.
That said, if security is your primary use case, dedicated SIEMs like Splunk or Elastic Security will offer more depth. Datadog's security features are excellent as *additions* to an observability-led workflow, but they are not the first choice for a pure security operations team.
Bits AI: Genuinely Useful, Not Gimmicky
Bits AI is Datadog's AI assistant, and in 2026 it has moved well beyond the chatbot-bolted-on-the-side stage. The most practically useful features:
Natural language queries — Ask "show me the services with the highest error rate over the last hour" and get a sensible dashboard query generated. Not revolutionary, but it meaningfully reduces the time engineers spend fighting the query language.
Incident summarisation — When an alert fires, Bits AI can summarise the contributing signals, recent changes, and suggested next steps. In a 3am incident, this is genuinely valuable.
Watchdog AI — Datadog's longer-standing anomaly detection engine automatically identifies unusual patterns in metrics, traces, and logs without requiring manual threshold configuration. The signal-to-noise ratio has improved markedly; false positive rates in 2025-2026 are substantially lower than they were two or three years ago.
AI-assisted dashboarding — Describe the dashboard you want and get a starting point generated. Still requires refinement, but useful for getting teams up to speed quickly.
Pricing
Datadog's pricing is modular, per-product, and notoriously complex. The headline figures for 2026:
| Product | Starting Price |
|---|---|
| Infrastructure (Pro) | ~$18/host/month |
| APM | ~$31/host/month |
| Log Management | ~$0.10/GB ingested + retention fees |
| Synthetic Monitoring | From $5/10k test runs |
| RUM | ~$1.50/1,000 sessions |
| Cloud SIEM | ~$0.20/GB analysed |
| Database Monitoring | ~$70/database host/month |
The free tier covers up to 5 hosts with a 1-day metric retention window — sufficient for evaluation, not for production.
The critical point: most real-world deployments use multiple products, and the costs compound. A mid-size engineering organisation running Infrastructure + APM + Logs across 50 hosts is looking at £15,000–£30,000+ per year before volume discounts. Enterprise contracts (required at scale) introduce annual commitments with prepaid usage, which mitigates some cost but requires accurate forecasting.
Comparison: Datadog vs. Alternatives
| Criteria | Datadog | New Relic | Splunk | Grafana |
|---|---|---|---|---|
| Infrastructure monitoring | Excellent | Very Good | Good | Very Good |
| APM depth | Excellent | Excellent | Good | Good |
| Log management | Very Good | Good | Excellent | Good |
| Security features | Good | Basic | Excellent | Basic |
| AI/ML capabilities | Very Good | Good | Very Good | Basic |
| Ease of setup | Very Good | Very Good | Complex | Moderate |
| Pricing model | Complex/Expensive | Per user (simpler) | Very Expensive | Open source / low cost |
| Best for | Full-stack observability | Developer-led monitoring | Enterprise SIEM | Cost-conscious / open source |
New Relic overhauled its pricing model in recent years to a consumption-based per-user model that is more predictable at smaller scales. APM quality is comparable to Datadog, but the breadth of integrations and the infrastructure monitoring depth lag behind.
Splunk is the enterprise choice for log-heavy, security-first deployments. If your primary driver is compliance logging, SIEM, and long-term log retention at scale, Splunk often makes more sense despite its even higher price tag. The Splunk/Cisco combination post-acquisition has brought improved security capabilities.
Grafana (with Prometheus, Loki, and Tempo) is the open-source alternative that larger engineering teams with DevOps maturity use to avoid vendor lock-in. The observability quality is excellent — Grafana Cloud's managed offering is competitive — but you pay in operational overhead and lack of out-of-the-box integrations.
Who It's For
Datadog is the right choice if:
- You are running a complex, multi-cloud or hybrid infrastructure at meaningful scale
- Your engineering teams value tight integration between metrics, traces, logs, and security signals
- You have the budget and are willing to pay for reduced operational overhead
- Your SRE or platform team is sophisticated enough to configure alerting and dashboards properly
- You are already investing in vendor-led tooling and want a single commercial relationship
Datadog is not the right choice if:
- You are an early-stage startup with limited infrastructure — the costs will outpace the value
- Your team lacks the operational maturity to configure and maintain a platform this complex
- Log volume is your primary cost driver and you need long-term retention at low cost
- You want open-source flexibility and are comfortable managing infrastructure for your observability stack
- Your use case is primarily security operations — dedicated SIEM platforms will serve you better
How to Get Started
1. Start a free trial — Datadog offers a 14-day trial with full feature access. Start with just Infrastructure monitoring; resist the temptation to enable everything on day one.
2. Install the agent — The Datadog Agent installs in minutes on Linux, Windows, or macOS, and via Helm for Kubernetes. The auto-discovery feature will start pulling in integrations immediately.
3. Configure your first dashboard — Use the out-of-the-box dashboards for your primary technologies before building custom ones.
4. Set up monitors and alerts — Start with the recommended Watchdog alerts and add manual monitors for your critical SLOs.
5. Evaluate cost vs. value — After 30 days, assess your estimated monthly spend against the actual value delivered. Be rigorous here — Datadog's sales team will encourage expansion; make sure each product addition is justified.
6. Negotiate enterprise terms — If you are committing to meaningful annual spend, push for committed use discounts. They are available and substantial.
The Verdict
Datadog is the best full-stack observability platform available in 2026 for organisations with complex infrastructure and the budget to match. It delivers on its promise of unified visibility across metrics, traces, logs, and security — and Bits AI is a genuine productivity addition rather than marketing theatre.
The limitation is straightforward: it is expensive, and the pricing complexity makes it difficult to forecast and control spend. Organisations that are not rigorous about configuration and cost governance will find bills spiralling unexpectedly.
For the right team, at the right scale, with the right operational discipline, Datadog is worth every pound. For everyone else, the alternatives deserve serious consideration.
Digital by Default rating: 8.5/10
Want guidance on whether Datadog is the right observability platform for your engineering team? We assess, procure, and implement data and analytics tooling for UK businesses. [Talk to us at Digital by Default](/contact) — no sales pressure, just straight advice.
Enjoyed this article?
Subscribe to our Weekly AI Digest for more insights, trending tools, and expert picks delivered to your inbox.