Overview
Datadog provides a comprehensive observability platform that integrates monitoring, logging, and tracing functionalities across various technology stacks and environments. Positioned for DevOps engineers, SREs, and development teams, the platform aims to provide a unified view of system performance and health. It supports complex cloud-native architectures, hybrid environments, and traditional on-premises infrastructure by collecting metrics, logs, and traces from applications, servers, and network devices through its agents and integrations.
The core utility of Datadog lies in its ability to correlate disparate data points. For instance, a DevOps engineer troubleshooting an application slowdown can move from high-level infrastructure metrics to specific application traces and corresponding log entries within the same dashboard. This integration is designed to reduce the mean time to resolution (MTTR) by centralizing diagnostic information and enabling faster root cause analysis. The platform also includes features for synthetic monitoring, allowing teams to simulate user interactions and API calls to proactively identify issues before they impact end-users. Real User Monitoring (RUM) further extends this by capturing actual user experience data from web and mobile applications to provide insights into front-end performance.
Datadog is particularly suited for organizations managing dynamic and distributed systems, such as those leveraging microservices, containers, and serverless architectures. Its extensive integration ecosystem supports major cloud providers like AWS, Google Cloud, and Azure, alongside popular tools and databases. The platform's capabilities extend to security monitoring, cloud cost management, and network performance, offering a consolidated approach to operational intelligence. This broad coverage positions it as a central component in maintaining the reliability and efficiency of modern software systems, aligning with principles of continuous monitoring and proactive incident management prevalent in DevOps practices.
Key features
- Infrastructure Monitoring: Collects metrics from servers, containers, and cloud instances, providing dashboards and alerts for resource utilization and health as detailed in the infrastructure monitoring documentation.
- Log Management: Ingests, processes, and analyzes logs from all sources, enabling real-time search, filtering, and pattern detection for troubleshooting and security.
- Application Performance Monitoring (APM): Traces requests across distributed services, offering insights into latency, error rates, and resource consumption for individual application components.
- Synthetic Monitoring: Simulates user journeys and API endpoints to proactively detect performance regressions and availability issues from various global locations.
- Real User Monitoring (RUM): Gathers performance data and user experience metrics directly from web and mobile applications to understand client-side performance.
- Security Monitoring: Detects threats and suspicious activity across infrastructure and applications using logs, metrics, and network data, integrating with security information and event management (SIEM) systems.
- Cloud Cost Management: Provides visibility into cloud spending across providers, helping identify cost inefficiencies and optimize resource allocation.
- Network Performance Monitoring (NPM): Monitors network traffic and connectivity between services and hosts, identifying bottlenecks and performance degradation across the network stack.
- Dashboards and Alerting: Customizable dashboards for visualizing metrics and logs, coupled with flexible alerting rules to notify teams of anomalies or critical events.
Pricing
Datadog's pricing model is modular, based on the specific products used, the number of hosts or monitored entities, and data ingestion volumes. A free plan is available for limited usage.
| Product Module | Starting Price (Billed Annually) | Details |
|---|---|---|
| Infrastructure Monitoring | $15/host/month | Includes metrics, events, and dashboards. Free tier for up to 5 hosts. |
| Log Management | $0.10/GB (ingested) | Price per GB of ingested and indexed logs. Additional costs for retention. Free tier for 100GB/month (ingested). |
| APM & Continuous Profiler | $31/host/month | Includes distributed tracing, service maps, and code-level performance insights. |
| Synthetic Monitoring | $5/1k tests | Pricing based on number of API tests and browser tests. |
| Real User Monitoring (RUM) | $1.50/1k sessions | Price per thousand user sessions. |
| Security Monitoring | $0.30/GB (ingested) | Price per GB of ingested security-relevant logs. |
Pricing as of April 2026. For the most current pricing details and enterprise options, refer to the official Datadog pricing page.
Common integrations
- Cloud Providers: AWS, Google Cloud Platform, Azure to monitor cloud services.
- Containers & Orchestration: Docker, Kubernetes, OpenShift for containerized environments.
- Operating Systems: Linux, Windows, macOS for host-level monitoring.
- Web Servers: Nginx, Apache HTTP Server to track web traffic and performance.
- Databases: MySQL, PostgreSQL, MongoDB, Redis for database performance metrics.
- Messaging Queues: Kafka, RabbitMQ to monitor message broker health.
- CI/CD Tools: Jenkins, GitLab CI/CD, GitHub Actions for pipeline visibility.
- Serverless: AWS Lambda, Azure Functions, Google Cloud Functions for serverless function monitoring.
Alternatives
- New Relic: Offers a similar suite of observability products, including APM, infrastructure monitoring, and logging, with a focus on full-stack visibility.
- Dynatrace: Provides AI-powered full-stack monitoring with automated root cause analysis, emphasizing enterprise-grade observability.
- Grafana Labs: Known for its open-source Grafana dashboarding tool, Grafana Labs also offers commercial solutions like Grafana Cloud for metrics, logs, and traces, often integrated with Prometheus and Loki.
Getting started
To begin collecting metrics with Datadog, you typically install the Datadog Agent on your hosts. Here's an example of how to install the agent on an Ubuntu system using a Bash script, followed by a Python example to send custom metrics.
Install Datadog Agent (Ubuntu)
DD_AGENT_MAJOR_VERSION=7 DD_API_KEY="<YOUR_DATADOG_API_KEY>" DD_SITE="datadoghq.com" bash -c "$(curl -L https://install.datadoghq.com/agent/install.sh)"
Replace <YOUR_DATADOG_API_KEY> with your actual API key from your Datadog account. The DD_SITE variable should match your Datadog region (e.g., datadoghq.eu for Europe).
Send Custom Metrics with Python
After installing the agent, you can send custom metrics using one of Datadog's client libraries. Here's a Python example using the datadog-api-client:
from datadog_api_client import ApiClient, Configuration
from datadog_api_client.v2.api.metrics_api import MetricsApi
from datadog_api_client.v2.model.metric_intake_type import MetricIntakeType
from datadog_api_client.v2.model.metric_point import MetricPoint
from datadog_api_client.v2.model.metric_series import MetricSeries
from datadog_api_client.v2.model.metric_payload import MetricPayload
import time
# Configure API key and application key
configuration = Configuration()
configuration.api_key["apiKeyAuth"] = "<YOUR_DATADOG_API_KEY>"
configuration.api_key["appKeyAuth"] = "<YOUR_DATADOG_APP_KEY>"
# Initialize the API client
with ApiClient(configuration) as api_client:
api_instance = MetricsApi(api_client)
# Define a custom metric
metric_name = "my_app.custom_metric"
metric_value = 42.5
current_timestamp = int(time.time())
# Create metric point
point = MetricPoint(
timestamp=current_timestamp,
value=metric_value,
)
# Create metric series
series = MetricSeries(
metric=metric_name,
type=MetricIntakeType("gauge"),
points=[point],
tags=["environment:dev", "service:my-app"],
)
# Create payload
body = MetricPayload(
series=[series],
)
try:
response = api_instance.submit_metrics(body=body)
print(f"Metric '{metric_name}' submitted successfully. Status: {response.status}")
except Exception as e:
print(f"Error submitting metric: {e}")
This Python script sends a single gauge metric named my_app.custom_metric with a value of 42.5 and includes tags for environment and service. Remember to replace <YOUR_DATADOG_API_KEY> and <YOUR_DATADOG_APP_KEY> with your actual keys. You can find more detailed instructions for various languages in the Datadog API documentation.