BlameTrail
IntegrationsObservability

Observability Integration

Connect your metrics, logging, and tracing providers to BlameTrail for unified observability during incidents.

Observability Integration

The observability integration connects your existing metrics, logging, and distributed tracing infrastructure to BlameTrail. When an incident occurs, BlameTrail automatically pulls relevant metrics, logs, and correlated traces from your providers, giving your team immediate context without switching tools.

Supported providers

ProviderMetricsLogsTraces
PrometheusYesNoNo
Grafana LokiNoYesNo
DatadogYesYesYes
AWS CloudWatchYesYesNo
TempoNoNoYes
JaegerNoNoYes
HoneycombNoNoYes
New RelicNoNoYes
Elastic APMNoNoYes
AWS X-RayNoNoYes
LightstepNoNoYes

What it does

  • Incident context — When an incident is created, BlameTrail queries your connected providers for metrics and logs around the incident time window (15 minutes before to 15 minutes after).
  • Service-level overview — View health status, key metrics, and recent logs for any service with connected observability providers.
  • Dedicated metrics explorer — Query and chart metrics across providers with preset templates or custom PromQL/Datadog queries.
  • Log search — Search and filter logs across Loki and CloudWatch connections, with level-based filtering and cursor pagination.
  • Trace correlation — When an incident has a suspect deploy, BlameTrail queries connected trace providers for traces around the deploy window and scores them by error density, timing proximity, latency deviation, and service overlap.
  • Latency regression detection — Compares pre-deploy and post-deploy latency percentiles (p50, p95, p99) per operation to surface regressions introduced by the suspect deploy.

How it works

BlameTrail Service → Observability Mapping → Provider Connection

                                              Prometheus / Loki / Datadog /
                                              CloudWatch / Tempo / Jaeger /
                                              Honeycomb / New Relic / Elastic /
                                              X-Ray / Lightstep

                                              Metrics, Logs & Traces

                                              Incident Context /
                                              Service Overview /
                                              Trace Correlation
  1. You connect one or more observability providers via the Integrations page.
  2. You map each connection to the BlameTrail services it monitors.
  3. When an incident fires or you visit a service page, BlameTrail queries the mapped providers in parallel and presents a unified view.

Requirements

  • A BlameTrail Starter or Pro plan
  • Network access from BlameTrail to your provider endpoints (Prometheus, Loki, Datadog API, or AWS CloudWatch)

Plan limits

FeatureStarterPro
Connections320
Max time range7 days30 days
Max log lines per query5001,000
Concurrent provider queries310

Next steps

On this page