🐶 SMS Reliability · Datadog Feed

Real-time metrics for throughput, delivery, and failure patterns directly from the Datadog API. Use this snapshot to judge platform health at a glance.

Namespace: sms · Service: messaging-core · Goal: sub-60s delivery

Dashboard mode

Global snapshot vs Saudi drilldown

Toggle between the standard Datadog SMS view and a Saudi Arabia (MCC 420) deep dive.
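The drilldown can be thought of as the same queries with a narrower scope. The sketch below swaps the wildcard scope `{*}` for a tag filter. The tag key `mcc` and the helper name `scope_query` are assumptions made for illustration; the dashboard does not state how Saudi traffic is tagged.

```python
def scope_query(query: str, scope: str) -> str:
    """Replace every wildcard scope `{*}` in a Datadog query with `{scope}`."""
    if not scope:
        raise ValueError("scope must be a non-empty tag filter")
    return query.replace("{*}", "{" + scope + "}")


# Hypothetical tag: the real tag key for the mobile country code may differ.
SAUDI_SCOPE = "mcc:420"

global_query = "per_minute(sum:smsc.sms.sent{*})"
saudi_query = scope_query(global_query, SAUDI_SCOPE)
print(saudi_query)  # per_minute(sum:smsc.sms.sent{mcc:420})
```

Rewriting the scope rather than maintaining a second set of queries keeps the two views comparable metric by metric.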

Live KPI Overview

Outbound Send Rate
--
Awaiting data · Live feed
Send Success
--
Awaiting data · Target >= 98%
Delivery Latency (p95)
--
Awaiting data · Goal <= 45s
Failed Sends Rate
--
Awaiting data · Target < 20 errors/min
Data Source
--

Shows whether the snapshot uses live or sample data.

Scope Filter
All traffic

Configured via Settings > Datadog.

Last Updated
--

Based on Datadog timestamps.

API Health
No requests yet

Displays the last Datadog error, if any.

Essential SMS Flow Metrics

Live Datadog values for throughput, delivery, latency, and queue posture.

Outbound Send Rate
--

Messages sent per minute via SMSC. Sudden drops are an early signal of outage or throttling.

Query: per_minute(sum:smsc.sms.sent{*})
Send Success
--

Percentage of SMS submissions that do not fail immediately. Watch for provider outages or credit blocks.

Query: 100 * (sum:smsc.sms.sent{*} - sum:smsc.failed{*}) / sum:smsc.sms.sent{*}
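As a worked illustration of the formula in this query, the sketch below computes the same percentage from two raw counts. The function name and the choice to return `None` when nothing was sent are assumptions, not part of the dashboard.

```python
from typing import Optional


def send_success_pct(sent: float, failed: float) -> Optional[float]:
    """Mirror of 100 * (sent - failed) / sent, guarding against an empty window."""
    if sent <= 0:
        # No traffic in the window: the ratio is undefined, so report nothing.
        return None
    return 100.0 * (sent - failed) / sent


print(send_success_pct(1000, 15))  # 98.5
```

With 1000 submissions and 15 immediate failures the result is 98.5, which clears the 98% target shown on the KPI card.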
E2E Delivery Latency (p95)
--

P95 time from submission to delivery receipt. Target < 45s; warn at 55s.

Query: p95:sms.latency.e2e{*}
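The two thresholds above could be applied as follows. This is a minimal sketch: the status labels, and the decision to treat the band between 45s and 55s as "elevated", are assumptions, since the dashboard only names the target and the warning level.

```python
def latency_status(p95_seconds: float, target: float = 45.0, warn: float = 55.0) -> str:
    """Classify a p95 latency reading against the target and warning thresholds."""
    if p95_seconds < target:
        return "ok"
    if p95_seconds < warn:
        # Above target but below the warning level (assumed intermediate band).
        return "elevated"
    return "warn"


for reading in (30.0, 48.5, 55.0):
    print(reading, latency_status(reading))
```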
DLR Success
--

Percentage of sent messages that receive a delivery receipt. Drops can often be traced to a single provider or country.

Query: 100 * sum:smsc.dlr.received{*} / sum:smsc.sms.sent{*}
Queue Backlog
--

Messages waiting for an SMSC bind or provider acceptance. Sustained growth points to throughput limits.

Query: max:smsc.queued{*}
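One way to make "sustained growth" concrete is to check whether the most recent backlog samples rise without interruption. The sketch below assumes a list of queue-depth readings in time order; the window of five points is an arbitrary illustrative choice, not a value taken from the dashboard.

```python
from typing import Sequence


def is_sustained_growth(samples: Sequence[float], min_points: int = 5) -> bool:
    """Return True when the last `min_points` samples are strictly increasing."""
    if len(samples) < min_points:
        return False
    window = samples[-min_points:]
    # Compare each sample with its successor inside the window.
    return all(later > earlier for earlier, later in zip(window, window[1:]))


print(is_sustained_growth([10, 12, 15, 19, 30]))  # True
print(is_sustained_growth([10, 12, 11, 19, 30]))  # False
```

A strict check like this ignores a single spike that drains on its own, which is usually the behavior wanted when looking for throughput limits.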
Failed Sends Rate
--

Failed submissions per minute. Spikes often align with throttling, credit issues, or template rejections.

Query: per_minute(sum:smsc.failed{*})

Metric Ledger

Copy these numbers directly into performance reports. Queries mirror the API requests powering the cards above.

Metric | Current Value | Datadog Query
Outbound Send Rate | -- | per_minute(sum:smsc.sms.sent{*})
Send Success | -- | 100 * (sum:smsc.sms.sent{*} - sum:smsc.failed{*}) / sum:smsc.sms.sent{*}
Delivery Latency (p95) | -- | p95:sms.latency.e2e{*}
DLR Success | -- | 100 * sum:smsc.dlr.received{*} / sum:smsc.sms.sent{*}
Queue Backlog | -- | max:smsc.queued{*}
Failed Sends Rate | -- | per_minute(sum:smsc.failed{*})
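To reproduce a ledger value outside the dashboard, a query string can be sent to the Datadog v1 timeseries query endpoint, which takes `from`, `to`, and `query` parameters and expects `DD-API-KEY` and `DD-APPLICATION-KEY` headers. The sketch below only builds the request URL and makes no network call. The `datadoghq.com` site, the five-minute window, and the helper name are assumptions; the dashboard's actual request settings are not shown.

```python
import time
from typing import Optional
from urllib.parse import urlencode

# Assumed site; other Datadog regions use a different host.
DATADOG_QUERY_ENDPOINT = "https://api.datadoghq.com/api/v1/query"


def build_query_url(query: str, window_seconds: int = 300, now: Optional[float] = None) -> str:
    """Build a timeseries query URL covering the last `window_seconds` seconds."""
    end = int(now if now is not None else time.time())
    start = end - window_seconds
    params = {"from": start, "to": end, "query": query}
    return DATADOG_QUERY_ENDPOINT + "?" + urlencode(params)


# A fixed timestamp keeps the example reproducible.
url = build_query_url("max:smsc.queued{*}", window_seconds=300, now=1_700_000_000)
print(url)
```

Passing `now` explicitly makes the window deterministic, which helps when comparing a copied ledger value against a later re-query.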