Status¶
Live service health, incident archive, and scheduled maintenance.
Status page¶
The live status page lives at status.clouddigit.ai (subscribe via email, RSS, or webhook).
The page covers per-service, per-region indicators across all of:
Incident classifications¶
| State | Indicator | Means |
|---|---|---|
| Operational | green | Service running normally |
| Performance impact | yellow | Degraded latency / throughput, no API/availability impact |
| Partial outage | orange | Some operations failing |
| Major outage | red | Service unavailable |
| Maintenance | blue | Planned work in progress |
Subscriptions¶
- Email — per-service, per-region
- RSS —
https://status.clouddigit.ai/rss/<service>/<region>.rss - Webhook — JSON push on every status change
- ITSM — direct integration with PagerDuty, Opsgenie, ServiceNow
Scheduled maintenance calendar¶
Announced ≥ 7 days in advance for SLA-bearing changes. The status page shows an upcoming-maintenance band; customers in the affected region/service get email notice.
Post-incident reports (RCA)¶
For every P1 incident, an RCA is published on the status page within 5 BWD with:
- Summary (one paragraph)
- Customer impact (services, regions, time window, scope)
- Detection (how we found it, time to detect)
- Response (what we did, time to mitigate)
- Root cause (technical, human, process)
- Corrective actions (what we're changing)
- Preventive actions (what we're changing structurally to stop the class of issue)