System Overview
Real-time infrastructure health and operational monitoring for staging.
Services
12/12
12
Containers
7 up
8
Uptime
SLA
99.72%
Avg Latency
P50
42ms
Deploy Rate
100%
Open Incidents
none
0
Cluster Uptime Trend (14d)
Deployments (14d)
feat: add real-time metrics dashboard
production
a1b2c3d · main · ci-bot
Deployed
v2.4.1 · 4h ago
Recent Deployments
feat: add real-time metrics dashboard
Succeeded
v2.4.1 · 4m 23s
4h ago
chore: bump dependencies and security patches
Succeeded
v2.4.0 · 3m 57s
6h ago
fix: resolve JWT expiry edge case on token refresh
Succeeded
v2.3.9 · 5m 12s
8h ago
feat: implement webhook retry logic with backoff
Failed
v2.3.8 · 2m 08s
10h ago
Container Health
| Container | Image | Status | CPU | Memory | Uptime |
|---|---|---|---|---|---|
| python-app | python:3.11-slim | Running | 71.4% | 142 MB | 14d 6h 32m |
| node-app | node:18-alpine | Running | 1.8% | 118 MB | 7d 2h 15m |
| redis | redis:7-alpine | Running | 46% | 48 MB | 30d 0h 45m |
| mongodb | mongo:7.0 | Running | 70.5% | 285 MB | 21d 10h 08m |
Environments
Production
v2.4.1
45.2K req/min
Staging
v2.4.0
2.4K req/min
Development
v2.4.2-dev
120 req/min
Incidents
INC-0889
investigating
Database Replication Lag Spike
PostgreSQL Primary · 30m ago
Pipeline Health
DEPLOYMENT SUCCESS RATE
100%
Succeeded: 6
Total: 6
main · 4h ago
release · 6h ago
hotfix · 8h ago