Infrastructure Overview
Monitoring 47 hosts · 23 services · 6 databases · 3 K8s clusters
Avg Response Time
142 ms
+12% vs 1h ago
Error Rate
0.34 %
-8% vs 1h ago
Throughput
14.2 K req/s
+5% vs 1h ago
Availability
99.97 %
stable
Active Problems
View all →
Request Throughput
P95 latency
Requests
P95 Latency
Host Health — 12 monitored
Service Topology & Deployments
Service Flow
23 services
Recent Deployments
Last 24h
Performance Trends
Error Rate by Service
Top 5
checkout-svc
payment-gw
user-auth
CPU Saturation
All hosts
Avg CPU
Threshold 85%