Monitoring
Zabbix, UptimeKuma, and Netdata together monitor all systems, covering host and service metrics, endpoint availability, and bandwidth.
Architecture Diagram
graph TB
subgraph MONITORING["Monitoring Infrastructure (All VMs)"]
ZABBIX["Zabbix Server<br/>10.32.8.148<br/>60+ hosts monitored"]
UPTIME["UptimeKuma<br/>10.32.8.102<br/>Uptime monitoring"]
NETDATA["Netdata<br/>Bandwidth monitoring"]
end
subgraph TARGETS["Monitored Systems"]
API["7 API Servers"]
SQL["SQL Servers"]
MONGO["MongoDB Servers"]
HAPROXY["HAProxy"]
OTH["OTH Servers"]
CM["Channel Managers"]
end
ZABBIX -->|"SNMP/Agent"| TARGETS
UPTIME -->|"HTTP Checks"| TARGETS
NETDATA -->|"Metrics"| TARGETS
style ZABBIX fill:#90EE90
style UPTIME fill:#87CEEB
style NETDATA fill:#FFE4B5
Server Inventory
| Server | IP Address | Type | Host | Purpose |
|---|---|---|---|---|
| Zabbix | 10.32.8.148 | VM | XCP-3 | Infrastructure monitoring (60+ hosts) |
| UptimeKuma | 10.32.8.102 | VM | XCP-3 | Uptime and availability monitoring |
| Netdata | - | VM | - | Bandwidth and performance metrics |
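
For scripts that need these addresses, the inventory above can be held as structured data. The following is a minimal sketch, not an existing tool: the `MonitoringServer` class, its field names, and the two helper functions are hypothetical. Netdata's IP address and host are stored as `None` because the table does not list them.

```python
from dataclasses import dataclass
from ipaddress import IPv4Address
from typing import Optional


@dataclass(frozen=True)
class MonitoringServer:
    """One row of the Server Inventory table (class name is illustrative)."""
    name: str
    ip: Optional[IPv4Address]  # None where the table shows "-"
    kind: str                  # value of the Type column
    host: Optional[str]        # value of the Host column; None where it shows "-"
    purpose: str


INVENTORY = [
    MonitoringServer("Zabbix", IPv4Address("10.32.8.148"), "VM", "XCP-3",
                     "Infrastructure monitoring (60+ hosts)"),
    MonitoringServer("UptimeKuma", IPv4Address("10.32.8.102"), "VM", "XCP-3",
                     "Uptime and availability monitoring"),
    MonitoringServer("Netdata", None, "VM", None,
                     "Bandwidth and performance metrics"),
]


def servers_on_host(host: str) -> list[str]:
    """Return the names of monitoring servers listed on the given host."""
    return [s.name for s in INVENTORY if s.host == host]


def missing_addresses() -> list[str]:
    """Return the names of servers whose IP address is not documented."""
    return [s.name for s in INVENTORY if s.ip is None]
```

Calling `missing_addresses()` is a quick way to spot rows that still need filling in, such as the Netdata entry.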
Monitoring Capabilities
| System | Metrics Collected |
|---|---|
| Zabbix | CPU, Memory, Disk, Network, Services, Custom metrics |
| UptimeKuma | HTTP/HTTPS endpoints, Response times, SSL certificates |
| Netdata | Real-time bandwidth, Per-process metrics |
Zabbix Details
graph TB
subgraph ZABBIX_SERVER["Zabbix Server (10.32.8.148)"]
ZS["Zabbix Server"]
ZW["Zabbix Web UI"]
ZD["Zabbix Database"]
end
subgraph AGENTS["Zabbix Agents (60+ Hosts)"]
A1["API Servers"]
A2["SQL Servers"]
A3["MongoDB Servers"]
A4["HAProxy"]
A5["Other Services"]
end
A1 & A2 & A3 & A4 & A5 -->|"Metrics"| ZS
ZS --> ZD
ZW --> ZD
style ZABBIX_SERVER fill:#90EE90
| Feature | Details |
|---|---|
| Hosts Monitored | 60+ |
| Check Interval | 30-60 seconds |
| Data Retention | 90 days |
| Alerting | Email, SMS, Webhook |
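
The check interval and retention figures above imply a rough number of stored samples per monitored item. The sketch below works through that arithmetic. It assumes every sample is kept as a raw value for the full 90 days, which may not match how this Zabbix server is actually configured; the function name is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def samples_retained(interval_seconds: int, retention_days: int = 90) -> int:
    """Number of samples one item accumulates over the retention window."""
    return retention_days * SECONDS_PER_DAY // interval_seconds


# The two ends of the documented 30-60 second check interval.
at_30s = samples_retained(30)  # 259200 samples per item
at_60s = samples_retained(60)  # 129600 samples per item
```

Multiplying these figures by the number of items per host and by the 60+ hosts gives an order-of-magnitude estimate for database sizing.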
UptimeKuma Details
| Feature | Details |
|---|---|
| Endpoints Monitored | All critical services |
| Check Types | HTTP, HTTPS, TCP, Ping |
| Check Interval | 60 seconds |
| SSL Monitoring | Certificate expiry alerts |
| Status Page | Public/Private status pages |
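
To illustrate what a certificate expiry alert involves, the sketch below computes the days remaining from a certificate's `notAfter` timestamp using Python's standard `ssl` module. It is not how UptimeKuma implements the check; the function names and the 14-day warning threshold are assumptions chosen for the example.

```python
import ssl
from datetime import datetime, timezone


def days_until_expiry(not_after: str, now: datetime) -> int:
    """Whole days between `now` and a certificate's notAfter timestamp.

    `not_after` uses the format found in the "notAfter" field returned by
    ssl.SSLSocket.getpeercert(), e.g. "Jan  5 09:34:43 2018 GMT".
    """
    expires = datetime.fromtimestamp(
        ssl.cert_time_to_seconds(not_after), tz=timezone.utc
    )
    return (expires - now).days


def needs_renewal(not_after: str, now: datetime, warn_days: int = 14) -> bool:
    """True when the certificate expires within `warn_days` (assumed threshold)."""
    return days_until_expiry(not_after, now) <= warn_days
```

Passing `now` explicitly, rather than reading the clock inside the function, keeps the calculation easy to test against fixed dates.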
Alert Channels
| Channel | Used For |
|---|---|
| Email | All alerts |
| SMS | Critical alerts only |
| Webhook | Integration with other systems |
| Dashboard | Real-time visibility |
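
The routing rule in the table above (email for every alert, SMS for critical alerts only) can be sketched as a small function. The `Severity` levels and the function name are hypothetical, since this page does not define severity names. Webhook and Dashboard are omitted because the table does not say which severities they receive.

```python
from enum import Enum


class Severity(Enum):
    """Illustrative severity levels; the real names are not documented here."""
    INFO = 1
    WARNING = 2
    CRITICAL = 3


def channels_for(severity: Severity) -> list[str]:
    """Channels an alert of the given severity is sent to, per the table."""
    channels = ["email"]           # email receives all alerts
    if severity is Severity.CRITICAL:
        channels.append("sms")     # SMS is reserved for critical alerts
    return channels
```

For example, `channels_for(Severity.WARNING)` returns only `["email"]`, while a critical alert also goes out by SMS.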
Last Updated: 2025-12-02