Skip to content

Monitoring

Comprehensive monitoring infrastructure for all systems.


Architecture Diagram

graph TB
    subgraph MONITORING["Monitoring Infrastructure (All VMs)"]
        ZABBIX["Zabbix Server<br/>10.32.8.148<br/>60+ hosts monitored"]
        UPTIME["UptimeKuma<br/>10.32.8.102<br/>Uptime monitoring"]
        NETDATA["Netdata<br/>Bandwidth monitoring"]
    end

    subgraph TARGETS["Monitored Systems"]
        API["7 API Servers"]
        SQL["SQL Servers"]
        MONGO["MongoDB Servers"]
        HAPROXY["HAProxy"]
        OTH["OTH Servers"]
        CM["Channel Managers"]
    end

    ZABBIX -->|"SNMP/Agent"| TARGETS
    UPTIME -->|"HTTP Checks"| TARGETS
    NETDATA -->|"Metrics"| TARGETS

    style ZABBIX fill:#90EE90
    style UPTIME fill:#87CEEB
    style NETDATA fill:#FFE4B5

Server Inventory

Server IP Address Type Host Purpose
Zabbix 10.32.8.148 VM XCP-3 Infrastructure monitoring (60+ hosts)
UptimeKuma 10.32.8.102 VM XCP-3 Uptime and availability monitoring
Netdata - VM - Bandwidth and performance metrics

Monitoring Capabilities

System Metrics Collected
Zabbix CPU, Memory, Disk, Network, Services, Custom metrics
UptimeKuma HTTP/HTTPS endpoints, Response times, SSL certificates
Netdata Real-time bandwidth, Per-process metrics

Zabbix Details

graph TB
    subgraph ZABBIX_SERVER["Zabbix Server (10.32.8.148)"]
        ZS["Zabbix Server"]
        ZW["Zabbix Web UI"]
        ZD["Zabbix Database"]
    end

    subgraph AGENTS["Zabbix Agents (60+ Hosts)"]
        A1["API Servers"]
        A2["SQL Servers"]
        A3["MongoDB Servers"]
        A4["HAProxy"]
        A5["Other Services"]
    end

    A1 & A2 & A3 & A4 & A5 -->|"Metrics"| ZS
    ZS --> ZD
    ZW --> ZD

    style ZABBIX_SERVER fill:#90EE90
Feature Details
Hosts Monitored 60+
Check Interval 30-60 seconds
Data Retention 90 days
Alerting Email, SMS, Webhook

UptimeKuma Details

Feature Details
Endpoints Monitored All critical services
Check Types HTTP, HTTPS, TCP, Ping
Check Interval 60 seconds
SSL Monitoring Certificate expiry alerts
Status Page Public/Private status pages

Alert Channels

Channel Used For
Email All alerts
SMS Critical alerts only
Webhook Integration with other systems
Dashboard Real-time visibility

Last Updated: 2025-12-02