Files
proxmox-infra/docs/cluster-state.md
kavren 120c2ec809 Initial commit: KavCorp infrastructure documentation
- CLAUDE.md: Project configuration for Claude Code
- docs/: Infrastructure documentation
  - INFRASTRUCTURE.md: Service map, storage, network
  - CONFIGURATIONS.md: Service configs and credentials
  - CHANGELOG.md: Change history
  - DECISIONS.md: Architecture decisions
  - TASKS.md: Task tracking
- scripts/: Automation scripts

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-07 22:07:01 -05:00

3.2 KiB

KavCorp Proxmox Cluster State

Last Updated: 2025-11-16

Cluster Overview

  • Cluster Name: KavCorp
  • Config Version: 6
  • Transport: knet
  • Quorum Status: Quorate (5/5 nodes online)
  • Total Nodes: 5
  • Total VMs: 2
  • Total LXCs: 19

Node Details

pm1 (10.4.2.2)

  • CPU: 4 cores
  • Memory: 16GB (15.4 GiB)
  • Storage: ~100GB local
  • Uptime: ~52 hours
  • Status: Online
  • Running Containers:
    • VMID 100: haos12.1 (VM - Home Assistant OS)
    • VMID 101: twingate (LXC)
    • VMID 102: zwave-js-ui (LXC)

pm2 (10.4.2.6) - Primary Management Node

  • CPU: 12 cores
  • Memory: 31GB (29.3 GiB)
  • Storage: ~100GB local
  • Uptime: ~52 hours
  • Status: Online
  • Running Containers:
    • VMID 104: traefik (LXC - Reverse Proxy)
    • VMID 105: sonarr (LXC)
    • VMID 108: radarr (LXC)
    • VMID 113: docker-pm2 (LXC - Docker host)
    • VMID 114: prowlarr (LXC)
    • VMID 115: jellyseerr (LXC)
    • VMID 116: authelia (LXC)
    • VMID 117: whisparr (LXC)
    • VMID 118: notifiarr (LXC)
    • VMID 119: bazarr (LXC)
    • VMID 120: kometa (LXC)

pm3 (10.4.2.3)

  • CPU: 16 cores
  • Memory: 33GB (30.7 GiB)
  • Storage: ~100GB local
  • Uptime: ~319 hours (~13 days)
  • Status: Online
  • Running Containers:
    • VMID 106: mqtt (LXC)
    • VMID 107: dockge (LXC - Docker management UI, 12 CPU, 8GB RAM)
    • VMID 109: docker-pm3 (VM - Docker host, 4 CPU, 12GB RAM)
    • VMID 111: frigate (LXC - NVR)
    • VMID 112: foundryvtt (LXC - Virtual tabletop)

pm4 (10.4.2.5)

  • CPU: 12 cores
  • Memory: 31GB (29.3 GiB)
  • Storage: ~100GB local
  • Uptime: ~52 hours
  • Status: Online
  • Running Containers:
    • VMID 103: shinobi (LXC - NVR)
    • VMID 110: docker-pm4 (LXC - Docker host)

elantris (10.4.2.14) - Storage Node

  • CPU: 16 cores
  • Memory: 128GB (125.7 GiB) - Largest node
  • Storage: ~100GB local + 24TB ZFS pool (el-pool)
  • Uptime: ~26 minutes (recently rebooted)
  • Status: Online
  • Running Containers:
    • VMID 121: jellyfin (LXC - Media server)

Cluster Health

  • Quorum: Yes (3/5 required, 5/5 available)
  • Expected Votes: 5
  • Total Votes: 5
  • All Nodes: Online and healthy

Network Architecture

  • Primary Network: 10.4.2.0/24
  • Gateway: 10.4.2.254
  • Bridge: vmbr0 (on all nodes, bridged to eno1)
  • DNS: Managed by gateway/router

Storage Summary

Shared Storage

  • KavNas (NFS): 23TB total, ~9.2TB used - Primary shared storage from Synology DS918+
  • elantris-downloads (NFS): 23TB total, ~10.6TB used - Download storage from elantris

Node-Local Storage

Each node has:

  • local: ~100GB directory storage (backups, templates, ISOs)
  • local-lvm: ~350-375GB LVM thin pool (VM/LXC disks)

ZFS Storage

  • el-pool (elantris only): 24TB ZFS pool, ~13.8TB used

Migration Status

Currently migrating services from Docker containers to dedicated LXCs. Most media stack services (Sonarr, Radarr, etc.) have been successfully migrated to LXCs on pm2.

Active Docker Hosts:

  • docker-pm2 (LXC 113): Currently empty/minimal usage
  • docker-pm3 (VM 109): Active, running containerized services
  • docker-pm4 (LXC 110): Active
  • dockge (LXC 107): Docker management UI with web interface