Files
proxmox-infra/docs/cluster-state.md
kavren 120c2ec809 Initial commit: KavCorp infrastructure documentation
- CLAUDE.md: Project configuration for Claude Code
- docs/: Infrastructure documentation
  - INFRASTRUCTURE.md: Service map, storage, network
  - CONFIGURATIONS.md: Service configs and credentials
  - CHANGELOG.md: Change history
  - DECISIONS.md: Architecture decisions
  - TASKS.md: Task tracking
- scripts/: Automation scripts

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2025-12-07 22:07:01 -05:00

116 lines
3.2 KiB
Markdown

# KavCorp Proxmox Cluster State
**Last Updated**: 2025-11-16
## Cluster Overview
- **Cluster Name**: KavCorp
- **Config Version**: 6
- **Transport**: knet
- **Quorum Status**: Quorate (5/5 nodes online)
- **Total Nodes**: 5
- **Total VMs**: 2
- **Total LXCs**: 19
## Node Details
### pm1 (10.4.2.2)
- **CPU**: 4 cores
- **Memory**: 16GB (15.4 GiB)
- **Storage**: ~100GB local
- **Uptime**: ~52 hours
- **Status**: Online
- **Running Containers**:
- VMID 100: haos12.1 (VM - Home Assistant OS)
- VMID 101: twingate (LXC)
- VMID 102: zwave-js-ui (LXC)
### pm2 (10.4.2.6) - Primary Management Node
- **CPU**: 12 cores
- **Memory**: 31GB (29.3 GiB)
- **Storage**: ~100GB local
- **Uptime**: ~52 hours
- **Status**: Online
- **Running Containers**:
- VMID 104: traefik (LXC - Reverse Proxy)
- VMID 105: sonarr (LXC)
- VMID 108: radarr (LXC)
- VMID 113: docker-pm2 (LXC - Docker host)
- VMID 114: prowlarr (LXC)
- VMID 115: jellyseerr (LXC)
- VMID 116: authelia (LXC)
- VMID 117: whisparr (LXC)
- VMID 118: notifiarr (LXC)
- VMID 119: bazarr (LXC)
- VMID 120: kometa (LXC)
### pm3 (10.4.2.3)
- **CPU**: 16 cores
- **Memory**: 33GB (30.7 GiB)
- **Storage**: ~100GB local
- **Uptime**: ~319 hours (~13 days)
- **Status**: Online
- **Running Containers**:
- VMID 106: mqtt (LXC)
- VMID 107: dockge (LXC - Docker management UI, 12 CPU, 8GB RAM)
- VMID 109: docker-pm3 (VM - Docker host, 4 CPU, 12GB RAM)
- VMID 111: frigate (LXC - NVR)
- VMID 112: foundryvtt (LXC - Virtual tabletop)
### pm4 (10.4.2.5)
- **CPU**: 12 cores
- **Memory**: 31GB (29.3 GiB)
- **Storage**: ~100GB local
- **Uptime**: ~52 hours
- **Status**: Online
- **Running Containers**:
- VMID 103: shinobi (LXC - NVR)
- VMID 110: docker-pm4 (LXC - Docker host)
### elantris (10.4.2.14) - Storage Node
- **CPU**: 16 cores
- **Memory**: 128GB (125.7 GiB) - **Largest node**
- **Storage**: ~100GB local + 24TB ZFS pool (el-pool)
- **Uptime**: ~26 minutes (recently rebooted)
- **Status**: Online
- **Running Containers**:
- VMID 121: jellyfin (LXC - Media server)
## Cluster Health
- **Quorum**: Yes (3/5 required, 5/5 available)
- **Expected Votes**: 5
- **Total Votes**: 5
- **All Nodes**: Online and healthy
## Network Architecture
- **Primary Network**: 10.4.2.0/24
- **Gateway**: 10.4.2.254
- **Bridge**: vmbr0 (on all nodes, bridged to eno1)
- **DNS**: Managed by gateway/router
## Storage Summary
### Shared Storage
- **KavNas** (NFS): 23TB total, ~9.2TB used - Primary shared storage from Synology DS918+
- **elantris-downloads** (NFS): 23TB total, ~10.6TB used - Download storage from elantris
### Node-Local Storage
Each node has:
- **local**: ~100GB directory storage (backups, templates, ISOs)
- **local-lvm**: ~350-375GB LVM thin pool (VM/LXC disks)
### ZFS Storage
- **el-pool** (elantris only): 24TB ZFS pool, ~13.8TB used
## Migration Status
Currently migrating services from Docker containers to dedicated LXCs. Most media stack services (Sonarr, Radarr, etc.) have been successfully migrated to LXCs on pm2.
**Active Docker Hosts**:
- docker-pm2 (LXC 113): Currently empty/minimal usage
- docker-pm3 (VM 109): Active, running containerized services
- docker-pm4 (LXC 110): Active
- dockge (LXC 107): Docker management UI with web interface