Decentralized AI Infrastructure Engineer
Validator Node Operator • GPU Compute • LLM Inference
ErhNode builds and operates production-grade decentralized AI infrastructure across multiple ecosystems.
Focused on:
- Validator operations
- GPU inference pipelines
- Automation & self-healing systems
- Real-world workload optimization
- Republic AI (Validator + Compute Node)
- Shelby (Early Access Contributor)
- Nous Hermes (Dev Contributor)
- VPS Validator Node (Cosmos-based)
- WSL2 GPU Compute Environment
- Cloudflare Tunnel (externally managed)
- Python-based inference + job execution
- Bash automation & watchdog systems
- 94.6% success rate
- 725,000+ completed jobs
- Top validator (Top #4 range)
- 24/7 stable uptime
Production-ready monitoring system for hybrid environments (VPS + WSL).
Includes:
- VPS validator watchdog
- WSL full-auto monitoring
- GPU health checks
- Telegram alert system
👉 Full setup guide:
docs/PRO_SETUP.md
- Monitors
republicd - Detects block stall
- Detects validator issues (jailed / catching_up)
- Auto-restart with cooldown protection
- Monitors
full-auto.sh - Monitors GPU availability
- Auto-restart for compute process only
- cloudflared
- http server
These are intentionally excluded to prevent instability and restart loops.
Telegram-based alerting:
- Critical → GPU missing, node down
- Warning → process restart
No alert spam (state-based notifications).
- Minimal interference
- Maximum stability
- Real-world reliability over theory
This setup is designed for:
- Long-running workloads
- High success-rate execution
- Stable endpoint exposure
- Autonomous recovery
ErhNode
Decentralized AI Infrastructure Builder
Focused on reliability, performance, and production-ready systems.