-
Notifications
You must be signed in to change notification settings - Fork 7
Additional Grafana dashboards: inference metrics and cost tracking #5
Copy link
Copy link
Open
Labels
area/observabilityMonitoring, metrics, logging, tracingMonitoring, metrics, logging, tracingenhancementNew feature or requestNew feature or requestgood-first-issueGood for newcomersGood for newcomerskind/featureNew feature or requestNew feature or requestobservabilityMonitoring and metricsMonitoring and metricssize/mediumMedium effort (1-3 days)Medium effort (1-3 days)
Description
Description
Create additional Grafana dashboards beyond the current GPU metrics dashboard.
Goals
-
Inference Metrics Dashboard
- Requests per second
- Tokens per second (prompt + generation)
- Latency percentiles (P50, P95, P99)
- Request queue depth
- Model usage breakdown
-
Cost Tracking Dashboard
- GPU utilization vs cost
- Cost per 1K tokens
- Idle time tracking
- Spot instance savings
- Monthly cost projections
Success Criteria
- Inference metrics dashboard JSON
- Cost tracking dashboard JSON
- Import instructions in docs
- Screenshots in documentation
- Alert rules for cost thresholds
Sprint
Sprint 2-3 priority
Current State
GPU hardware metrics dashboard exists at config/grafana/llmkube-gpu-dashboard.json
Good First Issue
Great for contributors familiar with Grafana and PromQL!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
area/observabilityMonitoring, metrics, logging, tracingMonitoring, metrics, logging, tracingenhancementNew feature or requestNew feature or requestgood-first-issueGood for newcomersGood for newcomerskind/featureNew feature or requestNew feature or requestobservabilityMonitoring and metricsMonitoring and metricssize/mediumMedium effort (1-3 days)Medium effort (1-3 days)