Summary
Provide a ready-to-import Grafana dashboard JSON for llm-proxy metrics, assuming a Prometheus datasource.
Architectural Guidance
Dashboard must not require proprietary exporters.
Keep it aligned to the metrics we actually expose (see Prometheus endpoint issue).
Avoid overly complex panels; focus on operational usefulness.
Suggested Panels
Request rate, error rate
Latency (if available), or proxy upstream duration if tracked
Cache: hits/misses/bypass/store (if exposed)
Event bus: publish/drop, stream lag/pending (if available)
Uptime / restarts
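As a starting point for the panels above, here is a minimal sketch that generates a two-panel dashboard (request rate and error rate) as importable JSON. The metric name `llm_proxy_requests_total`, the `status` label, the `DS_PROMETHEUS` variable name, and the `schemaVersion` value are all assumptions for illustration; they should be replaced with whatever the proxy actually exports and the Grafana version in use expects.

```python
import json

# Hypothetical metric name; replace with the name llm-proxy actually exports.
REQUESTS_TOTAL = "llm_proxy_requests_total"


def timeseries_panel(panel_id, title, expr, x, y):
    """Return a minimal time series panel with a single Prometheus query."""
    return {
        "id": panel_id,
        "type": "timeseries",
        "title": title,
        # Datasource is resolved through a dashboard variable (assumed name).
        "datasource": {"type": "prometheus", "uid": "${DS_PROMETHEUS}"},
        "gridPos": {"h": 8, "w": 12, "x": x, "y": y},
        "targets": [{"refId": "A", "expr": expr}],
    }


def build_dashboard():
    """Assemble the dashboard dict; the `status` label is an assumption."""
    request_rate = f"sum(rate({REQUESTS_TOTAL}[5m]))"
    error_ratio = (
        f'sum(rate({REQUESTS_TOTAL}{{status=~"5.."}}[5m]))'
        f" / sum(rate({REQUESTS_TOTAL}[5m]))"
    )
    return {
        "uid": "llm-proxy",
        "title": "llm-proxy",
        "schemaVersion": 39,  # assumed; match your Grafana version
        "time": {"from": "now-1h", "to": "now"},
        "templating": {
            "list": [
                {"name": "DS_PROMETHEUS", "type": "datasource", "query": "prometheus"}
            ]
        },
        "panels": [
            timeseries_panel(1, "Request rate", request_rate, 0, 0),
            timeseries_panel(2, "Error rate", error_ratio, 12, 0),
        ],
    }


if __name__ == "__main__":
    print(json.dumps(build_dashboard(), indent=2))
```

Generating the JSON from a small script is optional; a hand-exported dashboard works just as well. The remaining panels (cache, event bus, uptime) would follow the same shape once their metric names are confirmed.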
Delivery
Store the dashboard JSON in-repo. Suggested paths: deploy/helm/llm-proxy/dashboards/llm-proxy.json or docs/observability/dashboards/.
Optionally, the Helm chart can create a ConfigMap carrying the grafana_dashboard label (opt-in) so the Grafana sidecar can discover the dashboard.
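To make the opt-in ConfigMap concrete, the sketch below builds the manifest as a Python dict. In the chart itself this would be a Helm template, so treat it only as an illustration of the resulting object. The function name, the `enabled` flag, the label value `"1"`, and the `llm-proxy.json` data key are assumptions; the label value in particular must match whatever the Grafana sidecar in your cluster is configured to watch.

```python
import json


def dashboard_configmap(name, dashboard, enabled=False, label_value="1"):
    """Return a ConfigMap manifest for sidecar discovery, or None when disabled.

    `enabled` mirrors an opt-in chart value (hypothetical name); the label
    value is an assumption and depends on the sidecar configuration.
    """
    if not enabled:
        return None
    return {
        "apiVersion": "v1",
        "kind": "ConfigMap",
        "metadata": {
            "name": name,
            "labels": {"grafana_dashboard": label_value},
        },
        # The sidecar reads each data key as one dashboard file.
        "data": {"llm-proxy.json": json.dumps(dashboard, indent=2)},
    }
```

Keeping the default at disabled means clusters without a Grafana sidecar get no extra object.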
Acceptance Criteria
Dashboard imports cleanly.
Queries match the exported metric names.
Docs explain how to import and use the dashboard.
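The "queries match the exported metric names" criterion could be checked automatically. The following is a rough sketch, not a full PromQL parser: it collects sample names from a Prometheus text exposition, scans each panel's `expr` for identifiers that start with an assumed `llm_proxy_` prefix, and reports any that are not exported. The prefix and all function names are hypothetical, and panels nested inside rows are not traversed.

```python
import re


def exported_metric_names(metrics_text):
    """Collect sample names from Prometheus text exposition output."""
    names = set()
    for line in metrics_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = re.match(r"[a-zA-Z_:][a-zA-Z0-9_:]*", line)
        if match:
            names.add(match.group(0))
    return names


def dashboard_exprs(dashboard):
    """Yield every query expression from top-level panels."""
    for panel in dashboard.get("panels", []):
        for target in panel.get("targets", []):
            if "expr" in target:
                yield target["expr"]


def unknown_metrics(dashboard, metrics_text, prefix="llm_proxy_"):
    """Return prefixed metric names used in queries but not exported."""
    exported = exported_metric_names(metrics_text)
    pattern = re.compile(r"\b" + re.escape(prefix) + r"[a-zA-Z0-9_:]*")
    referenced = set()
    for expr in dashboard_exprs(dashboard):
        referenced.update(pattern.findall(expr))
    return sorted(referenced - exported)
```

A CI step could load the dashboard JSON, fetch the proxy's metrics output from a test instance, and fail when `unknown_metrics` returns a non-empty list.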
Parent: #54