v1.4.0
This release introduces significant enhancements to the Sveltos Dashboard, providing deeper visibility into event-driven workflows, and brings critical performance optimizations to EventReports and Pull-Mode cluster management. We’ve also enhanced the Sveltos MCP Server to simplify debugging of the Sveltos event framework.
✨ New Features & Enhancements
The Dashboard now offers a comprehensive view of your event-driven automation.
- Visibility: Display all EventTriggers in a single view.
- Traceability: For each EventTrigger, you can now see the linked EventSource and all matching clusters.
- Resource Inspection: Drill down into specific clusters to see exactly which Kubernetes resources matched the EventSource.
MCP Server for Framework Debugging
The Sveltos MCP (Model Context Protocol) Server has been enhanced to assist in debugging the Sveltos event framework. This tool provides a structured way to interact with and inspect the internal state of the event engine.
Reference: mcp-server PR #33
⚡ Performance Optimizations
- Smart EventReport Processing
We have optimized how EventReports are handled. Previously, every collected report was reprocessed. Now, Sveltos performs a delta check and only reprocesses EventReports that have changed since the last collection, significantly reducing CPU cycles in the management cluster.
- Resilient Pull-Mode Handling
Improved handling for managed clusters in Pull Mode when the Sveltos Agent stops reporting a "keep-alive":
-
Fault Detection: If an agent is non-responsive, Sveltos will skip that cluster and report a specific error: agent is not sending keep alive.
-
Auto-Recovery: As soon as the agent resumes communication, Sveltos automatically reprocesses all resources that were in a faulty state during the downtime.
🛠 Bug Fixes & Improvements
-
Drift-Detection: Fixed a critical issue where the drift-detection-manager failed to detect expired Kubeconfigs when running as an agent in a managed cluster.#363).
-
Event Manager: Enhanced reliability for event-driven resource distribution (#436).
-
Event Manager: Improved cleanup and state management within the event manager (#427).
-
Addon Controller: Various stability fixes and internal logic improvements across PRs #1572, #1568, #1563, and #1555.
-
Reloader: Fixed an issue where triggered rollouts failed for DaemonSets and StatefulSets when the Reloader knob was enabled.