We're excited to announce the release of AWS ParallelCluster Cookbook 3.14.1
This is associated with AWS ParallelCluster v3.14.1
3.14.1
CHANGES
- Improve cluster update resiliency by ensuring clustermgtd is started after updates complete successfully, or after failed updates where queue reconfiguration succeeded.
- Add chef attribute
cluster/in_place_update_on_fleet_enabledto disable in-place updates on compute and login nodes
and mitigate performance impact at scale. See aws/aws-parallelcluster#7095 - Upgrade Slurm to version 24.11.7 (from 24.11.6).
- Upgrade Werkzeug to ~=3.1 (from ~=2.0) to address CVE-2024-34069.
- Upgrade Connexion to ~=2.15.1 (from ~=2.13.0).
- Upgrade Flask to ~=3.1.0 (from >=2.2.5,<2.3).
- Load kernel module
drm_client_libbefore installation of NVIDIA driver, if available on the kernel. - Reduce dependency footprint by installing the package
sssd-commonrather thansssd. - Disable Wayland protocol in GDM3 for Ubuntu 22.04+ to force the use of Xorg on GPU instances running without a display.
- Upgrade libjwt to version 1.18.4 (from 1.17.0) for all OSes except Amazon Linux 2.
- Upgrade amazon-efs-utils to version 2.4.0 (from v2.3.1).
- Upgrade EFA installer to 1.44.0 (from 1.43.2).
- Efa-driver: efa-2.17.3-1
- Efa-config: efa-config-1.18-1
- Efa-profile: efa-profile-1.7-1
- Libfabric-aws: libfabric-aws-2.3.1-1
- Rdma-core: rdma-core-59.0-1
- Open MPI: openmpi40-aws-4.1.7-2 and openmpi50-aws-5.0.8-11
BUG FIXES
- Fix race condition where compute nodes could deploy the wrong cluster config version after an update failure.
- Prevent cluster readiness check failures due to instances launched while the check is in progress.
- Fix incorrect timestamp parsing for chef-client.log in CloudWatch Agent configuration.