28.12.2021
- Kubernetes has been updated to version 1.23.1. Please update your kubectl accordingly.
- Pod security infrastructure has been migrated from the deprecated PodSecurityPolicy to OPA/Gatekeeper. No changes should be required on your side if everything was configured as intended, but please let me know if there is something you should be allowed to do but cannot, or something you can do that should be forbidden.
- All GPU drivers have been updated to the most recent versions available for the respective machines. You might have to migrate to more recent versions of GPU containers. The GPU driver and CUDA version of all compute nodes are now shown on the cluster status page.
- Node Zariel is currently not available: the system update broke something and the node no longer boots. I need physical access to the server room, so the earliest date I can fix it is January 10th. Please be considerate with the number of GPUs you reserve.
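Regarding the kubectl update in the 28.12.2021 entry: the upstream Kubernetes version skew policy supports kubectl within one minor version (older or newer) of the API server. The following is a minimal sketch of that check, not an official tool. The function names `parse_major_minor` and `kubectl_within_skew` are hypothetical, and the default server version is simply the 1.23.1 stated above.

```python
import re
from typing import Tuple


def parse_major_minor(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as 'v1.23.1'."""
    match = re.match(r"v?(\d+)\.(\d+)", version.strip())
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def kubectl_within_skew(client: str, server: str = "1.23.1") -> bool:
    """Return True if the client is within one minor version of the server.

    Assumption: the default server version is the cluster version from the
    28.12.2021 entry; pass a different value if the cluster changes.
    """
    client_major, client_minor = parse_major_minor(client)
    server_major, server_minor = parse_major_minor(server)
    same_major = client_major == server_major
    return same_major and abs(client_minor - server_minor) <= 1


if __name__ == "__main__":
    # Example client versions; replace with your own client version string.
    for candidate in ("v1.20.0", "v1.22.4", "v1.23.1"):
        status = "ok" if kubectl_within_skew(candidate) else "update needed"
        print(f"{candidate}: {status}")
```

Your own client version string can be read from the output of `kubectl version --client`.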
01.02.2021
- Full cluster rebuild with Kubernetes 1.20.0
- Hostpath volumes for Ceph home directories, shared and dataset storage, and local node data.
30.11.2020
- Node Zariel has been added to the cluster.
15.07.2020
- Ceph persistent storage cluster added