Major Features:
- Support Metax scheduling optimazation
- Support Mthreads sGPU
- Add a configMap hami-scheduler-device for all configurations of HAMi
- Optimize installation process
Details
⬆️ Dependencies
- Bump actions/download-artifact from 3 to 4 by @dependabot in #529
- Bump docker/build-push-action from 6.8.0 to 6.9.0 by @dependabot in #528
- Bump actions/upload-artifact from 3.1.3 to 4.4.0 by @dependabot in #530
- Bump aquasecurity/trivy-action from 0.24.0 to 0.27.0 by @dependabot in #546
- Bump actions/upload-artifact from 4.4.0 to 4.4.3 by @dependabot in #541
- Bump ubuntu from 20.04 to 24.04 in /docker by @dependabot in #394
- Bump aquasecurity/trivy-action from 0.27.0 to 0.28.0 by @dependabot in #559
- Bump codecov/codecov-action from 4 to 5 by @dependabot in #613
🔨 Other Changes
- fix build badge status by @wawa0210 in #526
- update action-gh-release template file to more accurate matching by @wawa0210 in #527
- Refactor helm "Admission Webhook" config. by @4gt-104 in #532
- fix: error happen when allocate iluvatar device by @lijm87 in #522
- Fix code scanning alert-Incorrect conversion between integer types by @ghostloda in #556
- update hami-core version by @chaunceyjiang in #557
- Mthreads support by @archlitchi in #560
- Fix code scanning alert-Incorrect conversion between integer types by @ghostloda in #561
- update docs by @ghostloda in #567
- migrate hami slack to cncf hami group by @wawa0210 in #568
- Fix pod assignment issue when pod already has a node assigned by @chaunceyjiang in #564
- fix(scheduler): prevent array out-of-bounds when GPU containers are placed between non-GPU containers by @Nimbus318 in #572
- improve pkg/k8sutil/pod.go ut coverage by @wawa0210 in #570
- Metax GPU topo-awareness support by @archlitchi in #574
- Add WebUI to readme and readme_cn.md by @archlitchi in #578
- remove watermark of MetaX topo diagrams by @obnah in #581
- update HAMi Talks and References by @wawa0210 in #582
- fix: assgin to wrong devices when 1 pod has 2+ containers request GPU by @joy717 in #593
- docs: fix deployments path in README by @dublc in #608
- Add unified configMap and update charts by @archlitchi in #614
- Fix configMap device-config not properly installed by @archlitchi in #616
- fix CI: race condition error by @archlitchi in #618
- Pre release to v2.4.1 by @archlitchi in #619
New Contributors
- @4gt-104 made their first contribution in #532
- @lijm87 made their first contribution in #522
- @ghostloda made their first contribution in #556
- @Nimbus318 made their first contribution in #572
- @obnah made their first contribution in #581
- @dublc made their first contribution in #608
Full Changelog: v2.4.0...v2.4.1