What's new?
Python Quantization Tool
- Prevent int32 quantized bias from clipping by adjusting the weight's scale (#22020) - @adrianlizarraga
- Update QDQ Pad, Slice, Softmax (#22676) - @adrianlizarraga
- Introduce get_qdq_config() helper to get QDQ configurations (#22677) - @adrianlizarraga
- Add reduce_range option to get_qdq_config() (#22782) - @adrianlizarraga
- Fix flaky test caused by Pad reflect bug (#22798) - @adrianlizarraga
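
To illustrate the int32 bias clipping addressed by #22020, here is a minimal pure-Python sketch. It assumes the common QDQ convention that the bias scale equals `input_scale * weight_scale`. The function names, the `margin` factor, and the example values are hypothetical and chosen for illustration; they are not taken from the quantization tool's source.

```python
# Illustrative sketch only: names and the safety margin are assumptions,
# not the quantization tool's actual implementation.
INT32_MIN = -(2**31)
INT32_MAX = 2**31 - 1


def quantize_bias(bias, input_scale, weight_scale):
    """Quantize one bias value to int32 using bias_scale = input_scale * weight_scale."""
    bias_scale = input_scale * weight_scale
    q = round(bias / bias_scale)
    # Values outside the int32 range are clipped, which loses information.
    return max(INT32_MIN, min(INT32_MAX, q))


def adjust_weight_scale(bias_absmax, input_scale, weight_scale, margin=1.0001):
    """Return a weight scale large enough that the largest bias fits in int32."""
    smallest_valid_bias_scale = margin * bias_absmax / INT32_MAX
    if input_scale * weight_scale < smallest_valid_bias_scale:
        # Raise the weight scale so that input_scale * weight_scale is just valid.
        return smallest_valid_bias_scale / input_scale
    return weight_scale


bias, input_scale, weight_scale = 5.0, 1e-5, 1e-5

# With very small scales, bias / bias_scale is about 5e10 and gets clipped.
clipped_q = quantize_bias(bias, input_scale, weight_scale)

# After adjusting the weight scale, the quantized bias fits in int32.
new_weight_scale = adjust_weight_scale(abs(bias), input_scale, weight_scale)
adjusted_q = quantize_bias(bias, input_scale, new_weight_scale)
recovered = adjusted_q * input_scale * new_weight_scale
```

In this sketch, the clipped value dequantizes to roughly 0.21 instead of 5.0, while the adjusted scale recovers the bias almost exactly. The trade-off is a coarser weight scale, so weights for that tensor are quantized with slightly less resolution.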
CPU EP
QNN EP
- Add QNN SDK v2.28.2 support (#22724, #22844) - @HectorSVC, @adrianlizarraga
TensorRT EP
Packaging
- Rework the native library usage so that a pre-built ORT native package can be easily used (#22345) - @skottmckay
- Fix Maven Sha256 Checksum Issue (#22600) - @idiskyle
Contributions
Big thank you to the release manager @yf711, along with @adrianlizarraga, @HectorSVC, @jywu-msft, and everyone else who helped to make this patch release process a smooth one!