This PR adds support for runtimes other than ONNX Runtime.
There are plenty of alternative ONNX inference engines for Rust, each with its own unique qualities:

- `candle` boasts impressive performance and GPU acceleration.
- `tract` is a battle-tested pure-Rust inference engine with excellent operator support.
- `wonnx` focuses on broader GPU support and is designed for the web.

With the removal of `wasm32-unknown-unknown` support from `ort`, and the Sisyphean task of getting the damn thing to link, it's clear that ONNX Runtime isn't always the best choice. Most often, though, it is the best choice for one platform, but not another. An application wishing to target CUDA on desktop and WebGPU on the web would need two different code paths, one using `ort` and one using `wonnx`. Adding support for `wonnx` & others directly in `ort` means developers only need the `ort` API to target both backends, and only a single line of code is required to switch between them, e.g. `ort::set_api(ort_candle::api());`
Status

- `ort-candle`
  - Alternative 'execution providers' (appears to currently be unsupported by `candle-onnx`?)
  - `IoBinding`
- `ort-tract`
- `ort-wonnx`
p.s., sponsorships allow me to spend more time on this PR =)