CameronNg

Cam Ng CameronNg

Achievements

Stars

menloresearch / cortex.python

C++ code that run Python embedding

C++ 5 1 Updated Oct 17, 2024

menloresearch / cortex.tensorrt-llm

Forked from NVIDIA/TensorRT-LLM

Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.

C++ 43 3 Updated Sep 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cam Ng CameronNg

Achievements

Achievements

Block or report CameronNg

Stars

menloresearch / cortex.python

menloresearch / cortex.tensorrt-llm