Skip to content

Conversation

@tangkangqi
Copy link

Description

This PR adds a TensorRT-LLM acceleration example for Cosmos-Reason1, including:

  • Complete setup guide with Docker instructions
  • TensorRT-LLM server configuration
  • Gradio-based web interface for video inference
  • Benchmark examples
  • Documentation of known issues

Changes

  • Added examples/acceleration/README.md with comprehensive setup instructions
  • Added examples/acceleration/trtllm_infer_web.py for web-based inference
  • Included example images and documentation

@tangkangqi tangkangqi changed the title Feature/acceleration acceleration examples Nov 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant