Skip to content
Change the repository type filter

All

    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.4k36k1.2k474Updated Feb 2, 2025Feb 2, 2025
    • Python
      Apache License 2.0
      1612094Updated Jan 31, 2025Jan 31, 2025
    • Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
      Python
      Apache License 2.0
      759161237Updated Jan 31, 2025Jan 31, 2025
    • HCL
      17803Updated Jan 31, 2025Jan 31, 2025
    • HTML
      MIT License
      7601Updated Jan 30, 2025Jan 30, 2025
    • SCSS
      MIT License
      7400Updated Jan 30, 2025Jan 30, 2025
    • Community maintained hardware plugin for vLLM on Spyre
      Apache License 2.0
      0200Updated Jan 29, 2025Jan 29, 2025
    • Community maintained hardware plugin for vLLM on Ascend
      Apache License 2.0
      31211Updated Jan 29, 2025Jan 29, 2025
    • Fast and memory-efficient exact attention
      C++
      BSD 3-Clause "New" or "Revised" License
      1.4k4308Updated Jan 26, 2025Jan 26, 2025
    • An adaptor to allow Python allocator for PyTorch pluggable allocator
      C++
      Apache License 2.0
      0200Updated Jan 5, 2025Jan 5, 2025
    • media-kit

      Public
      vLLM Logo Assets
      0000Updated Dec 12, 2024Dec 12, 2024
    • vllm-nccl

      Public archive
      Manages vllm-nccl dependency
      Python
      Apache License 2.0
      31620Updated Jun 3, 2024Jun 3, 2024
    • dashboard

      Public
      vLLM performance dashboard
      Python
      Apache License 2.0
      42000Updated Apr 26, 2024Apr 26, 2024