    Repositories list

    • modyn

      Public
      Modyn is a research platform for training ML models on growing datasets.
      Python
      MIT License
      646967 Updated Mar 4, 2025
    • mixtera

      Public
      A lightweight, user-friendly data-plane for LLM training.
      Python
      MIT License
      37224 Updated Mar 4, 2025
    • Adaptive LLM Inference
      Python
      1491 Updated Mar 1, 2025
    • Starter code for semester project in Cloud Computing Architecture course at ETH Zurich
      Python
      11700 Updated Feb 24, 2025
    • triteia

      Public
      Useful Kernels for ML in Triton
      Cuda
      Apache License 2.0
      0120 Updated Feb 21, 2025
    • Artifact Evaluation for DeltaZip
      Jupyter Notebook
      Apache License 2.0
      0000 Updated Feb 21, 2025
    • A native PyTorch Library for large model training
      Python
      BSD 3-Clause "New" or "Revised" License
      308000 Updated Feb 19, 2025
    • Temporary nuts and bolts for running mixtera training runs on clariden
      Python
      0000 Updated Feb 19, 2025
    • deltazip

      Public
      Compression for Foundation Models
      Jupyter Notebook
      Apache License 2.0
      32701 Updated Feb 14, 2025
    • CUDA benchmarks for measuring GPU utilization and interference
      Cuda
      MIT License
      1600 Updated Feb 11, 2025
    • A native PyTorch Library for large model training
      Python
      BSD 3-Clause "New" or "Revised" License
      308000 Updated Feb 7, 2025
    • Nuts and bolts for evaluating models trained in the context of mixtera
      Python
      0000 Updated Feb 7, 2025
    • orion

      Public
      An interference-aware scheduler for fine-grained GPU sharing
      Python
      MIT License
      1912591 Updated Jan 26, 2025
    • vidur

      Public
      A large-scale simulation framework for LLM inference
      Python
      MIT License
      60000 Updated Dec 11, 2024
    • dirigent

      Public
      Dirigent: Lightweight Serverless Orchestration
      Go
      MIT License
      53301 Updated Dec 8, 2024
    • Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
      Python
      Apache License 2.0
      94000 Updated Nov 28, 2024
    • pccheck

      Public
      Python
      MIT License
      0400 Updated Nov 8, 2024
    • Contains instructions and scripts for the ATC'24 Pecan artifact evaluation.
      Python
      Apache License 2.0
      1000 Updated Oct 14, 2024
    • fmengine

      Public
      Utilities for Training Very Large Models
      Python
      105840 Updated Sep 25, 2024
    • cachew

      Public
      ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
      C++
      Apache License 2.0
      75k3702 Updated Sep 10, 2024
    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      164000 Updated Jul 19, 2024
    • mlibc

      Public
      Portable C standard library
      C
      MIT License
      143000 Updated Jul 15, 2024
    • Hosts CGLM metadata
      0000 Updated Jun 6, 2024
    • serving

      Public
      Kubernetes-based, scale-to-zero, request-driven compute
      Go
      Apache License 2.0
      1.2k000 Updated May 7, 2024
    • Copy node connection information easily
      JavaScript
      0100 Updated Mar 13, 2024
    • rWasm

      Public
      A cross-platform high-performance provably-safe sandboxing Wasm-to-native compiler
      Rust
      7000 Updated Jan 14, 2024
    • ML Input Data Processing as a Service
      Python
      Apache License 2.0
      2801 Updated Oct 20, 2023
    • Python
      Other
      0000 Updated Aug 16, 2023
    • airflow

      Public
      Python
      Apache License 2.0
      1001 Updated Feb 17, 2023
    • varuna

      Public
      Python
      29000 Updated Jul 25, 2022