Skip to content
View feifeibear's full-sized avatar

Block or report feifeibear

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. hpcaitech/ColossalAI hpcaitech/ColossalAI Public

    Making large AI models cheaper, faster and more accessible

    Python 38.9k 4.3k

  2. Tencent/TurboTransformers Tencent/TurboTransformers Public

    a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

    C++ 1.5k 198

  3. xdit-project/xDiT xdit-project/xDiT Public

    xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

    Python 1k 102

  4. Tencent/PatrickStar Tencent/PatrickStar Public

    PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.

    Python 749 57

  5. LLMSpeculativeSampling LLMSpeculativeSampling Public

    Fast inference from large lauguage models via speculative decoding

    Python 605 63

  6. long-context-attention long-context-attention Public

    USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

    Python 388 28