Skip to content
View cnlinxi's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report cnlinxi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM

87 repositories

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,034 765 Updated Oct 16, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,673 4,610 Updated Feb 6, 2025
Python 41 9 Updated May 15, 2023

The reproduced code for Google's SoundStorm

Python 263 19 Updated Oct 7, 2023

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,129 423 Updated Aug 23, 2024

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 590 76 Updated Dec 27, 2023

ChatGPT, GenerativeAI and LLMs Timeline

948 59 Updated May 19, 2024

Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.

574 32 Updated Jun 19, 2023

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

Python 488 34 Updated Jan 27, 2025

潘多拉,一个让你呼吸顺畅的ChatGPT。Pandora, a ChatGPT that helps you breathe smoothly.

Python 20,598 3,447 Updated Oct 9, 2023

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,769 771 Updated Feb 11, 2024

Text-to-Audio/Music Generation

Python 2,367 183 Updated Sep 29, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,278 1,107 Updated Nov 14, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,271 122 Updated Jul 11, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 526 45 Updated Jun 9, 2024

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Python 167 11 Updated Jul 12, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,404 163 Updated Jun 25, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,500 4,841 Updated Feb 6, 2025

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,134 89 Updated Dec 12, 2024

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

Python 298 21 Updated Nov 12, 2024

The official implementation of HierSpeech++

Python 1,200 137 Updated Feb 20, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,399 481 Updated Aug 10, 2024

Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning

Python 84 5 Updated Nov 20, 2024
Python 258 18 Updated Jun 8, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,732 3,075 Updated Jan 7, 2025

Unoffical implementation of Megatts2

Python 274 35 Updated Mar 23, 2024
Python 255 24 Updated Mar 15, 2024
36 1 Updated Jan 28, 2024

Foundational model for human-like, expressive TTS

Python 4,015 675 Updated Jul 30, 2024