cnlinxi

Follow

🎯

Focusing

cnlinxi

🎯

Focusing

Follow

242 followers · 127 following

Achievements

Achievements

Highlights

Pro

Lists (26)

Sort

AcousticFrontend

10 repositories

AcousticModel

71 repositories

ASR

14 repositories

ASR-pretrain

ASV

AudioQuality

10 repositories

AwesomeList

Paper list, awesome list and so on.

63 repositories

BandwidthExtension

Classification

Codec

Data

90 repositories

Develop

Evaluation

FrontEnd

FrontEnd for Text-to-Speech

18 repositories

How-to

LLM

84 repositories

Music

Performance

Quant

SingingVoiceSynthesis

SpeechEditing

SpeechSeperation

Tools

31 repositories

Universal Method

Vocoder

20 repositories

VoiceConversion

Starred repositories

volcengine / verl

veRL: Volcano Engine Reinforcement Learning for LLM

Python 1,219 89 Updated Jan 29, 2025

WangRongsheng / awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

3,051 347 Updated Jan 29, 2025

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 17,911 1,285 Updated Jan 27, 2025

jingyaogong / minimind-v

🚀 「大模型」3小时从0训练27M参数的视觉多模态VLM！🌏 Train a 27M-parameter VLM from scratch in just 3 hours!

Python 813 82 Updated Dec 13, 2024

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis

Python 125 11 Updated Jan 16, 2025

MaximeVandegar / Papers-in-100-Lines-of-Code

Implementation of papers in 100 lines of code.

Python 1,412 150 Updated Dec 2, 2024

imxtx / awesome-controllabe-speech-synthesis

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

112 3 Updated Jan 14, 2025

nyrahealth / CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 546 26 Updated Dec 19, 2024

andysingal / llm-course

Jupyter Notebook 432 45 Updated Jan 26, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 4,914 568 Updated Oct 22, 2024

liutaocode / TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 354 22 Updated Jan 29, 2025

LqNoob / Neural-Codec-and-Speech-Language-Models

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 95 6 Updated Jan 24, 2025

THUDM / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 2,596 211 Updated Dec 5, 2024

pytorch / torchtitan

A PyTorch native library for large model training

Python 3,213 257 Updated Jan 29, 2025

pytorch / torchtune

PyTorch native post-training library

Python 4,769 506 Updated Jan 29, 2025

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,412 236 Updated Jan 27, 2025

mubtasimahasan / DM-Codec

Source code for DM-Codec.

Python 35 2 Updated Oct 18, 2024

line / promptttspp

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions

Python 67 5 Updated Oct 11, 2024

alessandroragano / scoreq

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 63 4 Updated Jan 24, 2025

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,201 167 Updated Dec 6, 2024

felldo / JEmoji

Java Emoji (JEmoji) is a lightweight, fast and auto generated emoji library for Java with the purpose to improve and ease working with emojis

Java 67 10 Updated Jan 17, 2025

Open-Source-O1 / Open-O1

Python 1,150 41 Updated Nov 21, 2024

GTSinger / GTSinger

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 225 9 Updated Dec 25, 2024

mcf330 / efts2code

source code of EfficientTTS 2

Python 12 1 Updated Feb 18, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,179 449 Updated Jan 29, 2025

mpariente / pytorch_stoi

STOI loss function in PyTorch

Python 89 20 Updated Sep 30, 2024

janhq / ichigo

Local realtime voice AI

Python 2,183 118 Updated Jan 22, 2025

hlt-mt / mosel

Collection of Open Source Speech Data

151 6 Updated Nov 8, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 535 42 Updated Oct 17, 2024

Glanvery / LLM-Travel

欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。

Jupyter Notebook 287 34 Updated Jul 21, 2024

Starred topics

stt

speaker-recognition

Awesome Lists