Skip to content
View cnlinxi's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report cnlinxi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

veRL: Volcano Engine Reinforcement Learning for LLM

Python 1,219 89 Updated Jan 29, 2025

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

3,051 347 Updated Jan 29, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 17,911 1,285 Updated Jan 27, 2025

🚀 「大模型」3小时从0训练27M参数的视觉多模态VLM!🌏 Train a 27M-parameter VLM from scratch in just 3 hours!

Python 813 82 Updated Dec 13, 2024

Codec for paper: LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis

Python 125 11 Updated Jan 16, 2025

Implementation of papers in 100 lines of code.

Python 1,412 150 Updated Dec 2, 2024

This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".

112 3 Updated Jan 14, 2025

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

Python 546 26 Updated Dec 19, 2024
Jupyter Notebook 432 45 Updated Jan 26, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 4,914 568 Updated Oct 22, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 354 22 Updated Jan 29, 2025

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 95 6 Updated Jan 24, 2025

GLM-4-Voice | 端到端中英语音对话模型

Python 2,596 211 Updated Dec 5, 2024

A PyTorch native library for large model training

Python 3,213 257 Updated Jan 29, 2025

PyTorch native post-training library

Python 4,769 506 Updated Jan 29, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,412 236 Updated Jan 27, 2025

Source code for DM-Codec.

Python 35 2 Updated Oct 18, 2024

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions

Python 67 5 Updated Oct 11, 2024

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 63 4 Updated Jan 24, 2025

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,201 167 Updated Dec 6, 2024

Java Emoji (JEmoji) is a lightweight, fast and auto generated emoji library for Java with the purpose to improve and ease working with emojis

Java 67 10 Updated Jan 17, 2025
Python 1,150 41 Updated Nov 21, 2024

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 225 9 Updated Dec 25, 2024

source code of EfficientTTS 2

Python 12 1 Updated Feb 18, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 5,179 449 Updated Jan 29, 2025

STOI loss function in PyTorch

Python 89 20 Updated Sep 30, 2024

Local realtime voice AI

Python 2,183 118 Updated Jan 22, 2025

Collection of Open Source Speech Data

151 6 Updated Nov 8, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 535 42 Updated Oct 17, 2024

欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。

Jupyter Notebook 287 34 Updated Jul 21, 2024
Next
Showing results