#

voice-synthesis

Here are 125 public repositories matching this topic...

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

python text-to-speech deep-learning speech pytorch tts speech-synthesis voice-conversion vocoder voice-synthesis tacotron voice-cloning speaker-encodings melgan speaker-encoder multi-speaker-tts glow-tts hifigan tts-model

Updated Aug 16, 2024
Python

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

data voice voice-commands dataset voice-recognition noise voice-chat datasets voice-control voice-conversion voice-assistant voice-activity-detection voice-synthesis audio-datasets voice-computing voice-dataset voice-datasets audio-dataset

Updated Jun 6, 2024

DanRuta / xVA-Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

electron machine-learning skyrim elder-scrolls speech-synthesis fallout voice-synthesis tacotron

Updated Apr 28, 2024
JavaScript

SforAiDl / Neural-Voice-Cloning-With-Few-Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

deep-learning voice tts speech-processing voice-synthesis saidl speaker-adaptation voice-cloning speaker-encodings mel-spectogram

Updated Feb 23, 2021
Python

hujinsen / pytorch-StarGAN-VC

Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .

pytorch voice-conversion voice-synthesis stargan pytorch-implementation voice-converter stargan-vc

Updated Mar 28, 2024
Python

ZDisket / TensorVox

Desktop application for neural speech synthesis written in C++

text-to-speech real-time desktop tts speech-synthesis phoneme voice-synthesis tacotron2 multiband-melgan mb-melgan fastspeech2

Updated Mar 1, 2023
C++

ManimCommunity / manim-voiceover

Manim plugin for all things voiceover

text-to-speech ai tts speech-synthesis voice-synthesis voiceover manim math-animations

Updated Sep 29, 2024
Python

Voice-synthesis

smoke-trees / Voice-synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to…

tensorflow keras speech-to-text voice-synthesis voice-cloning pytorch-implementation sv2tts

Updated Sep 25, 2020
Python

zakaton / Pink-Trombone

A programmable version of Neil Thapen's Pink Trombone

api web-component voice speech-synthesis web-audio auditory-display voice-synthesis sound-design voice-ui web-audio-worklet procedural-audio pink-trombone voice-design vocal-tract

Updated Dec 20, 2023
JavaScript

JollyToday / GhostCut-auto_video_translation

auto video translation-video translator can auto translate video hard subtitles, auto video translation and dubbing, remove any video text, auto remove video subtitles/text. 自动视频翻译配音，自动翻译视频字幕和回填样式，自动硬字幕翻译。

material-design ffmpeg tts subtitles moviepy inpainting video-api video-maker-api translation-api voice-synthesis video-translation dubbing-service video-subtitles video-voice

Updated Oct 24, 2023
Python

Azure-Samples / Cognitive-Services-Voice-Assistant

Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command based Voice Assistant to your own Azure subscription

microsoft bot sdk bots wpf bot-framework voice-commands microsoft-bot-framework speech-recognition dotnet-core microsoft-cognitive-services speech-to-text voice-control voice-assistant botframework voice-synthesis

Updated Oct 4, 2023
C++

tts-arabic-pytorch

nipponjo / tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

python text-to-speech deep-learning speech pytorch tts speech-synthesis arabic voice-synthesis torchaudio tacotron2-pytorch tacotron2 multi-speaker-tts hifi-gan hifigan fastpitch tts-model arabic-tts vocos

Updated Nov 5, 2024
Jupyter Notebook

sidmulajkar / sentiment-predictor-for-stress-detection

Voice stress analysis (VSA) aims to differentiate between stressed and non-stressed outputs in response to stimuli (e.g., questions posed), with high stress seen as an indication of deception. In this work, we propose a deep learning-based psychological stress detection model using speech signals. With increasing demands for communication betwee…

deep-learning voice emotion stress-testing convolutional-neural-network emotion-detection emotion-recognition deception voice-synthesis stimuli stress-detector vsa depression-detection cortisol stress-detection voice-stress-analysis stressed-outputs speech-signals

Updated Oct 18, 2021
Jupyter Notebook

com.rest.elevenlabs

RageAgainstThePixel / com.rest.elevenlabs

A non-official Eleven Labs voice synthesis client for Unity (UPM)

ai unity unity3d ml tts upm voice-synthesis upm-package openupm

Updated Nov 5, 2024
C#

YuzukiTsuru / lessampler

lessampler is a Singing Voice Synthesizer

dsp voice synthesizer synthesis utau svs voice-synthesis singing-voice singing-synthesis openutau

Updated Nov 11, 2022
C++

spokestack-android

spokestack / spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

android text-to-speech nlu voice speech tts speech-synthesis voice-recognition speech-recognition vad asr voice-assistant natural-language-understanding voice-as-an-interface speech-api voice-activity-detection voice-synthesis wakeword wakeword-activation

Updated Oct 18, 2021
Java

ElevenLabs-DotNet

RageAgainstThePixel / ElevenLabs-DotNet

A Non-Official ElevenLabs RESTful API Client for dotnet

ai dotnet ml tts speech-synthesis voice-synthesis tts-api eleven-labs

Updated Oct 27, 2024
C#

chdh / klatt-syn

Klatt formant synthesizer

speech-synthesis klatt voice-synthesis formant klatt-synthesizer klattsyn

Updated Oct 11, 2023
TypeScript

hparcells / rtvc

💬 "Realtime" voice transcription and cloning using ElevenLabs's API.

api website web ai interactive transcription voice-synthesis voice-cloning speech-to-speech voicecloning elevenlabs

Updated Mar 1, 2023
TypeScript

olaviinha / NeuralTextToAudio

Text prompt steered synthetic audio generators

audio colab audio-synthesis music-generation audio-processing voice-synthesis text2music colab-notebook voice-cloning audio-generation audioldm text2audio mubert mubertai

Updated Dec 2, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the voice-synthesis topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the voice-synthesis topic, visit your repo's landing page and select "manage topics."