Skip to content

second-state/silero_vad_server

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

An API server for AI VAD

This is the VAD server that is designed to wok with the EchoKit server.

Install libtorch dependencies

Linux x86 CPU with or without CUDA and ROCm

# download libtorch
curl -LO https://download.pytorch.org/libtorch/cu124/libtorch-cxx11-abi-shared-with-deps-2.4.0%2Bcu124.zip
unzip libtorch-cxx11-abi-shared-with-deps-2.4.0%2Bcu124.zip

MacOS on Apple Silicon (M-series) devices

curl -LO https://download.pytorch.org/libtorch/cpu/libtorch-macos-arm64-2.8.0.zip
unzip libtorch-macos-arm64-2.8.0.zip

Then, tell the system where to find your LibTorch.

# Add to ~/.zprofile or ~/.bash_profile
export LD_LIBRARY_PATH=$(pwd)/libtorch/lib:$LD_LIBRARY_PATH
export LIBTORCH=$(pwd)/libtorch 

Build the API server

git clone https://github.com/second-state/silero_vad_server

cd silero_vad_server
cargo build --release

Run the API server

VAD_LISTEN=0.0.0.0:9094 nohup target/release/silero_vad_server &

In the EchoKit server configuration, you can now use the VAD server in the [asr] section to use it together with any ASR API.

[asr]
url = "http://localhost:9092/v1/audio/transcriptions"
lang = "auto"
prompt = "Hello\n你好\n(noise)\n(bgm)\n(silence)\n"
vad_url = "http://localhost:9094/v1/audio/vad"

About

VAD (Voice Activity Detection) as a service, written in Rust

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Contributors 2

  •  
  •