Speaker recognition using Mel Frequency Cepstral Coefficients (MFCC) and the Linde-Buzo-Gray (LBG) clustering algorithm. The implementation is based on this MATLAB tutorial. It gives correct results about 65% of the time on this data set.
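To illustrate the clustering step, here is a minimal sketch of LBG codebook training in plain JavaScript. It is not the code in this repository: the function names (`lbg`, `centroid`, `nearestIndex`, `squaredDistance`) and the `epsilon` and `tolerance` defaults are assumptions chosen for the example, and the input is assumed to be an array of equal-length feature vectors such as MFCC frames.

```javascript
// Squared Euclidean distance between two equal-length vectors.
function squaredDistance(a, b) {
  let sum = 0;
  for (let i = 0; i < a.length; i++) {
    const d = a[i] - b[i];
    sum += d * d;
  }
  return sum;
}

// Index of the codeword closest to `vector`.
function nearestIndex(vector, codebook) {
  let best = 0;
  let bestDist = Infinity;
  for (let i = 0; i < codebook.length; i++) {
    const dist = squaredDistance(vector, codebook[i]);
    if (dist < bestDist) {
      bestDist = dist;
      best = i;
    }
  }
  return best;
}

// Component-wise mean of a non-empty list of vectors.
function centroid(vectors) {
  const mean = new Array(vectors[0].length).fill(0);
  for (const v of vectors) {
    for (let i = 0; i < v.length; i++) mean[i] += v[i] / vectors.length;
  }
  return mean;
}

// LBG: start from a single codeword (the global centroid), then repeatedly
// split every codeword in two and refine until the average distortion stops
// improving. The codebook doubles on each split, so the result has the
// smallest power-of-two size that is >= codebookSize.
function lbg(vectors, codebookSize, epsilon = 0.01, tolerance = 1e-3) {
  let codebook = [centroid(vectors)];
  while (codebook.length < codebookSize) {
    // Split step: perturb each codeword by (1 + epsilon) and (1 - epsilon).
    codebook = codebook.flatMap((c) => [
      c.map((x) => x * (1 + epsilon)),
      c.map((x) => x * (1 - epsilon)),
    ]);
    let previous = Infinity;
    for (;;) {
      // Assign each vector to its nearest codeword and measure distortion.
      const clusters = codebook.map(() => []);
      let distortion = 0;
      for (const v of vectors) {
        const i = nearestIndex(v, codebook);
        clusters[i].push(v);
        distortion += squaredDistance(v, codebook[i]);
      }
      distortion /= vectors.length;
      // Move each codeword to its cluster centroid; keep it if the cluster is empty.
      codebook = codebook.map((c, i) =>
        clusters[i].length ? centroid(clusters[i]) : c
      );
      // Stop once the relative improvement falls below the tolerance.
      if (distortion === 0 || (previous - distortion) / distortion < tolerance) break;
      previous = distortion;
    }
  }
  return codebook;
}
```

Calling `lbg(frames, 16)` on one speaker's feature vectors would return 16 codewords that summarize that speaker. The codebook size is a tuning choice and is not taken from this project.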
bash scripts/preprocess-audio.sh
Creates the audio-preprocessed folder
./src/cli.js train ./audio-preprocessed/train/
Creates training-results.json
Requires training-results.json
./src/cli.js recognize ./audio-preprocessed/test/F101_test_1.wav
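Recognition with vector-quantization codebooks is usually done by picking the speaker whose codebook fits the input best. The sketch below shows that idea only. The layout of training-results.json is not documented here, so the `codebooks` object (speaker name mapped to an array of codewords) and the function names `averageDistortion` and `recognizeSpeaker` are hypothetical.

```javascript
// Average squared distance from each feature vector to its nearest codeword.
function averageDistortion(vectors, codebook) {
  let total = 0;
  for (const v of vectors) {
    let best = Infinity;
    for (const c of codebook) {
      let dist = 0;
      for (let i = 0; i < v.length; i++) dist += (v[i] - c[i]) ** 2;
      if (dist < best) best = dist;
    }
    total += best;
  }
  return total / vectors.length;
}

// Pick the speaker whose codebook yields the lowest average distortion.
// `codebooks` is a hypothetical { speakerName: codebook } object, not the
// actual structure of training-results.json.
function recognizeSpeaker(vectors, codebooks) {
  let bestSpeaker = null;
  let bestScore = Infinity;
  for (const [speaker, codebook] of Object.entries(codebooks)) {
    const score = averageDistortion(vectors, codebook);
    if (score < bestScore) {
      bestScore = score;
      bestSpeaker = speaker;
    }
  }
  return { speaker: bestSpeaker, distortion: bestScore };
}

// Example with made-up two-dimensional data.
const exampleCodebooks = {
  speakerA: [[1, 1], [2, 2]],
  speakerB: [[10, 10], [12, 12]],
};
const exampleResult = recognizeSpeaker([[1.1, 0.9], [2.1, 1.9]], exampleCodebooks);
console.log(exampleResult.speaker);
```

With real data, `vectors` would be the MFCC frames of the file being recognized, and each codebook would come from training.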
Requires training-results.json
./src/cli.js test-performance ./audio-preprocessed/test/