ASR

ASR
- Survey
- ASR
- Projects
- Datasets
- Whisper
- Toolkits
- Products
- Misc

Survey

ASR

Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models, arXiv, 2501.02832, arxiv, pdf, cication: -1

Syed Abdul Gaffar Shakhadri, Kruthika KR, Kartik Basavaraj Angadi
🌟 A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models, arXiv, 2411.08742, arxiv, pdf, cication: -1

Dingdong Wang, Mingyu Cui, Dongchao Yang, ..., Xueyuan Chen, Helen Meng
Moonshine: Speech Recognition for Live Transcription and Voice Commands, arXiv, 2410.15608, arxiv, pdf, cication: -1

Nat Jeffries, Evan King, Manjunath Kudlur, ..., James Wang, Pete Warden · (moonshine - usefulsensors)

Projects

transformers.js-examples - huggingface
whisper-ner - aiola-lab

· (arxiv)
CrisperWhisper - nyrahealth

· (huggingface)

Datasets

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context, arXiv, 2309.08105, arxiv, pdf, cication: -1

Wei Kang, Xiaoyu Yang, Zengwei Yao, ..., Long Lin, Daniel Povey

Whisper

CrisperWhisper: Accurate Timestamps on Verbatim Speech Transcriptions, arXiv, 2408.16589, arxiv, pdf, cication: -1

Laurin Wagner, Bernhard Thallinger, Mario Zusag · (huggingface)
Whisper-ZeroA complete rework of Whisper ASR that eliminates hallucinations and drastically improves accuracy.

Toolkits

Products

Next-gen Speech AI for next-level product experiences

· (x)

Misc

Misc

Robust ASR Error Correction with Conservative Data Filtering 🤗