-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: added deepgram for transcription #97
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In testing it out, whisper is much more accurate than the default nova model. I'd suggest leaving the current whisper code as is and adding a flag to use deepgram but be able to pass it various models.
One of the problems is that the chapters breaking up the timestamps. Instead of breaking up the file, I'd suggest adding the headers by locating the timestamp and keeping the file intact. That might be difficult without granular timestamps like deepgram provides, but it should be possible even with the whisper srt format. If we keep the file intact, we also should get summarization via deepgram which can close #38.
Additionally, the default should be with the speaker diarization off and turn it on via flag and/or adding multiple speakers to the speakers metadata.
Could you please provide an example of the first part. |
4beaccb
to
6a941da
Compare
…zation doesn't work with chapters)
testing the chapters
Otherwise this is ready to merge. EDIT: 025e519 should fix it |
This PR also fixes #38 |
This PR fixes #96.