feat: added deepgram for transcription #97

masterchief164 · 2023-04-17T18:10:21Z

This PR fixes #96.

adamjonas

In testing, whisper is much more accurate than deepgram's default nova model. I'd suggest leaving the current whisper code as is and adding a flag to use deepgram, with the ability to pass it various models.

One of the problems is that the chapters break up the timestamps. Instead of breaking up the file, I'd suggest keeping the file intact and adding the headers by locating the timestamps. That might be difficult without granular timestamps like the ones deepgram provides, but it should be possible even with the whisper srt format. If we keep the file intact, we should also get summarization via deepgram, which can close #38.
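As a rough illustration of the suggestion above, here is a minimal sketch of placing chapter headers into an intact transcript by comparing chapter start times against SRT cue timestamps. The function names, the `(start_seconds, title)` chapter shape, and the `## ` header style are all assumptions made for this example; none of them are taken from the code in this PR.

```python
import re


def srt_time_to_seconds(stamp: str) -> float:
    """Convert an SRT timestamp such as '00:01:02,500' to seconds."""
    hours, minutes, rest = stamp.split(":")
    seconds, millis = rest.split(",")
    return int(hours) * 3600 + int(minutes) * 60 + int(seconds) + int(millis) / 1000


def insert_chapter_headers(srt_text, chapters):
    """Return the transcript text with a header placed before the first cue of each chapter.

    srt_text: full SRT content as one string (the file is never split).
    chapters: list of (start_seconds, title), sorted by start time (assumed shape).
    """
    # SRT cues are separated by blank lines.
    blocks = [b for b in re.split(r"\n\s*\n", srt_text.strip()) if b.strip()]
    out = []
    next_chapter = 0
    for block in blocks:
        lines = block.splitlines()
        # lines[0] is the cue index, lines[1] is "start --> end", the rest is text.
        cue_start = srt_time_to_seconds(lines[1].split(" --> ")[0].strip())
        # Emit every chapter header whose start time has been reached.
        while next_chapter < len(chapters) and chapters[next_chapter][0] <= cue_start:
            out.append(f"## {chapters[next_chapter][1]}")
            next_chapter += 1
        out.append(" ".join(lines[2:]))
    # Chapters that start after the last cue still get a header.
    for _, title in chapters[next_chapter:]:
        out.append(f"## {title}")
    return "\n\n".join(out)
```

Because the headers are inserted into a single pass over the whole file, the text stays in one piece and could still be handed to a summarizer afterwards.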

Additionally, speaker diarization should be off by default and turned on via a flag and/or by adding multiple speakers to the speakers metadata.
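To make the flag suggestions concrete, here is a small sketch of opt-in options using the standard library's `argparse`. The option names `--deepgram`, `--model`, and `--diarize`, the program name, and the helper functions are hypothetical and chosen only for illustration; the project's actual CLI library and flag names may differ.

```python
import argparse


def build_parser():
    """Build a parser where whisper stays the default and deepgram is opt-in."""
    parser = argparse.ArgumentParser(prog="transcribe-sketch")
    parser.add_argument("source", help="audio/video source to transcribe")
    # Hypothetical flag: switch the backend from whisper to deepgram.
    parser.add_argument("--deepgram", action="store_true",
                        help="use deepgram instead of whisper")
    # Hypothetical flag: model name passed through to the chosen backend.
    parser.add_argument("--model", default=None,
                        help="model name to pass to the backend")
    # Hypothetical flag: diarization is off unless explicitly requested.
    parser.add_argument("--diarize", action="store_true",
                        help="enable speaker diarization")
    return parser


def resolve_options(argv):
    """Translate command-line arguments into a plain options dict."""
    args = build_parser().parse_args(argv)
    backend = "deepgram" if args.deepgram else "whisper"
    return {"backend": backend, "model": args.model, "diarize": args.diarize}
```

With no flags, `resolve_options(["video.mp4"])` keeps the existing whisper path with diarization off, which matches the defaults proposed above.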

app/application.py

masterchief164 · 2023-04-19T18:49:08Z

> One of the problems is that the chapters break up the timestamps. Instead of breaking up the file, I'd suggest keeping the file intact and adding the headers by locating the timestamps. That might be difficult without granular timestamps like the ones deepgram provides, but it should be possible even with the whisper srt format. If we keep the file intact, we should also get summarization via deepgram, which can close #38.

Could you please provide an example of the first part?

adamjonas · 2023-04-19T21:05:10Z

https://github.com/adamjonas/bitcointranscripts/pull/113/files

adamjonas · 2023-05-12T00:53:38Z

testing the chapters

tstbtc '1Z7GjXgdUy4' test-dir -p -D -C produces https://github.com/adamjonas/bitcointranscripts/pull/146/files and https://github.com/adamjonas/bitcointranscripts/pull/145/files, which both cut off before transcribing the last chapter of the video.

Otherwise this is ready to merge.

EDIT: 025e519 should fix it
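For context, a cut-off after the last chapter header is a common boundary issue when text is grouped between consecutive chapter starts: the final chapter has no following start time to close it. The sketch below shows one way to handle that case by treating the last chapter as open-ended. It is not the code from 025e519; the function name and the `(start_seconds, text)` and `(start_seconds, title)` shapes are assumptions for this example.

```python
def group_by_chapter(segments, chapters):
    """Group transcript segments under chapter titles.

    segments: list of (start_seconds, text), sorted by start time (assumed shape).
    chapters: list of (start_seconds, title), sorted by start time (assumed shape).
    """
    sections = []
    for i, (chapter_start, title) in enumerate(chapters):
        # The final chapter has no successor, so it runs to the end of the media.
        if i + 1 < len(chapters):
            chapter_end = chapters[i + 1][0]
        else:
            chapter_end = float("inf")
        text = " ".join(
            segment_text
            for segment_start, segment_text in segments
            if chapter_start <= segment_start < chapter_end
        )
        sections.append((title, text))
    return sections
```

A quick check for this kind of bug is to confirm that the last returned section contains the last segment of the transcript.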

masterchief164 · 2023-05-12T10:49:55Z

This PR also fixes #38

masterchief164 added 2 commits April 17, 2023 23:39

feat: added deepgram for transcription

9f1a3a4

fix: removed the option to select different models

eb288ec

adamjonas reviewed Apr 18, 2023

masterchief164 added 3 commits April 20, 2023 23:04

Revert "fix: removed the option to select different models"

9a726e5

This reverts commit eb288ec.

Revert "feat: added deepgram for transcription"

8708491

This reverts commit 9f1a3a4.

fix: added chapters without splitting the original file

6a941da

masterchief164 force-pushed the issue_96 branch from 4beaccb to 6a941da Compare April 22, 2023 18:29

feat: added options for diarization and summary using deepgram (diarization doesn't work with chapters)

1e6df96

masterchief164 force-pushed the issue_96 branch from 4d9e772 to 1e6df96 Compare May 9, 2023 18:50

masterchief164 and others added 2 commits May 10, 2023 03:00

feat: added chapters support to deepgram

8662914

add -M flag for diarize and double-digit timestamps

3d87efe

adamjonas and others added 2 commits May 11, 2023 21:01

add transcription after final chapter header

025e519

feat: added chapters support to deepgram with diarization

5d96b2e

masterchief164 force-pushed the issue_96 branch from 52c941f to 5d96b2e Compare May 12, 2023 10:48

masterchief164 requested a review from adamjonas May 12, 2023 10:51

adamjonas merged commit f0d9d9c into bitcointranscripts:main May 15, 2023
