[BUG] Lambda based Youtube video transcript download seems to be blocked #491

Open
typex1 opened this issue Aug 13, 2024 · 2 comments
Labels: bug (Something isn't working), deprecated (Related feature will be deprecated)

typex1 commented Aug 13, 2024

Describe the bug

Creating a new bot whose knowledge base includes a YouTube transcript fails. The frontend shows the error: "Failed to detect language: Could not retrieve a transcript for the video https://www.youtube.com/watch?v=Pv0cfsastFs"

This error message is misleading: the failure is not in language detection specifically. The transcript API as a whole appears to be unusable.
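
To illustrate the distinction, here is a minimal sketch of how the two failure modes could be reported separately. It is not the project's actual code: `TranscriptUnavailableError`, `LanguageDetectionError`, `detect_transcript_language` and the injected `list_languages` callable are all hypothetical names standing in for whatever transcript client the backend uses.

```python
from typing import Callable, Sequence


class TranscriptUnavailableError(Exception):
    """No transcript data could be fetched at all (hypothetical name)."""


class LanguageDetectionError(Exception):
    """Transcript data was reachable, but no language was found (hypothetical name)."""


def detect_transcript_language(
    video_url: str,
    list_languages: Callable[[str], Sequence[str]],
) -> str:
    """Return the first available transcript language code for video_url.

    list_languages is an injected, hypothetical callable that returns the
    language codes of the transcripts available for a video.
    """
    try:
        languages = list_languages(video_url)
    except Exception as exc:
        # The request itself failed (for example, a blocked IP address),
        # so report that instead of blaming language detection.
        raise TranscriptUnavailableError(
            f"Could not retrieve a transcript for {video_url}: {exc}"
        ) from exc
    if not languages:
        # The request worked, but there is nothing to detect a language from.
        raise LanguageDetectionError(
            f"No transcript languages found for {video_url}"
        )
    return languages[0]
```

With a split like this, the frontend could show "transcript unavailable" for a blocked request and reserve "failed to detect language" for the case where a transcript listing came back empty.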

After considerable research, I have good evidence that AWS-owned IP addresses are currently blocked from downloading YouTube transcripts. This applies at least to Lambda functions and Cloud9. I tested in us-east-1, eu-central-1 and ap-northeast-1.
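
For anyone who wants to repeat the comparison across regions, a probe along these lines could be deployed as a Lambda function. This is only a sketch under assumptions: `make_probe_handler` and the injected `fetch_transcript` callable are hypothetical, and the event shape (`{"video_id": ...}`) is made up for illustration. The only AWS-specific detail relied on is the `AWS_REGION` environment variable, which the Lambda runtime sets.

```python
import os


def make_probe_handler(fetch_transcript):
    """Build a Lambda-style handler around a transcript fetcher.

    fetch_transcript is a hypothetical callable taking a video ID and
    raising an exception when the transcript cannot be downloaded.
    """

    def handler(event, context):
        video_id = event["video_id"]  # assumed event shape
        region = os.environ.get("AWS_REGION", "unknown")
        try:
            fetch_transcript(video_id)
        except Exception as exc:
            # Record the failure together with the region it occurred in.
            return {
                "region": region,
                "video_id": video_id,
                "ok": False,
                "error": f"{type(exc).__name__}: {exc}",
            }
        return {"region": region, "video_id": video_id, "ok": True, "error": None}

    return handler
```

Running the same handler from a non-AWS machine with the same fetcher gives a baseline to compare the per-region results against.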

To Reproduce

Steps to reproduce the behavior:

  • Create a new bot and add any YouTube URL to its knowledge base. Once the sync phase finishes, the bot shows the error described above.

Additional context

Screenshot 2024-08-13 at 14 26 53

@typex1 typex1 changed the title [BUG] Lambda based Youtube video download seems to be blocked [BUG] Lambda based Youtube video transcript download seems to be blocked Aug 13, 2024

typex1 commented Aug 13, 2024

The error as it appears in the bot overview list:
Screenshot 2024-08-13 at 14 47 41

@statefb statefb self-assigned this Aug 14, 2024
@statefb statefb added the bug Something isn't working label Aug 14, 2024

statefb commented Aug 14, 2024

@statefb statefb added the deprecated Related feature will be deprecated label Aug 14, 2024