Create InfiniteStreamRecognizeSoftHandover.java #8251

bburli · 2023-06-12T05:31:14Z

Adding a sample for Soft Handover in stream switching.

Please refer #8250 for issue background.

Description

This class demonstrates how to perform infinite streaming speech recognition using theStreamingRecognize functionality of the Speech API. This class is almost identical to the InfiniteStreamRecognize.java class, except that it demonstrates how to perform "soft handover" between two streams.
A "soft handover" is making a new stream before breaking the old stream. As against a "hard handover" where you break the old stream before making the new stream. This is useful in situations where you want to perform speech recognition on a continuous audio input, but need to periodically restart the stream to avoid exceeding the maximum allowed continuous streaming duration. For demonstration purposes only, this sample uses a reset duration of 30 seconds whereas the actual allowed duration is 5 minutes per Google documentation for Streaming API.
This class uses two streams, STREAM1 and STREAM2, and alternates between them. When one stream is active, the other stream is used to buffer audio input. When the active stream is stopped, the buffered audio input is used to create a new stream.
This class also demonstrates how to align the transcript of the previous stream with the transcript of the current stream using a very simplistic algorithm.

Fixes #8250

Note: Before submitting a pull request, please open an issue for discussion if you are not associated with Google.

Checklist

- Adding a sample for Soft Handover in stream switching. Please refer GoogleCloudPlatform#8250 for issue background.

google-cla · 2023-06-12T05:31:18Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

snippet-bot · 2023-06-12T05:31:21Z

Here is the summary of changes.

You are about to add 1 region tag.

speech/src/main/java/com/example/speech/InfiniteStreamRecognizeSoftHandover.java:19, tag speech_transcribe_infinite_streaming_soft_handover

This comment is generated by snippet-bot.
If you find problems with this result, please file an issue at:
https://github.com/googleapis/repo-automation-bots/issues.
To update this comment, add snippet-bot:force-run label or use the checkbox below:

Refresh this comment

Ran formatting with google-java-formatter.

anguillanneuf

Please

add a test
move the region tag start tag to line 18 (to include imports in the displayed/published sample)
can you link to where in the docs this sample will get published? have you checked with the tech writer?

bburli · 2023-06-13T09:28:50Z

Hello @anguillanneuf!

This example is a variation of InfiniteStreamingRecognize.java. As I see, that file also doesn't have a test. I am not sure how to write a test for infinite streaming use case. Can you provide any pointers?
Done.
I am not sure who decides to publish this. I brought in a PR because I thought it is an important example to share on Google streaming, especially where application control of failover is desirable. Issue InfiniteStreaming with soft handover. #8250 has more background as I mentioned in description. The infinite streaming class I referred above links from https://cloud.google.com/speech-to-text/docs/endless-streaming-tutorial page. I haven't checked with the tech writer. I am not sure how to do that as this is my first PR. I would appreciate any guidance in this regard.

anguillanneuf · 2023-06-13T23:49:04Z

Thanks @bburli for providing the great context for your PR! I apologize that I only had a chance to read carefully through the links you provided today. I'm sorry what i'm about to tell you may disappoint you. I have provided some options for you to consider.

This repo [language]-docs-samples is reserved for docs samples. If a sample you want to contribute is not going to be published on g.co/cloud, we cannot include it here. We've established sample guidelines over the years and we now require all samples to have tests (so they can be trusted by developers). You correctly pointed out that speech/src/main/java/com/example/speech/InfiniteStreamRecognize.java doesn't have a test. It looks like we missed it. I have gone ahead and opened the following issues to address our lack of tests for the published samples at https://cloud.google.com/speech-to-text/docs/endless-streaming-tutorial:

I also saw your communication with @minherz. He and I are on the same team and I will follow up with him offline. A helpful sample is different from a sample that we are committed to maintain thus a sample we will publish on g.co/cloud. I would recommend that you publish it in a different venue.

Google Cloud Publication on Medium: https://medium.com/google-cloud. Write a full blogpost exploring the topic and show code snippets.
Google Cloud Community: https://www.googlecloudcommunity.com/gc/forums/filteredbylabelpage/board-id/cloud-ai-ml/label-name/Speech-to-Text. Convert the issue you filed there as a question, and post your code there as an answer. You can feel free to point other folks (customers, developers, partners) there for further discussions.

In order to publish your sample here, you need to engage with the product team and the TW to plan it. If they approve and are committed to add a docs page or a section in the docs describing what this sample is and why, we will allow it, and we will at that point require this sample to have a test.

bburli · 2023-06-14T09:56:42Z

@anguillanneuf Thank you so much for your guidance on this. I have a few questions:
1.

Thanks @bburli for providing the great context for your PR! I apologize that I only had a chance to read carefully through the links you provided today. I'm sorry what i'm about to tell you may disappoint you. I have provided some options for you to consider.

This repo [language]-docs-samples is reserved for docs samples. If a sample you want to contribute is not going to be published on g.co/cloud, we cannot include it here. We've established sample guidelines over the years and we now require all samples to have tests (so they can be trusted by developers). You correctly pointed out that speech/src/main/java/com/example/speech/InfiniteStreamRecognize.java doesn't have a test. It looks like we missed it. I have gone ahead and opened the following issues to address our lack of tests for the published samples at https://cloud.google.com/speech-to-text/docs/endless-streaming-tutorial:

Test needed for speech/src/main/java/com/example/speech/InfiniteStreamRecognize.java #8261

Test missing for speech/microphone/transcribe_streaming_infinite.py python-docs-samples#10233

Test needed for speech/infiniteStreaming.js nodejs-docs-samples#3271

I also saw your communication with @minherz. He and I are on the same team and I will follow up with him offline. A helpful sample is different from a sample that we are committed to maintain thus a sample we will publish on g.co/cloud. I would recommend that you publish it in a different venue.

Google Cloud Publication on Medium: https://medium.com/google-cloud. Write a full blogpost exploring the topic and show code snippets.

Google Cloud Community: https://www.googlecloudcommunity.com/gc/forums/filteredbylabelpage/board-id/cloud-ai-ml/label-name/Speech-to-Text. Convert the issue you filed there as a question, and post your code there as an answer. You can feel free to point other folks (customers, developers, partners) there for further discussions.

In order to publish your sample here, you need to engage with the product team and the TW to plan it. If they approve and are committed to add a docs page or a section in the docs describing what this sample is and why, we will allow it, and we will at that point require this sample to have a test.

Hi @anguillanneuf - Thank you for your guidance here. This helps.

I have posted on community: https://www.googlecloudcommunity.com/gc/AI-ML/Soft-Handover-in-Infinite-streaming/m-p/602877/thread-id/2153
Question: How can I work with the concerned product team on this? I am not from Google, so I am not sure how to reach out or even who to reach out. Please do let me know. I am open to any feedback regardless of whether the PR makes it to the repo.

anguillanneuf · 2023-06-14T16:08:01Z

@bburli I mistook you for a Googler who would have contact with the product team and their tech writer, ofc you would not know how to reach out, I'm sorry about that. Please ignore the last paragraph in my previous response. I saw your post in Google Cloud Community. Thanks a lot. I'm going to go ahead and close this PR and the related issue.

Create InfiniteStreamRecognizeSoftHandover.java

d18d228

- Adding a sample for Soft Handover in stream switching. Please refer GoogleCloudPlatform#8250 for issue background.

bburli requested review from a team and yoshi-approver as code owners June 12, 2023 05:31

product-auto-label bot added samples Issues that are directly related to samples. api: speech Issues related to the Speech-to-Text API. labels Jun 12, 2023

blunderbuss-gcf bot assigned anguillanneuf Jun 12, 2023

bburli added 2 commits June 12, 2023 11:02

Formatting changes

024a20b

Ran formatting with google-java-formatter.

Updating copyright text

7a60aa3

anguillanneuf requested changes Jun 12, 2023

View reviewed changes

Moving START region tag after package declartion

1ebc192

bburli requested a review from anguillanneuf June 13, 2023 09:31

anguillanneuf closed this Jun 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create InfiniteStreamRecognizeSoftHandover.java #8251

Create InfiniteStreamRecognizeSoftHandover.java #8251

bburli commented Jun 12, 2023 •

edited

Loading

google-cla bot commented Jun 12, 2023

snippet-bot bot commented Jun 12, 2023 •

edited

Loading

anguillanneuf left a comment •

edited

Loading

bburli commented Jun 13, 2023 •

edited

Loading

anguillanneuf commented Jun 13, 2023 •

edited

Loading

bburli commented Jun 14, 2023

anguillanneuf commented Jun 14, 2023 •

edited

Loading

Create InfiniteStreamRecognizeSoftHandover.java #8251

Create InfiniteStreamRecognizeSoftHandover.java #8251

Conversation

bburli commented Jun 12, 2023 • edited Loading

Adding a sample for Soft Handover in stream switching.

Description

Checklist

google-cla bot commented Jun 12, 2023

snippet-bot bot commented Jun 12, 2023 • edited Loading

anguillanneuf left a comment • edited Loading

Choose a reason for hiding this comment

bburli commented Jun 13, 2023 • edited Loading

anguillanneuf commented Jun 13, 2023 • edited Loading

bburli commented Jun 14, 2023

anguillanneuf commented Jun 14, 2023 • edited Loading

bburli commented Jun 12, 2023 •

edited

Loading

snippet-bot bot commented Jun 12, 2023 •

edited

Loading

anguillanneuf left a comment •

edited

Loading

bburli commented Jun 13, 2023 •

edited

Loading

anguillanneuf commented Jun 13, 2023 •

edited

Loading

anguillanneuf commented Jun 14, 2023 •

edited

Loading