🎤 feat: Improve speech-to-text with configurable silence timeout #9090

dirkpetersen · 2025-08-16T15:08:01Z

🎤 feat: Improve speech-to-text with configurable silence timeout and text accumulation

Summary

Add configurable silence timeout (1-15s, default 8s) to prevent premature recording stops during thinking pauses
Implement text accumulation across speech recognition sessions to preserve previously spoken text
Add SilenceTimeoutSelector component with slider control in advanced speech settings
Enhance browser STT to accumulate text instead of replacing on each recognition cycle
Modify external STT to use configurable timeout instead of hardcoded 3-second limit
Add double-click functionality to microphone button for manual text clearing
Include clearAccumulatedText() methods for both browser and external STT implementations
Add localization strings for silence timeout and speech text cleared messages
Preserve accumulated text until successful message submission or manual clear
Added test cases and docs

This resolves the issue where speech-to-text would delete previous text after pauses, allowing users to think while speaking without losing their words.

🤖 Generated with Claude Code

Change Type

Please delete any irrelevant options.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update
Translation update

…text accumulation - Add configurable silence timeout (1-15s, default 8s) to prevent premature recording stops during thinking pauses - Implement text accumulation across speech recognition sessions to preserve previously spoken text - Add SilenceTimeoutSelector component with slider control in advanced speech settings - Enhance browser STT to accumulate text instead of replacing on each recognition cycle - Modify external STT to use configurable timeout instead of hardcoded 3-second limit - Add double-click functionality to microphone button for manual text clearing - Include clearAccumulatedText() methods for both browser and external STT implementations - Add localization strings for silence timeout and speech text cleared messages - Preserve accumulated text until successful message submission or manual clear This resolves the issue where speech-to-text would delete previous text after pauses, allowing users to think while speaking without losing their words. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

- Add tests for new SilenceTimeoutSelector component with slider functionality - Add tests for updated useSpeechToTextBrowser hook with text accumulation logic - Add tests for updated useSpeechToTextExternal hook with configurable timeout - Add tests for enhanced AudioRecorder component with double-click clear functionality - Add tests for new silenceTimeoutMs store setting and existing speech settings - Add integration tests for Speech settings UI with advanced mode interactions - Ensure test coverage for all new features: configurable silence timeout, text accumulation, and manual clearing Tests cover component rendering, user interactions, state management, accessibility, and integration scenarios to ensure robust functionality of the speech-to-text improvements. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

- Update README.md with new speech features: configurable silence detection, text accumulation, and manual clearing - Add CHANGELOG.md entry for the speech-to-text improvements in unreleased section - Create SPEECH_FEATURES.md with comprehensive documentation covering: - Feature descriptions and usage instructions - Technical implementation details for both browser and external STT - Configuration options and accessibility features - Testing coverage and migration notes - Backwards compatibility information Documentation provides clear guidance for users and developers on the enhanced speech-to-text functionality that addresses text deletion during thinking pauses. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

Major bug fixes: - Fix text accumulation logic that was replacing instead of appending text - Fix accumulated text being cleared when toggling recording sessions - Prevent concurrent recording sessions Performance improvements: - Optimize silence detection from 60Hz to 10Hz (83% CPU reduction) - Improve resource management with stream reuse - Add throttling and debouncing for better efficiency Enhanced error handling: - Add comprehensive permission error handling with specific messages - Implement network error recovery with retry logic - Add offline state detection and handling User experience improvements: - Add mobile double-tap support with proper debouncing - Preserve accumulated text across recording sessions - Provide clear, actionable error messages Testing and code quality: - Fix test file extensions for JSX support - Add comprehensive edge case test coverage - Add detailed code documentation This resolves the issue where speech-to-text would delete previous text after pauses, significantly improving the user experience. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <[email protected]>

- Remove CLAUDE.md from version control (keep local copy) - Add CLAUDE.md to .gitignore to prevent future tracking - This file contains project-specific Claude Code configuration

github-advanced-security

ESLint found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

dustinhealy · 2025-08-19T13:10:50Z

Hi Dirk,

Thank you for your contribution! Could you please resolve the outstanding ESLint issues before I review?

dirkpetersen · 2025-08-19T15:05:42Z

Thanks Dustin, hope i have time to work on this on the weekend. Allowing me to think while speaking without losing my words is quite important to me

Dirk Petersen and others added 5 commits August 16, 2025 07:56

chore: Remove CLAUDE.md from git tracking and add to .gitignore

e9cef36

- Remove CLAUDE.md from version control (keep local copy) - Add CLAUDE.md to .gitignore to prevent future tracking - This file contains project-specific Claude Code configuration

github-advanced-security bot found potential problems Aug 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

🎤 feat: Improve speech-to-text with configurable silence timeout #9090

🎤 feat: Improve speech-to-text with configurable silence timeout #9090

dirkpetersen commented Aug 16, 2025

Uh oh!

github-advanced-security bot left a comment

Uh oh!

dustinhealy commented Aug 19, 2025

Uh oh!

dirkpetersen commented Aug 19, 2025

Uh oh!

Uh oh!

Uh oh!

🎤 feat: Improve speech-to-text with configurable silence timeout #9090

Are you sure you want to change the base?

🎤 feat: Improve speech-to-text with configurable silence timeout #9090

Conversation

dirkpetersen commented Aug 16, 2025

Summary

Change Type

Uh oh!

github-advanced-security bot left a comment

Choose a reason for hiding this comment

Uh oh!

dustinhealy commented Aug 19, 2025

Uh oh!

dirkpetersen commented Aug 19, 2025

Uh oh!

Uh oh!