Skip to content
Georgy Treshchev edited this page May 1, 2023 · 21 revisions

Runtime Speech Recognizer Documentation

Runtime Speech Recognizer is an open-source plugin that enables real-time, offline speech recognition. Based on Whisper OpenAI technology, particularly whisper.cpp library, and supports multiple language models pre-selected in the plugin's settings.

Please be aware that the 4.27 engine may experience more errors in speech recognition, particularly in streaming mode, and speech recognition may also take longer. We're currently investigating this issue.

How to install

There're two ways to install the plugin:

  1. Through the marketplace.
  2. Manual installation. Select and download the release for the required engine version, extract the archive into your plugins project folder to get the following path: "[ProjectName] / Plugins / RuntimeSpeechRecognizer".

On first run, install language models (a dialog box will appear asking you to do this automatically).

Basic description

This plugin provides real-time speech recognition using advanced algorithms based on whisper.cpp library. It matches incoming audio data, provided as a stream or non-stream input, against pre-trained language models.

Clone this wiki locally