Skip to content
Georgy Treshchev edited this page Apr 8, 2023 · 21 revisions

Runtime Speech Recognizer Documentation

Runtime Speech Recognizer is an open-source plugin that enables real-time, offline speech recognition. Based on Whisper OpenAI technology, particularly whisper.cpp library, and supports multiple language models pre-selected in the plugin's settings.

How to install

There're two ways to install the plugin:

  1. Through the marketplace.
  2. Manual installation. Select and download the release for the required engine version, extract the archive into your plugins project folder to get the following path: "[ProjectName] / Plugins / RuntimeSpeechRecognizer". Then download the language models by following this page if needed.

Basic description

This plugin provides real-time speech recognition using advanced algorithms based on whisper.cpp library. It matches incoming audio data, provided as a stream or non-stream input, against pre-trained language models.

Clone this wiki locally