Rhino SDK for Arduino boards - Mandarin language

Made in Vancouver, Canada by Picovoice

Rhino is Picovoice's Speech-to-Intent engine. It directly infers intent from spoken commands within a given context of interest, in real-time. For example, given a spoken command:

Can I have a small double-shot espresso?

Rhino infers what the user wants and emits the following inference result:

{
  "isUnderstood": "true",
  "intent": "orderBeverage",
  "slots": {
    "beverage": "espresso",
    "size": "small",
    "numberOfShots": "2"
  }
}

Rhino is:

using deep neural networks trained in real-world environments.
compact and computationally-efficient. It is perfect for IoT.
self-service. Developers can train custom contexts using Picovoice Console.

Compatibility

Arduino Nano 33 BLE Sense

Dependency

LibPrintf

AccessKey

The Rhino SDK requires a valid AccessKey at initialization. AccessKeys act as your credentials when using Rhino SDKs. You can create your AccessKey for free. Make sure to keep your AccessKey secret.

To obtain your AccessKey:

Login or Signup for a free account on the Picovoice Console.
Once logged in, go to the AccessKey tab to create one or use an existing AccessKey.

Integration

define all the necessary variables before setup():

#include <Rhino_ZH.h>

#define MEMORY_BUFFER_SIZE ...
static uint8_t memory_buffer[MEMORY_BUFFER_SIZE] __attribute__((aligned(16));

static const char* ACCESS_KEY = ...; //AccessKey string obtained from [Picovoice Console](https://picovoice.ai/console/)

const uint8_t CONTEXT_ARRAY[] = {...};
static const float SENSITIVITY = 0.75f;
static const float ENDPOINT_DURATION_SEC = 1.0f;
static const bool REQUIRE_ENDPOINT = true;

pv_rhino_t *handle = NULL;

Sensitivity is the parameter that enables developers to trade miss rate for false alarm. A higher sensitivity value results in fewer misses at the cost of (potentially) increasing the erroneous inference rate.

Endpoint duration is a chunk of silence at the end of an utterance that marks the end of spoken command. A lower endpoint duration reduces delay and improves responsiveness. A higher endpoint duration assures Rhino doesn't return inference pre-emptively in case the user pauses before finishing the request.

Require endpoint is a parameter when set to true, Rhino requires an endpoint (a chunk of silence) after the spoken command. If set to false, Rhino tries to detect silence, but if it cannot, it still will provide inference regardless. Set to false only if operating in an environment with overlapping speech (e.g. people talking in the background).

handle is an instance of Rhino runtime engine.

put the following code block inside setup() in order to initialize the Rhino engine:

const pv_status_t status = pv_rhino_init(
        ACCESS_KEY,
        memory_buffer,
        MEMORY_BUFFER_SIZE,
        CONTEXT_ARRAY,
        sizeof(CONTEXT_ARRAY),
        SENSITIVITY,
        ENDPOINT_DURATION_SEC,
        REQUIRE_ENDPOINT,
        &handle);

if (status != PV_STATUS_SUCCESS) {
    // error handling logic
}

Rhino accepts single channel, 16-bit PCM audio. The sample rate can be retrieved using pv_sample_rate(). Rhino accepts input audio in consecutive chunks (aka frames); the length of each frame can be retrieved using pv_rhino_frame_length(). Inside the loop() function in the sketch, pass the recorded audio to the Rhino engine:

const int16_t *pcm = pv_audio_rec_get_new_buffer()
bool is_finalized = false;
pv_status_t status = pv_rhino_process(handle, pcm, &is_finalized);
if (status != PV_STATUS_SUCCESS) {
    // error handling logic
}
if (is_finalized) {
    // inference event logic/callback
}

Create Custom Context

Compile and upload the Rhino_ZH/GetUUID sketch from the File -> Examples menu. Copy the UUID of the board printed at the beginning of the session to the serial monitor.
Go to Picovoice Console to create a context for Rhino speech to intent engine.
Select Arm Cortex M as the platform when training the model.
Select your board type (Arduino Nano 33 BLE Sense) and provide the UUID of the chipset on the board.

Import the Custom Context

Download your custom voice model(s) from Picovoice Console.
Decompress the zip file. The model for Rhino speech to intent is located in two files: A binary .rhn file, and as a .h header file containing a C array version of the binary model.
Copy the contents of the array inside the .h header file and update the CONTEXT_ARRAY values in params.h.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
examples		examples
src		src
LICENSE		LICENSE
README.md		README.md
keywords.txt		keywords.txt
library.properties		library.properties

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Rhino SDK for Arduino boards - Mandarin language

Compatibility

Dependency

AccessKey

Integration

Create Custom Context

Import the Custom Context

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Picovoice/rhino-arduino-zh

Folders and files

Latest commit

History

Repository files navigation

Rhino SDK for Arduino boards - Mandarin language

Compatibility

Dependency

AccessKey

Integration

Create Custom Context

Import the Custom Context

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages