Supertone Public API: Supertone API is a RESTful API for using our state-of-the-art AI voice models.
The SDK can be installed with uv, pip, or poetry package managers.
uv is a fast Python package installer and resolver, designed as a drop-in replacement for pip and pip-tools. It's recommended for its speed and modern Python tooling capabilities.
uv add supertone
PIP is the default package installer for Python, enabling easy installation and management of packages from PyPI via the command line.
pip install supertone
Poetry is a modern tool that simplifies dependency management and package publishing by using a single pyproject.toml file to handle project metadata and dependencies.
poetry add supertone
Tip
To finish publishing your SDK to PyPI you must run your first generation action.
Note
Python version upgrade policy
Once a Python version reaches its official end-of-life date, a 3-month grace period is provided for users to upgrade. After this grace period, the minimum Python version supported by the SDK will be raised.
You can use this SDK in a Python shell with uv and the uvx command that comes with it like so:
uvx --from supertone python
It's also possible to write a standalone Python script without needing to set up a whole project like so:
#!/usr/bin/env -S uv run --script
# /// script
# requires-python = ">=3.9"
# dependencies = [
# "supertone",
# ]
# ///
from supertone import Supertone
sdk = Supertone(
    # SDK arguments
)
# Rest of script here...
Once that is saved to a file, you can run it with uv run script.py, where script.py can be replaced with the actual file name.
Generally, the SDK will work well with most IDEs out of the box. However, when using PyCharm, you can enjoy much better integration with Pydantic by installing an additional plugin.
# Synchronous Example
from supertone import Supertone, models
with Supertone(
    api_key="<YOUR_API_KEY_HERE>",
) as s_client:
    res = s_client.text_to_speech.create_speech(voice_id="<id>", text="<value>", language=models.APIConvertTextToSpeechUsingCharacterRequestLanguage.JA, model=models.APIConvertTextToSpeechUsingCharacterRequestModel.SONA_SPEECH_1, output_format=models.APIConvertTextToSpeechUsingCharacterRequestOutputFormat.WAV, include_phonemes=False)
    # Handle response
    print(res)
The same SDK client can also be used to make asynchronous requests by importing asyncio.
# Asynchronous Example
import asyncio
from supertone import Supertone, models
async def main():
    async with Supertone(
        api_key="<YOUR_API_KEY_HERE>",
    ) as s_client:
        res = await s_client.text_to_speech.create_speech_async(voice_id="<id>", text="<value>", language=models.APIConvertTextToSpeechUsingCharacterRequestLanguage.JA, model=models.APIConvertTextToSpeechUsingCharacterRequestModel.SONA_SPEECH_1, output_format=models.APIConvertTextToSpeechUsingCharacterRequestOutputFormat.WAV, include_phonemes=False)
        # Handle response
        print(res)
asyncio.run(main())
This SDK supports the following security scheme globally:
| Name | Type | Scheme |
|---|---|---|
| api_key | apiKey | API key |
To authenticate with the API the api_key parameter must be set when initializing the SDK client instance. For example:
from supertone import Supertone, models
with Supertone(
    api_key="<YOUR_API_KEY_HERE>",
) as s_client:
    res = s_client.text_to_speech.create_speech(voice_id="<id>", text="<value>", language=models.APIConvertTextToSpeechUsingCharacterRequestLanguage.JA, model=models.APIConvertTextToSpeechUsingCharacterRequestModel.SONA_SPEECH_1, output_format=models.APIConvertTextToSpeechUsingCharacterRequestOutputFormat.WAV, include_phonemes=False)
    # Handle response
    print(res)
Supertone’s Text-to-Speech API provides multiple TTS models, each with different supported languages, available voice settings, and streaming capabilities.
| Model Name | Identifier | Streaming Support (stream_speech) | Voice Settings Support |
|---|---|---|---|
| SONA Speech 1 | sona_speech_1 | ✅ Supported | Supports all Voice Settings |
| Supertonic API 1 | supertonic_api_1 | ❌ Not supported | Supports only the speed setting (others are ignored) |
| SONA Speech 2 | sona_speech_2 | ❌ Not supported | Supports all Voice Settings except subharmonic_amplitude_control |
| SONA Speech 2 Flash | sona_speech_2_flash | ❌ Not supported | Supports all Voice Settings except similarity, text_guidance, subharmonic_amplitude_control |
Note
Streaming Support
Streaming TTS using the stream_speech endpoint is only available for the sona_speech_1 model.
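The streaming column of the table above can be encoded as a small lookup so that application code picks an endpoint per model. This is only a sketch: `STREAMING_SUPPORT` and `pick_method` are hypothetical names that are not part of the SDK, and the values are copied by hand from the table.

```python
# Streaming support per model identifier, copied from the table above.
STREAMING_SUPPORT = {
    "sona_speech_1": True,
    "supertonic_api_1": False,
    "sona_speech_2": False,
    "sona_speech_2_flash": False,
}


def pick_method(model: str) -> str:
    """Return the name of the SDK method to use for a model (hypothetical helper).

    Falls back to create_speech when the model does not support stream_speech.
    """
    if model not in STREAMING_SUPPORT:
        raise ValueError(f"Unknown model identifier: {model!r}")
    return "stream_speech" if STREAMING_SUPPORT[model] else "create_speech"


print(pick_method("sona_speech_1"))
print(pick_method("sona_speech_2"))
```

Keeping the lookup in one place means a future change in streaming availability only needs one edit.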
Note
The set of supported input languages varies depending on the TTS model.
- sona_speech_1: en, ko, ja
- supertonic_api_1: en, ko, ja, es, pt
- sona_speech_2: en, ko, ja, bg, cs, da, el, es, et, fi, hu, it, nl, pl, pt, ro, ar, de, fr, hi, id, ru, vi
- sona_speech_2_flash: en, ko, ja, bg, cs, da, el, es, et, fi, hu, it, nl, pl, pt, ro, ar, de, fr, hi, id, ru, vi
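A request can be validated locally before it is sent by checking the language code against the lists above. The sketch below assumes nothing about the SDK itself: `SUPPORTED_LANGUAGES` and `is_language_supported` are hypothetical names, and the codes are transcribed from this section.

```python
# Language codes per model identifier, transcribed from the lists above.
_SONA_2_LANGUAGES = frozenset(
    "en ko ja bg cs da el es et fi hu it nl pl pt ro ar de fr hi id ru vi".split()
)

SUPPORTED_LANGUAGES = {
    "sona_speech_1": frozenset({"en", "ko", "ja"}),
    "supertonic_api_1": frozenset({"en", "ko", "ja", "es", "pt"}),
    "sona_speech_2": _SONA_2_LANGUAGES,
    "sona_speech_2_flash": _SONA_2_LANGUAGES,
}


def is_language_supported(model: str, language: str) -> bool:
    """Return True if the model accepts the given language code (hypothetical helper)."""
    if model not in SUPPORTED_LANGUAGES:
        raise ValueError(f"Unknown model identifier: {model!r}")
    return language.lower() in SUPPORTED_LANGUAGES[model]


print(is_language_supported("sona_speech_1", "es"))
print(is_language_supported("sona_speech_2", "es"))
```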
Some TTS models support optional voice settings that allow fine control over output speech characteristics (e.g., speed, pitch, pitch variance).
Note
The available Voice Settings vary depending on the TTS model.
- sona_speech_1
  - Supports all available Voice Settings.
- supertonic_api_1
  - Supports only the speed setting. All other settings will be ignored.
- sona_speech_2
  - Supports all Voice Settings except subharmonic_amplitude_control.
- sona_speech_2_flash
  - Supports all Voice Settings except similarity, text_guidance, subharmonic_amplitude_control.
All Voice Settings are optional. When omitted, each model’s default values will be applied.
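Because unsupported settings differ by model, it can help to drop them explicitly before building a request instead of relying on the server to ignore them. The following is a minimal sketch under stated assumptions: `filter_voice_settings` is a hypothetical helper, the settings are modeled as a plain dict, and the numeric values are made up for illustration.

```python
from typing import Any, Dict

# Settings each model does not honor, per the list above.
UNSUPPORTED_SETTINGS = {
    "sona_speech_1": frozenset(),
    "sona_speech_2": frozenset({"subharmonic_amplitude_control"}),
    "sona_speech_2_flash": frozenset(
        {"similarity", "text_guidance", "subharmonic_amplitude_control"}
    ),
}

# supertonic_api_1 is described as an allowlist instead: only speed is honored.
ALLOWED_ONLY = {"supertonic_api_1": frozenset({"speed"})}


def filter_voice_settings(model: str, settings: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of settings without the keys the model does not support."""
    if model in ALLOWED_ONLY:
        allowed = ALLOWED_ONLY[model]
        return {k: v for k, v in settings.items() if k in allowed}
    if model not in UNSUPPORTED_SETTINGS:
        raise ValueError(f"Unknown model identifier: {model!r}")
    blocked = UNSUPPORTED_SETTINGS[model]
    return {k: v for k, v in settings.items() if k not in blocked}


# Illustrative values only; they are not recommended defaults.
requested = {"speed": 1.2, "similarity": 3.0, "subharmonic_amplitude_control": 1.0}
print(filter_voice_settings("sona_speech_2_flash", requested))
```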
Available methods
- create_cloned_voice - Create cloned voice
- list_custom_voices - Gets custom (cloned) voices
- search_custom_voices - Search custom (cloned) voices
- get_custom_voice - Get single cloned voice
- edit_custom_voice - Update cloned voice (partial update)
- delete_custom_voice - Delete cloned voice
- create_speech - Convert text to speech
- stream_speech - Convert text to speech with streaming response
- predict_duration - Predict text-to-speech duration
- get_voice_usage - Retrieve TTS API usage data
- get_usage - Retrieve advanced API usage analytics
- get_credit_balance - Retrieve credit balance
- list_voices - Gets available voices
- search_voices - Search voices.
- get_voice - Get voice details by ID
Certain SDK methods accept file objects as part of a request body or multi-part request. It is possible and typically recommended to upload files as a stream rather than reading the entire contents into memory. This avoids excessive memory consumption and potentially crashing with out-of-memory errors when working with very large files. The following example demonstrates how to attach a file stream to a request.
Tip
For endpoints that handle file uploads, byte arrays can also be used. However, using streams is recommended for large files.
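The difference between the two forms can be sketched without calling the API. The helpers below are hypothetical; only the `file_name` and `content` keys come from this README. The stream variant keeps the file open inside a `with` block so the handle is closed once the upload would be finished.

```python
import os
import tempfile


def payload_from_stream(path, handle):
    """Build a files-style dict that carries an open binary stream."""
    return {"file_name": os.path.basename(path), "content": handle}


def payload_from_bytes(path):
    """Build a files-style dict that carries the whole file in memory."""
    with open(path, "rb") as f:
        return {"file_name": os.path.basename(path), "content": f.read()}


# Create a throwaway 1 KiB file so the sketch is self-contained.
with tempfile.NamedTemporaryFile(suffix=".wav", delete=False) as tmp:
    tmp.write(b"\x00" * 1024)
    sample_path = tmp.name

with open(sample_path, "rb") as stream:
    streamed = payload_from_stream(sample_path, stream)
    # The SDK call would go here, while the file is still open.
    first_chunk = streamed["content"].read(16)

in_memory = payload_from_bytes(sample_path)
os.remove(sample_path)

print(len(first_chunk), len(in_memory["content"]), stream.closed)
```

The stream variant reads data on demand, while the bytes variant holds the full file in memory, which is why streams are the safer choice for large files.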
from supertone import Supertone
with Supertone(
    api_key="<YOUR_API_KEY_HERE>",
) as s_client:
    res = s_client.custom_voices.create_cloned_voice(files={
        "file_name": "example.file",
        "content": open("example.file", "rb"),
    }, name="<value>")
    # Handle response
    print(res)
Some of the endpoints in this SDK support retries. If you use the SDK without any configuration, it will fall back to the default retry strategy provided by the API. However, the default retry strategy can be overridden on a per-operation basis, or across the entire SDK.
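A backoff retry strategy generally waits longer after each failed attempt, up to a cap, and gives up once a total time budget is spent. The sketch below illustrates that general technique only. It is not the SDK's implementation, it omits the random jitter that real clients usually add, and the parameter names (`initial_ms`, `max_ms`, `exponent`, `max_elapsed_ms`) are assumptions chosen for this example.

```python
from typing import List


def backoff_delays(
    initial_ms: float, max_ms: float, exponent: float, max_elapsed_ms: float
) -> List[float]:
    """Return the waits (in ms) an exponential backoff would use before giving up."""
    if initial_ms <= 0 or max_ms <= 0 or exponent < 1:
        raise ValueError("initial_ms and max_ms must be > 0 and exponent >= 1")
    delays: List[float] = []
    elapsed = 0.0
    attempt = 0
    while True:
        # Grow the wait geometrically, but never beyond the per-wait cap.
        delay = min(initial_ms * exponent ** attempt, max_ms)
        # Stop once the next wait would exceed the total time budget.
        if elapsed + delay > max_elapsed_ms:
            break
        delays.append(delay)
        elapsed += delay
        attempt += 1
    return delays


print(backoff_delays(100, 1000, 2.0, 5000))
```

With these illustrative inputs the waits double from 100 ms until they hit the 1000 ms cap, and the schedule ends when the 5000 ms budget would be exceeded.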
To change the default retry strategy for a single API call, simply provide a RetryConfig object to the call:
from supertone import Supertone, models
from supertone.utils import BackoffStrategy, RetryConfig
with Supertone(
    api_key="<YOUR_API_KEY_HERE>",
) as s_client:
    # A positional argument cannot follow keyword arguments in Python, so the
    # RetryConfig is passed by keyword. The keyword name `retries` is an assumption;
    # check the method signature in your installed SDK version.
    res = s_client.text_to_speech.create_speech(voice_id="<id>", text="<value>", language=models.APIConvertTextToSpeechUsingCharacterRequestLanguage.JA, model=models.APIConvertTextToSpeechUsingCharacterRequestModel.SONA_SPEECH_1, output_format=models.APIConvertTextToSpeechUsingCharacterRequestOutputFormat.WAV, include_phonemes=False,
        retries=RetryConfig("backoff", BackoffStrategy(1, 50, 1.1, 100), False))
    # Handle response
    print(res)
If you'd like to override the default retry strategy for all operations that support retries, you can use the retry_config optional parameter when initializing the SDK:
from supertone import Supertone, models
from supertone.utils import BackoffStrategy, RetryConfig
with Supertone(
    retry_config=RetryConfig("backoff", BackoffStrategy(1, 50, 1.1, 100), False),
    api_key="<YOUR_API_KEY_HERE>",
) as s_client:
    res = s_client.text_to_speech.create_speech(voice_id="<id>", text="<value>", language=models.APIConvertTextToSpeechUsingCharacterRequestLanguage.JA, model=models.APIConvertTextToSpeechUsingCharacterRequestModel.SONA_SPEECH_1, output_format=models.APIConvertTextToSpeechUsingCharacterRequestOutputFormat.WAV, include_phonemes=False)
    # Handle response
    print(res)
SupertoneError is the base class for all HTTP error responses. It has the following properties:
| Property | Type | Description |
|---|---|---|
| err.message | str | Error message |
| err.status_code | int | HTTP response status code, e.g. 404 |
| err.headers | httpx.Headers | HTTP response headers |
| err.body | str | HTTP body. Can be an empty string if no body is returned. |
| err.raw_response | httpx.Response | Raw HTTP response |
| err.data | | Optional. Some errors may contain structured data. See Error Classes. |
from supertone import Supertone, errors, models
with Supertone(
    api_key="<YOUR_API_KEY_HERE>",
) as s_client:
    res = None
    try:
        res = s_client.text_to_speech.create_speech(voice_id="<id>", text="<value>", language=models.APIConvertTextToSpeechUsingCharacterRequestLanguage.JA, model=models.APIConvertTextToSpeechUsingCharacterRequestModel.SONA_SPEECH_1, output_format=models.APIConvertTextToSpeechUsingCharacterRequestOutputFormat.WAV, include_phonemes=False)
        # Handle response
        print(res)
    except errors.SupertoneError as e:
        # The base class for HTTP error responses
        print(e.message)
        print(e.status_code)
        print(e.body)
        print(e.headers)
        print(e.raw_response)
        # Depending on the method different errors may be thrown
        if isinstance(e, errors.BadRequestErrorResponse):
            print(e.data.status)  # str
            print(e.data.message)  # str
Primary error:
- SupertoneError: The base class for HTTP error responses.
Less common errors (15)
Network errors:
- httpx.RequestError: Base class for request errors.
- httpx.ConnectError: HTTP client was unable to make a request to a server.
- httpx.TimeoutException: HTTP request timed out.
Inherit from SupertoneError:
- UnauthorizedErrorResponse: Unauthorized: Invalid API key. Status code 401. Applicable to 10 of 15 methods.*
- InternalServerErrorResponse: Status code 500. Applicable to 10 of 15 methods.*
- NotFoundErrorResponse: Status code 404. Applicable to 9 of 15 methods.*
- BadRequestErrorResponse: Status code 400. Applicable to 5 of 15 methods.*
- ForbiddenErrorResponse: Status code 403. Applicable to 4 of 15 methods.*
- RequestTimeoutErrorResponse: Status code 408. Applicable to 4 of 15 methods.*
- TooManyRequestsErrorResponse: Status code 429. Applicable to 4 of 15 methods.*
- PaymentRequiredErrorResponse: Status code 402. Applicable to 3 of 15 methods.*
- PayloadTooLargeErrorResponse: Payload Too Large: File size exceeds 3MB limit. Status code 413. Applicable to 1 of 15 methods.*
- UnsupportedMediaTypeErrorResponse: Unsupported Media Type: Invalid audio file format. Status code 415. Applicable to 1 of 15 methods.*
- ResponseValidationError: Type mismatch between the response data and the expected Pydantic model. Provides access to the Pydantic validation error via the cause attribute.
* Check the method documentation to see if the error is applicable.
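For logging or alerting it can be handy to have the status-to-class relationship from the list above in one place. The sketch below is a plain lookup that does not import the SDK; the class names are kept as strings, and `expected_error_class` is a hypothetical helper.

```python
# Status code to error class name, transcribed from the list above.
ERROR_CLASS_BY_STATUS = {
    400: "BadRequestErrorResponse",
    401: "UnauthorizedErrorResponse",
    402: "PaymentRequiredErrorResponse",
    403: "ForbiddenErrorResponse",
    404: "NotFoundErrorResponse",
    408: "RequestTimeoutErrorResponse",
    413: "PayloadTooLargeErrorResponse",
    415: "UnsupportedMediaTypeErrorResponse",
    429: "TooManyRequestsErrorResponse",
    500: "InternalServerErrorResponse",
}


def expected_error_class(status_code: int) -> str:
    """Return the documented error class name, or the base class for other codes."""
    return ERROR_CLASS_BY_STATUS.get(status_code, "SupertoneError")


print(expected_error_class(429))
print(expected_error_class(418))
```

Since every class in the table inherits from SupertoneError, catching the base class first and branching on the specific type afterwards covers codes that are not listed.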
Additional example code can be found in the examples directory.
The default server can be overridden globally by passing a URL to the server_url: str optional parameter when initializing the SDK client instance. For example:
from supertone import Supertone, models
with Supertone(
    server_url="https://supertoneapi.com",
    api_key="<YOUR_API_KEY_HERE>",
) as s_client:
    res = s_client.text_to_speech.create_speech(voice_id="<id>", text="<value>", language=models.APIConvertTextToSpeechUsingCharacterRequestLanguage.JA, model=models.APIConvertTextToSpeechUsingCharacterRequestModel.SONA_SPEECH_1, output_format=models.APIConvertTextToSpeechUsingCharacterRequestOutputFormat.WAV, include_phonemes=False)
    # Handle response
    print(res)
The Python SDK makes API calls using the httpx HTTP library. In order to provide a convenient way to configure timeouts, cookies, proxies, custom headers, and other low-level configuration, you can initialize the SDK client with your own HTTP client instance.
Depending on whether you are using the sync or async version of the SDK, you can pass an instance of HttpClient or AsyncHttpClient respectively. Both are Protocols that ensure the client has the methods needed to make API calls.
This allows you to wrap the client with your own custom logic, such as adding custom headers, logging, or error handling, or you can just pass an instance of httpx.Client or httpx.AsyncClient directly.
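The Protocol-based design means any object with the right methods is accepted, with no inheritance required. The sketch below shows that idea with the standard library only. `SupportsSend`, `PlainClient`, and `HeaderAddingClient` are hypothetical stand-ins, and requests are simplified to dicts of headers; this is not the SDK's HttpClient definition.

```python
from typing import Dict, Protocol, runtime_checkable


@runtime_checkable
class SupportsSend(Protocol):
    """Hypothetical protocol: anything that has a send() method qualifies."""

    def send(self, request: Dict[str, str]) -> Dict[str, str]: ...


class PlainClient:
    """Stand-in for a real HTTP client; it echoes the headers it was given."""

    def send(self, request: Dict[str, str]) -> Dict[str, str]:
        return dict(request)


class HeaderAddingClient:
    """Wrapper that adds headers, then delegates to the wrapped client."""

    def __init__(self, inner: SupportsSend, extra_headers: Dict[str, str]) -> None:
        self.inner = inner
        self.extra_headers = extra_headers

    def send(self, request: Dict[str, str]) -> Dict[str, str]:
        merged = {**request, **self.extra_headers}
        return self.inner.send(merged)


wrapped = HeaderAddingClient(PlainClient(), {"x-custom-header": "someValue"})
response = wrapped.send({"accept": "audio/wav"})
print(isinstance(wrapped, SupportsSend), response)
```

Neither class inherits from `SupportsSend`, yet both satisfy it structurally, which is what allows a wrapper and the wrapped client to be used interchangeably.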
For example, you could specify a header for every request that this SDK makes as follows:
from supertone import Supertone
import httpx
http_client = httpx.Client(headers={"x-custom-header": "someValue"})
s = Supertone(client=http_client)
or you could wrap the client with your own custom logic:
from typing import Any, Optional, Union
from supertone import Supertone
from supertone.httpclient import AsyncHttpClient
import httpx
class CustomClient(AsyncHttpClient):
    client: AsyncHttpClient
    def __init__(self, client: AsyncHttpClient):
        self.client = client
    async def send(
        self,
        request: httpx.Request,
        *,
        stream: bool = False,
        auth: Union[
            httpx._types.AuthTypes, httpx._client.UseClientDefault, None
        ] = httpx.USE_CLIENT_DEFAULT,
        follow_redirects: Union[
            bool, httpx._client.UseClientDefault
        ] = httpx.USE_CLIENT_DEFAULT,
    ) -> httpx.Response:
        # Add a header to every outgoing request before delegating
        request.headers["Client-Level-Header"] = "added by client"
        return await self.client.send(
            request, stream=stream, auth=auth, follow_redirects=follow_redirects
        )
    def build_request(
        self,
        method: str,
        url: httpx._types.URLTypes,
        *,
        content: Optional[httpx._types.RequestContent] = None,
        data: Optional[httpx._types.RequestData] = None,
        files: Optional[httpx._types.RequestFiles] = None,
        json: Optional[Any] = None,
        params: Optional[httpx._types.QueryParamTypes] = None,
        headers: Optional[httpx._types.HeaderTypes] = None,
        cookies: Optional[httpx._types.CookieTypes] = None,
        timeout: Union[
            httpx._types.TimeoutTypes, httpx._client.UseClientDefault
        ] = httpx.USE_CLIENT_DEFAULT,
        extensions: Optional[httpx._types.RequestExtensions] = None,
    ) -> httpx.Request:
        # Pass everything through to the wrapped client unchanged
        return self.client.build_request(
            method,
            url,
            content=content,
            data=data,
            files=files,
            json=json,
            params=params,
            headers=headers,
            cookies=cookies,
            timeout=timeout,
            extensions=extensions,
        )
s = Supertone(async_client=CustomClient(httpx.AsyncClient()))
The Supertone class implements the context manager protocol and registers a finalizer function to close the underlying sync and async HTTPX clients it uses under the hood. This will close HTTP connections, release memory and free up other resources held by the SDK. In short-lived Python programs and notebooks that make a few SDK method calls, resource management may not be a concern. However, in longer-lived programs, it is beneficial to create a single SDK instance via a context manager and reuse it across the application.
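The combination of a context manager and a finalizer can be illustrated with the standard library. This is a sketch of the general pattern, not the SDK's source: `FakeConnection` and `ManagedClient` are hypothetical stand-ins, and `weakref.finalize` is used here as one common way to register a finalizer.

```python
import weakref


class FakeConnection:
    """Stand-in for an HTTP connection pool."""

    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True


class ManagedClient:
    """Closes its connection on context exit, or at garbage collection as a fallback."""

    def __init__(self) -> None:
        self.connection = FakeConnection()
        # The callback is bound to the connection, not to self, so the finalizer
        # does not keep this object alive.
        self._finalizer = weakref.finalize(self, self.connection.close)

    def __enter__(self) -> "ManagedClient":
        return self

    def __exit__(self, exc_type, exc, tb) -> bool:
        # Calling a finalizer runs its callback at most once.
        self._finalizer()
        return False


with ManagedClient() as client:
    conn = client.connection
    open_inside = not conn.closed

print(open_inside, conn.closed)
```

The `with` block gives deterministic cleanup, while the finalizer only acts as a safety net when the block is not used.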
from supertone import Supertone
def main():
    with Supertone(
        api_key="<YOUR_API_KEY_HERE>",
    ) as s_client:
        # Rest of application here...
        pass  # placeholder so the block is valid Python
# Or when using async:
async def amain():
    async with Supertone(
        api_key="<YOUR_API_KEY_HERE>",
    ) as s_client:
        # Rest of application here...
        pass  # placeholder so the block is valid Python
You can set up your SDK to emit debug logs for SDK requests and responses.
You can pass your own logger class directly into your SDK.
from supertone import Supertone
import logging
logging.basicConfig(level=logging.DEBUG)
s = Supertone(debug_logger=logging.getLogger("supertone"))
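Calling `logging.basicConfig(level=logging.DEBUG)` raises the level of the root logger, so other libraries in the process will also start emitting debug output. A narrower alternative, sketched below with the standard library only, configures just the logger that is handed to the SDK. `build_debug_logger` is a hypothetical helper, and the format string is an arbitrary choice.

```python
import io
import logging


def build_debug_logger(name: str = "supertone", stream=None) -> logging.Logger:
    """Return a DEBUG-level logger with its own handler and no propagation to root."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.DEBUG)
    logger.propagate = False  # keep records out of the root logger's handlers
    if not logger.handlers:  # avoid stacking handlers on repeated calls
        handler = logging.StreamHandler(stream)  # stream=None means stderr
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(name)s %(levelname)s %(message)s")
        )
        logger.addHandler(handler)
    return logger


# Write to an in-memory buffer here so the output can be inspected.
buffer = io.StringIO()
sdk_logger = build_debug_logger(stream=buffer)
sdk_logger.debug("debug logging is enabled")
print(buffer.getvalue().strip())
```

The resulting `sdk_logger` could then be passed as the `debug_logger` argument shown above.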