🎤 VOICEVOX OpenAI TTS API

An API server that exposes the VOICEVOX engine through OpenAI's text-to-speech API format.

🌟 Features

Accepts requests in the same format as OpenAI's TTS API
High-quality Japanese speech synthesis using the VOICEVOX engine
Also supports the AivisSpeech engine
Easy to deploy with Docker

🚀 Usage

🐳 Starting the server

# VOICEVOX (CPU)
docker compose up -d

# VOICEVOX (GPU)
docker compose -f docker-compose.gpu.yml up -d

# AivisSpeech (running in Docker)
docker compose -f docker-compose.aivis-speech.yml up -d

# AivisSpeech (connect to an API already running locally)
docker compose -f docker-compose.aivis-speech-api-only.yml up -d
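
After starting the containers, it may take a moment before the API accepts connections. The following is a minimal readiness check, assuming the API is published on localhost port 8000 (the port used by the endpoint URL in this README). `wait_for_port` is a hypothetical helper and is not part of this repository.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Return True once a TCP connection to host:port succeeds, False on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect only shows that the port is open,
            # not that the speech engine behind it has finished loading.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            time.sleep(interval)
    return False


if __name__ == "__main__":
    # Assumption: the API container publishes port 8000 on localhost.
    ready = wait_for_port("localhost", 8000)
    print("API port is open" if ready else "Timed out waiting for the API port")
```

An open port is only a coarse signal, so a first synthesis request may still be slow while the engine warms up.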

📝 API endpoint

POST http://localhost:8000/audio/speech

Request format (OpenAI-compatible)

{
  "model": "voicevox-v1",
  "input": "こんにちは、音声合成のテストです。",
  "voice": "1",
  "response_format": "mp3",
  "speed": 1.0
}

Parameters

model: Model to use (currently only "voicevox-v1")
input: Text to read aloud
voice: VOICEVOX speaker ID
response_format: Output format (currently only "mp3")
speed: Speech speed (default: 1.0)
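
The parameters above can be assembled and checked on the client side before sending. This is a minimal sketch, assuming only the constraints stated in this README ("voicevox-v1" and "mp3" are the only accepted values). `build_speech_request` is a hypothetical helper, the empty-input and positive-speed checks are extra assumptions, and the server's own validation may differ.

```python
import json

# Values taken from the parameter list in this README.
SUPPORTED_MODELS = {"voicevox-v1"}
SUPPORTED_FORMATS = {"mp3"}


def build_speech_request(text, voice="1", speed=1.0,
                         model="voicevox-v1", response_format="mp3"):
    """Build the JSON body for POST /audio/speech as a plain dict."""
    if model not in SUPPORTED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if response_format not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    # Assumption: these two checks are client-side sanity checks,
    # not rules documented by the server.
    if not text:
        raise ValueError("input must not be empty")
    if speed <= 0:
        raise ValueError("speed must be positive")
    return {
        "model": model,
        "input": text,
        "voice": str(voice),  # the speaker ID is sent as a string
        "response_format": response_format,
        "speed": float(speed),
    }


if __name__ == "__main__":
    payload = build_speech_request("こんにちは、音声合成のテストです。", voice=1)
    print(json.dumps(payload, ensure_ascii=False, indent=2))
```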

Response format

Content-Type: audio/mpeg
Body: MP3 audio data (binary)
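
Clients that do not use the OpenAI SDK can handle this response with the standard library alone. The sketch below assumes the endpoint URL and Content-Type described in this README. `is_mp3_response` and `synthesize_to_file` are hypothetical helper names, not part of the repository.

```python
import json
import urllib.request

# Endpoint as documented in this README.
API_URL = "http://localhost:8000/audio/speech"


def is_mp3_response(content_type):
    """True if the Content-Type header denotes MP3 audio, ignoring case and parameters."""
    if not content_type:
        return False
    return content_type.split(";", 1)[0].strip().lower() == "audio/mpeg"


def synthesize_to_file(text, path, voice="1", speed=1.0, url=API_URL):
    """POST a speech request and write the returned MP3 bytes to path."""
    payload = {
        "model": "voicevox-v1",
        "input": text,
        "voice": voice,
        "response_format": "mp3",
        "speed": speed,
    }
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        content_type = response.headers.get("Content-Type")
        if not is_mp3_response(content_type):
            raise RuntimeError(f"unexpected Content-Type: {content_type!r}")
        audio = response.read()
    with open(path, "wb") as f:
        f.write(audio)
    return len(audio)  # number of bytes written
```

Checking the Content-Type before writing avoids saving an error payload under an .mp3 name.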

Python example

from openai import OpenAI

# Create an OpenAI client with a custom base URL
client = OpenAI(base_url="http://localhost:8000", api_key="sk-1234")

# Generate speech and stream the response straight to a file
with client.audio.speech.with_streaming_response.create(
    model="voicevox-v1",
    voice="1",
    input="こんにちは、音声合成のテストです。",
    speed=1.0,
) as response:
    response.stream_to_file("output.mp3")

📁 Project structure

.
├── docker-compose.yml                        # VOICEVOX (CPU)
├── docker-compose.gpu.yml                    # VOICEVOX (GPU)
├── docker-compose.aivis-speech.yml           # AivisSpeech (Docker)
├── docker-compose.aivis-speech-api-only.yml  # API bridge for a locally running AivisSpeech
├── Dockerfile           # Build configuration for the API server
├── voice_mappings/      # Speaker ID mappings for each engine
│   ├── voicevox.json
│   └── aivis-speech.json
├── voicevox_tts_api/   # OpenAI-compatible API implementation
│   ├── tts_api.py      # Main API code
│   └── requirements.txt # Python dependencies
└── example/            # Usage examples and test scripts
    ├── tts_example.py  # Sample script
    └── README.md       # Description of the samples

🔧 System requirements

Docker
Docker Compose

🎯 Sample code

The example directory contains usage examples and test scripts for the API. See example/README.md for details.

🛠️ Architecture

                                  ┌─────────────┐
HTTP Request (OpenAI Format) ──▶  │  TTS API    │
                                  │  (FastAPI)  │
                                  └──────┬──────┘
                                         │
                                         ▼
                                  ┌─────────────┐
                                  │  VOICEVOX / │
                                  │ AivisSpeech │
                                  │   Engine    │
                                  └─────────────┘
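
The diagram can be made concrete with a rough sketch of how a bridge could translate an OpenAI-style request into calls to the engine. It is not the code in tts_api.py, which is not shown here. It assumes the VOICEVOX engine's two-step HTTP API (`/audio_query`, then `/synthesis`), the engine's default port 50021, and a direct mapping of `speed` onto the query's `speedScale` field. All function names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the VOICEVOX engine is reachable on its default port.
ENGINE_URL = "http://localhost:50021"


def audio_query_url(engine_url, text, speaker):
    """URL for step 1: ask the engine for synthesis parameters."""
    params = urllib.parse.urlencode({"text": text, "speaker": speaker})
    return f"{engine_url}/audio_query?{params}"


def synthesis_url(engine_url, speaker):
    """URL for step 2: render audio from an audio query."""
    params = urllib.parse.urlencode({"speaker": speaker})
    return f"{engine_url}/synthesis?{params}"


def apply_speed(query, speed):
    """Return a copy of the audio query with the requested speed applied."""
    updated = dict(query)
    # Assumption: the OpenAI-style speed maps one-to-one onto speedScale.
    updated["speedScale"] = speed
    return updated


def synthesize_wav(text, speaker, speed=1.0, engine_url=ENGINE_URL):
    """Run both engine calls and return the WAV bytes the engine produces."""
    request = urllib.request.Request(audio_query_url(engine_url, text, speaker), method="POST")
    with urllib.request.urlopen(request) as response:
        query = json.load(response)

    body = json.dumps(apply_speed(query, speed)).encode("utf-8")
    request = urllib.request.Request(
        synthesis_url(engine_url, speaker),
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.read()
```

The engine returns WAV audio, so a bridge that serves audio/mpeg would still need a WAV-to-MP3 conversion step, which this sketch leaves out.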

🔒 License

MIT License

nichiki/voicevox-openai-tts
