From c17057c901dffdf98cc489ee73863369467a4a16 Mon Sep 17 00:00:00 2001
From: Rishabh Bhargava
Date: Mon, 20 Apr 2026 11:44:57 -0700
Subject: [PATCH] docs: note Kokoro voice mixing syntax on audio-speech voice field

hexgrad/Kokoro-82M supports combining voices via "voiceA+voiceB" (equal
weights) or "voiceA(2)+voiceB(1)" (weighted). Mention the syntax on the
voice field description so API consumers see it inline with the other
voice guidance.

Co-Authored-By: Claude Opus 4.7 (1M context)
---
 openapi.yaml | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/openapi.yaml b/openapi.yaml
index 8449b9d..a59956b 100644
--- a/openapi.yaml
+++ b/openapi.yaml
@@ -10015,6 +10015,9 @@ components:
             You can view the voices supported for each model using the /v1/voices endpoint sending the model name as the query parameter. [View all supported voices here](https://docs.together.ai/docs/text-to-speech#supported-voices).
+
+
+            `hexgrad/Kokoro-82M` additionally supports voice mixing, where two or more voices are combined into a single blended voice by joining their names with `+` (e.g. `af_bella+af_heart`). Optional per-voice weights can be provided in parentheses (e.g. `af_bella(2)+af_heart(1)`). Other models require a single voice name.
           type: string
         response_format:
           type: string
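
For reviewers who want to see the mixing syntax in concrete terms, here is a minimal Python sketch that builds a `voice` value in the two forms described above. The helper name `mix_voices` and its parameters are hypothetical and not part of any SDK. The sketch assumes integer weights, as in the examples. Whether the API accepts fractional weights, or a mix of weighted and unweighted names, is not stated in this patch.

```python
def mix_voices(names, weights=None):
    """Build a Kokoro voice-mix string (hypothetical helper, for illustration).

    names:   list of voice names, e.g. ["af_bella", "af_heart"]
    weights: optional list of integer weights, one per name
    """
    if not names:
        raise ValueError("at least one voice name is required")

    # Equal-weight form: voice names joined with "+"
    if weights is None:
        return "+".join(names)

    if len(weights) != len(names):
        raise ValueError("weights must have one entry per voice name")

    # Weighted form: each name followed by its weight in parentheses
    return "+".join(f"{name}({weight})" for name, weight in zip(names, weights))


# Request fields named in this patch; other request fields are omitted here.
payload = {
    "model": "hexgrad/Kokoro-82M",
    "voice": mix_voices(["af_bella", "af_heart"], [2, 1]),
}
```

Passing a single name returns it unchanged, which matches the single-voice form that other models require.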