From c17057c901dffdf98cc489ee73863369467a4a16 Mon Sep 17 00:00:00 2001
From: Rishabh Bhargava <rishabh.bhargava93@gmail.com>
Date: Mon, 20 Apr 2026 11:44:57 -0700
Subject: [PATCH] docs: note Kokoro voice mixing syntax on audio-speech voice
 field

hexgrad/Kokoro-82M supports combining voices via "voiceA+voiceB" (equal
weights) or "voiceA(2)+voiceB(1)" (weighted). Mention the syntax on the
voice field description so API consumers see it inline with the other
voice guidance.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 openapi.yaml | 3 +++
 1 file changed, 3 insertions(+)
diff --git a/openapi.yaml b/openapi.yaml
index 8449b9d..a59956b 100644
--- a/openapi.yaml
+++ b/openapi.yaml
@@ -10015,6 +10015,9 @@ components:
             <br>
             You can view the voices supported for each model using the /v1/voices endpoint sending the model name as the query parameter.
             [View all supported voices here](https://docs.together.ai/docs/text-to-speech#supported-voices).
+            <br>
+            <br>
+            `hexgrad/Kokoro-82M` additionally supports voice mixing, where two or more voices are combined into a single blended voice by joining their names with `+` (e.g. `af_bella+af_heart`). Optional per-voice weights can be provided in parentheses (e.g. `af_bella(2)+af_heart(1)`). Other models require a single voice name.
           type: string
         response_format:
           type: string