Speech Models - Parameters Reference

This document provides a comprehensive reference for the parameters available across various audio generation models in the Scenario API. Each model has a unique modelId and a set of specific parameters that can be used to control the speech generation process. Understanding these parameters is crucial for effectively utilizing the API to achieve desired audio outputs.

Below, you will find detailed information for each audio model, including its modelId, the types of parameters it accepts, allowed values, default settings, and a clear description of each parameter's function.

ElevenLabs

ElevenLabs V3

Model ID: model_elevenlabs-tts-v3

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
textTextstringRequired. Up to 40k characters
voiceVoiceselectAria"Aria", "Roger", "Sarah", "Laura", "Charlie", "George", "Callum", "River", "Liam", "Charlotte", "Alice", "Matilda", "Will", "Jessica", "Eric", "Chris", "Brian", "Daniel", "Lily", "Bill"
stabilityStabilitynumber0.501
similarityBoostSimilarity Boostnumber0.501
styleStyle Exaggerationnumber001
speedSpeednumber10.71.2<1 slows; >1 speeds up
previousTextPrevious Textstringoptional context
nextTextNext Textstringoptional context
languageCodeLanguage Codeselect""ISO 639‑1 codes

ElevenLabs Turbo v2.5

Model ID: elevenlabs-turbo-v2-5

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
textTextstringRequired. Text to convert to speech (max 40000 chars)
voiceVoicestringAriaAria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, BillVoice preset
stabilityStabilitynumber0.501Controls voice stability
similarityBoostSimilarity Boostnumber0.501Closeness to selected voice
styleExaggerationStyle Exaggerationnumber001Boosts emotional expression
speedSpeednumber10.71.2<1 slows, >1 speeds up
previousTextPrevious TextstringOptional. Helps continuity across multi-part generation (max 10000 chars)
nextTextNext TextstringOptional. Helps continuity (max 10000 chars)
languageCodeLanguage Codestring"""" (auto), en, ca, es, fr, de, it, ja, ko, zh, ru, ar, hi, bn, pa, ta, te, mr, ur, fa, tr, nl, sv, da, no, fi, el, ro, hu, cs, sk, sl, pt, id, th, vi, ms, tl, yo, ig, ha, am, az, be, bg, hrForces language for synthesis

ElevenLabs Multilingual v2

Model ID: model_elevenlabs-multilingual-v2

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
textTextstringRequired. Up to 40k characters
voiceVoiceselectAria"Aria", "Roger", "Sarah", "Laura", "Charlie", "George", "Callum", "River", "Liam", "Charlotte", "Alice", "Matilda", "Will", "Jessica", "Eric", "Chris", "Brian", "Daniel", "Lily", "Bill"
stabilityStabilitynumber0.501
similarityBoostSimilarity Boostnumber0.501
styleStyle Exaggerationnumber001
speedSpeednumber10.71.2<1 slows; >1 speeds up
previousTextPrevious Textstringoptional context
nextTextNext Textstringoptional context
languageCodeLanguage Codeselect""ISO 639‑1 codes

Minimax

Minimax Speech 2.6 HD

Model ID: model_minimax-speech-2-6-hd

InputLabelTypeDefaultMinMaxAllowed Values
textTextstring
voiceIdVoice IdselectWise_WomanWise_Woman, Friendly_Person, Inspirational_girl, Deep_Voice_Man, Calm_Woman, Casual_Guy, Lively_Girl, Patient_Man, Young_Knight, Determined_Man, Lovely_Girl, Decent_Boy, Imposing_Manner, Elegant_Man, Abbess, Sweet_Girl_2, Exuberant_Girl
speedSpeednumber10.52
volumeVolumenumber1010
pitchPitchnumber0-1212
emotionEmotionselectautoauto, neutral, happy, sad, angry, fearful, disgusted, surprised
englishNormalizationEnglish Normalizationbooleanfalse
sampleRateSample Ratenumber320008000, 16000, 22050, 24000, 32000, 44100
bitrateBitratenumber12800032000, 64000, 128000, 256000
channelChannelselectmonomono, stereo
languageBoostLanguage BoostselectAutomatic(list of 25 language options)

Minimax Speech 2.6 Turbo

Model ID: model_minimax-speech-2-6-turbo

InputLabelTypeDefaultMinMaxAllowed Values
textTextstring
voiceIdVoice IdselectWise_WomanWise_Woman, Friendly_Person, Inspirational_girl, Deep_Voice_Man, Calm_Woman, Casual_Guy, Lively_Girl, Patient_Man, Young_Knight, Determined_Man, Lovely_Girl, Decent_Boy, Imposing_Manner, Elegant_Man, Abbess, Sweet_Girl_2, Exuberant_Girl
speedSpeednumber10.52
volumeVolumenumber1010
pitchPitchnumber0-1212
emotionEmotionselectautoauto, neutral, happy, sad, angry, fearful, disgusted, surprised
englishNormalizationEnglish Normalizationbooleanfalse
sampleRateSample Ratenumber320008000, 16000, 22050, 24000, 32000, 44100
bitrateBitratenumber12800032000, 64000, 128000, 256000
channelChannelselectmonomono, stereo
languageBoostLanguage BoostselectAutomatic(list of 25 language options)