Audio Generation

ElevenLabs

This page is auto-generated from model configurations. Last updated: 2026-07-01.

This reference lists all available ElevenLabs audio generation models and their parameters. Use these parameter names when calling the Generation API.


Life-like, emotionally rich text-to-speech model supporting 29 languages.

Model ID: model_elevenlabs-multilingual-v2

Capabilities: txt2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-multilingual-v2/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| text | string | Yes | - | - | - | - | The text to convert to speech |
| voiceId | model | No | - | - | - | - | Your cloned ElevenLabs voice model |
| publicVoice | string | No | Adam | - | - | Adam, Alice, Bella, Bill, Brian, Callum, Charlie, Chris, Daniel, Eric, George, Harry, Jessica, Laura, Liam, Lily, Matilda, River, Roger, Sarah, Will | Select a pre-built ElevenLabs public voice. Ignored if the input Voice is set. |
| stability | number | No | 0.5 | 0 | 1 | - | Voice stability |
| similarityBoost | number | No | 0.5 | 0 | 1 | - | Similarity boost |
| styleExaggeration | number | No | 0 | 0 | 1 | - | Style exaggeration |
| speed | number | No | 1 | 0.7 | 1.2 | - | Speech speed (0.7-1.2). Values below 1.0 slow down the speech, above 1.0 speed it up. Extreme values may affect quality. |
| languageCode | string | No | - | - | - | "", en, ca, es, fr, de, it, ja, ko, zh, ru, ar, hi, bn, pa, ta, te, mr, ur, fa, tr, nl, sv, da, no, fi, el, ro, hu, cs, sk, sl, pt, id, th, vi, ms, tl, yo, ig, ha, am, az, be, bg, hr | Language code (ISO 639-1) used to enforce a language for the model. |
| outputFormat | string | No | mp3_44100_128 | - | - | mp3_22050_32, mp3_24000_48, mp3_44100_32, mp3_44100_64, mp3_44100_96, mp3_44100_128, mp3_44100_192, wav_8000, wav_16000, wav_22050, wav_24000, wav_32000, wav_44100, wav_48000, opus_48000_32, opus_48000_64, opus_48000_96, opus_48000_128, opus_48000_192 | Output audio format. |
| seed | number | No | - | - | - | - | Seed for deterministic output |
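
The numeric ranges above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: `build_tts_params` and `TTS_RANGES` are hypothetical names, and only the parameter names and ranges are taken from the table. How the resulting dictionary is submitted to the Generation API is not covered here.

```python
# Ranges copied from the parameter table: name -> (min, max).
TTS_RANGES = {
    "stability": (0.0, 1.0),
    "similarityBoost": (0.0, 1.0),
    "styleExaggeration": (0.0, 1.0),
    "speed": (0.7, 1.2),
}


def build_tts_params(text, **options):
    """Return a parameter dict, rejecting values outside the documented ranges.

    Hypothetical helper: only the parameter names and ranges come from the table.
    """
    if not text:
        raise ValueError("text is required")
    params = {"text": text}
    for name, value in options.items():
        if name in TTS_RANGES:
            low, high = TTS_RANGES[name]
            if not low <= value <= high:
                raise ValueError(f"{name}={value} is outside [{low}, {high}]")
        # Parameters without a documented range are passed through unchanged.
        params[name] = value
    return params


params = build_tts_params(
    "Hello there.", publicVoice="Bella", speed=1.1, languageCode="en"
)
```

Rejecting out-of-range values locally is a design choice for this sketch; the API may report such errors differently.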

Advanced AI music generation with chunk-based composition plans and section-by-section control.

Model ID: model_elevenlabs-music-advanced-v2

Capabilities: txt2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-music-advanced-v2/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| sections | inputs_array | Yes | - | - | - | - | Ordered song sections defining the composition structure. |
| outputFormat | string | No | mp3_44100_128 | - | - | mp3_22050_32, mp3_24000_48, mp3_44100_32, mp3_44100_64, mp3_44100_96, mp3_44100_128, mp3_44100_192, opus_48000_32, opus_48000_64, opus_48000_96, opus_48000_128, opus_48000_192 | The format and quality of the audio you get back. Higher numbers mean better quality and larger files. |
| seed | number | No | - | - | - | - | A number that makes results repeatable. Reusing the same seed and settings produces the same music; leave it empty for a different result each time. |

Next-generation AI music generation from text descriptions.

Model ID: model_elevenlabs-music-v2

Capabilities: txt2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-music-v2/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| prompt | string | Yes | - | - | - | - | Describe the music you want: mood, genre, instruments, tempo, and any other direction. For example, “upbeat lo-fi hip-hop with mellow piano and soft drums.” |
| durationSeconds | number | No | 30 | 3 | 180 | - | How long the music lasts, in seconds (3–180). Longer tracks cost more. |
| forceInstrumental | boolean | No | false | - | - | - | Generates music without any vocals. |
| outputFormat | string | No | mp3_44100_128 | - | - | mp3_22050_32, mp3_24000_48, mp3_44100_32, mp3_44100_64, mp3_44100_96, mp3_44100_128, mp3_44100_192, opus_48000_32, opus_48000_64, opus_48000_96, opus_48000_128, opus_48000_192 | The format and quality of the audio you get back. Higher numbers mean better quality and larger files. |
| seed | number | No | - | - | - | - | A number that makes results repeatable. Reusing the same seed and settings produces the same music; leave it empty for a different result each time. |
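
As an illustration of the `durationSeconds` limits and the optional `seed`, the sketch below assembles a parameter dictionary for this model. `music_params` is a hypothetical helper, and clamping the duration into the 3–180 range (rather than rejecting it) is an assumption made for this example, not documented API behavior.

```python
def music_params(prompt, duration_seconds=30, force_instrumental=False, seed=None):
    """Assemble parameters for model_elevenlabs-music-v2 (hypothetical helper)."""
    if not prompt:
        raise ValueError("prompt is required")
    # Clamp into the documented 3-180 second range (a choice made in this sketch).
    duration = max(3, min(180, duration_seconds))
    params = {
        "prompt": prompt,
        "durationSeconds": duration,
        "forceInstrumental": force_instrumental,
    }
    # Leaving seed out gives a different result on each run, per the table.
    if seed is not None:
        params["seed"] = seed
    return params


repeatable = music_params("upbeat lo-fi hip-hop with mellow piano", seed=42)
too_long = music_params("ambient drone", duration_seconds=600)
```

Passing the same `seed` together with otherwise identical settings is what makes a result repeatable, so a caller that wants variation simply omits it.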

Professional sound effects generation for audio production and content creation.

Model ID: model_elevenlabs-sound-effects-v2

Capabilities: txt2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-sound-effects-v2/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| text | string | Yes | - | - | - | - | A textual description of the sound effect to generate. |
| durationSeconds | number | No | 5 | 0.5 | 30 | - | Duration in seconds (0.5-30). If not set, optimal duration will be determined from prompt. |
| promptInfluence | number | No | 0.3 | 0 | 1 | - | How closely to follow the sound description. Higher values mean less variation. |
| loop | boolean | No | false | - | - | - | Whether to loop the sound effect. |
| outputFormat | string | No | mp3_44100_128 | - | - | mp3_22050_32, mp3_24000_48, mp3_44100_32, mp3_44100_64, mp3_44100_96, mp3_44100_128, mp3_44100_192, opus_48000_32, opus_48000_64, opus_48000_96, opus_48000_128, opus_48000_192 | Output audio format. |
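
One way to keep these sound-effect settings together is a small validated container. The sketch below is illustrative only: `SoundEffectRequest` and `to_params` are hypothetical names, and the class simply mirrors the ranges listed above. It leaves `durationSeconds` out of the output when no duration is given, matching the description's note about unset durations.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SoundEffectRequest:
    """Hypothetical container for model_elevenlabs-sound-effects-v2 parameters."""

    text: str
    duration_seconds: Optional[float] = None  # None means "leave unset"
    prompt_influence: float = 0.3
    loop: bool = False

    def __post_init__(self):
        if not self.text:
            raise ValueError("text is required")
        if self.duration_seconds is not None and not 0.5 <= self.duration_seconds <= 30:
            raise ValueError("durationSeconds must be between 0.5 and 30")
        if not 0 <= self.prompt_influence <= 1:
            raise ValueError("promptInfluence must be between 0 and 1")

    def to_params(self):
        """Return a dict keyed by the parameter names from the table."""
        params = {
            "text": self.text,
            "promptInfluence": self.prompt_influence,
            "loop": self.loop,
        }
        if self.duration_seconds is not None:
            params["durationSeconds"] = self.duration_seconds
        return params


effect = SoundEffectRequest("glass shattering on a tile floor", loop=True)
effect_params = effect.to_params()
```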

High-quality, low-latency text-to-speech model in multiple languages.

Model ID: model_elevenlabs-turbo-v2-5

Capabilities: txt2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-turbo-v2-5/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| text | string | Yes | - | - | - | - | The text to convert to speech |
| voiceId | model | No | - | - | - | - | Your cloned ElevenLabs voice model |
| publicVoice | string | No | Adam | - | - | Adam, Alice, Bella, Bill, Brian, Callum, Charlie, Chris, Daniel, Eric, George, Harry, Jessica, Laura, Liam, Lily, Matilda, River, Roger, Sarah, Will | Select a pre-built ElevenLabs public voice. Ignored if the input Voice is set. |
| stability | number | No | 0.5 | 0 | 1 | - | Voice stability |
| similarityBoost | number | No | 0.5 | 0 | 1 | - | Similarity boost |
| styleExaggeration | number | No | 0 | 0 | 1 | - | Style exaggeration |
| speed | number | No | 1 | 0.7 | 1.2 | - | Speech speed (0.7-1.2). Values below 1.0 slow down the speech, above 1.0 speed it up. Extreme values may affect quality. |
| languageCode | string | No | - | - | - | "", en, ca, es, fr, de, it, ja, ko, zh, ru, ar, hi, bn, pa, ta, te, mr, ur, fa, tr, nl, sv, da, no, fi, el, ro, hu, cs, sk, sl, pt, id, th, vi, ms, tl, yo, ig, ha, am, az, be, bg, hr | Language code (ISO 639-1) used to enforce a language for the model. |
| outputFormat | string | No | mp3_44100_128 | - | - | mp3_22050_32, mp3_24000_48, mp3_44100_32, mp3_44100_64, mp3_44100_96, mp3_44100_128, mp3_44100_192, wav_8000, wav_16000, wav_22050, wav_24000, wav_32000, wav_44100, wav_48000, opus_48000_32, opus_48000_64, opus_48000_96, opus_48000_128, opus_48000_192 | Output audio format. |
| seed | number | No | - | - | - | - | Seed for deterministic output |

Next-generation text-to-speech model with advanced voice synthesis and enhanced naturalness.

Model ID: model_elevenlabs-tts-v3

Capabilities: txt2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-tts-v3/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| text | string | Yes | - | - | - | - | The words that will be spoken aloud in the generated audio. |
| voiceId | model | No | - | - | - | - | A custom voice you cloned or trained to speak the text. |
| publicVoice | string | No | Adam | - | - | Adam, Alice, Bella, Bill, Brian, Callum, Charlie, Chris, Daniel, Eric, George, Harry, Jessica, Laura, Liam, Lily, Matilda, River, Roger, Sarah, Will | A ready-made ElevenLabs voice, used only when no custom Voice is set. |
| stability | number | No | 0.5 | 0 | 1 | - | Higher values make the voice steadier and more consistent; lower values make it more varied and expressive. |
| similarityBoost | number | No | 0.75 | 0 | 1 | - | How closely the output matches the original voice; higher values track it more tightly but can amplify artifacts. |
| styleExaggeration | number | No | 0 | 0 | 1 | - | Amplifies the speaker’s style and emotion; higher values are more expressive but can reduce stability. |
| speed | number | No | 1 | 0.7 | 1.2 | - | How fast the speech is delivered; below 1.0 slows it down, above 1.0 speeds it up. Extreme values may affect quality. |
| outputFormat | string | No | mp3_44100_128 | - | - | mp3_22050_32, mp3_24000_48, mp3_44100_32, mp3_44100_64, mp3_44100_96, mp3_44100_128, mp3_44100_192, wav_8000, wav_16000, wav_22050, wav_24000, wav_32000, wav_44100, wav_48000, opus_48000_32, opus_48000_64, opus_48000_96, opus_48000_128, opus_48000_192 | File type, sample rate, and bitrate of the generated audio. |
| seed | number | No | - | - | - | - | Fixes randomness so identical inputs produce the same audio every time. |
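
Each `outputFormat` value packs the file type, sample rate, and (for MP3 and Opus) bitrate into one string. The sketch below splits a value into those parts. `parse_output_format` is a hypothetical helper, and it assumes the fields are ordered `codec_sampleRate[_bitrate]`, with the sample rate in Hz and the bitrate in kbps, as the listed values suggest.

```python
def parse_output_format(value):
    """Split an outputFormat string such as 'mp3_44100_128' into its parts.

    Hypothetical helper; assumes the layout codec_sampleRate[_bitrate].
    """
    parts = value.split("_")
    if len(parts) not in (2, 3):
        raise ValueError(f"unexpected outputFormat: {value!r}")
    return {
        "codec": parts[0],
        "sampleRateHz": int(parts[1]),
        # WAV values carry no bitrate field, so this stays None for them.
        "bitrateKbps": int(parts[2]) if len(parts) == 3 else None,
    }


default_format = parse_output_format("mp3_44100_128")
wav_format = parse_output_format("wav_16000")
```

This can be handy for choosing a file extension or for comparing candidate formats by bitrate before a request is made.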

Transform speech audio into a different voice while preserving emotion, timing, and delivery.

Model ID: model_elevenlabs-voice-changer

Capabilities: audio2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-voice-changer/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| audio | file | Yes | - | - | - | - | The audio file you want to transform. The words, timing, and emotion are kept; only the voice itself changes. The audio file should be less than 5 minutes long. |
| voiceId | model | No | - | - | - | - | The voice you want the recording to sound like. Pick one of your own cloned ElevenLabs voices. |
| publicVoice | string | No | Adam | - | - | Adam, Alice, Bella, Bill, Brian, Callum, Charlie, Chris, Daniel, Eric, George, Harry, Jessica, Laura, Liam, Lily, Matilda, River, Roger, Sarah, Will | One of ElevenLabs’ ready-made voices to use as the target. Ignored if you’ve selected your own Voice above. |
| fileFormat | string | No | other | - | - | pcm_s16le_16, other | The format of the file you upload. Choose ‘Encoded’ for common files like MP3 or WAV. ‘PCM’ is a raw audio format that’s slightly faster to process if your file already uses it. |
| removeBackgroundNoise | boolean | No | false | - | - | - | Cleans up background noise in your recording before converting it. Useful for noisy or low-quality audio. |
| stability | number | No | - | 0 | 1 | - | Controls how steady the voice sounds. Higher values keep it calm and consistent; lower values make it more varied and expressive. Leave empty to use the voice’s own setting. |
| similarityBoost | number | No | - | 0 | 1 | - | How closely the result should match the target voice. Higher values stick to it more tightly. Leave empty to use the voice’s own setting. |
| styleExaggeration | number | No | - | 0 | 1 | - | How much to amplify the target voice’s style and emotion. Higher values are more expressive but can make the voice less steady. Leave empty to use the voice’s own setting. |
| useSpeakerBoost | boolean | No | - | - | - | - | Makes the result sound more like the chosen voice, at the cost of slightly slower processing. Leave empty to use the voice’s own setting. |
| outputFormat | string | No | mp3_44100_128 | - | - | mp3_22050_32, mp3_24000_48, mp3_44100_32, mp3_44100_64, mp3_44100_96, mp3_44100_128, mp3_44100_192, opus_48000_32, opus_48000_64, opus_48000_96, opus_48000_128, opus_48000_192 | The format and quality of the audio you get back. Higher numbers mean better quality and larger files. |
| seed | number | No | - | - | - | - | An optional number that locks in the result. Reusing the same seed with the same settings produces the same audio every time; leave it empty for a fresh result on each run. |
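
The precedence between `voiceId` and `publicVoice` can be sketched in a few lines. `resolve_voice` is a hypothetical helper that models the rule as described, on the assumption that "your own Voice" refers to the `voiceId` parameter; the voice ID in the example is made up.

```python
def resolve_voice(params):
    """Return (parameter_name, value) for the voice that would take effect.

    Hypothetical helper modelling the documented precedence: a cloned voice
    (voiceId) wins, otherwise publicVoice is used, defaulting to "Adam".
    """
    if params.get("voiceId"):
        return ("voiceId", params["voiceId"])
    return ("publicVoice", params.get("publicVoice", "Adam"))


# "voice_123" is a made-up ID used only for illustration.
both_set = resolve_voice({"voiceId": "voice_123", "publicVoice": "Bella"})
public_only = resolve_voice({"publicVoice": "Bella"})
neither = resolve_voice({})
```

Sending both parameters is therefore redundant under this reading: `publicVoice` only matters when `voiceId` is absent.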

Remove background noise from audio and isolate the voice.

Model ID: model_elevenlabs-voice-isolator

Capabilities: audio2audio

LLM Markdown: https://app.scenario.com/api/models/model_elevenlabs-voice-isolator/markdown

| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
| --- | --- | --- | --- | --- | --- | --- | --- |
| audio | file | Yes | - | - | - | - | The recording you want to clean up. The voice is kept and isolated, while background noise, music, and other sounds are removed. |
| fileFormat | string | No | other | - | - | pcm_s16le_16, other | The format of the file you upload. Choose ‘Encoded’ for common files like MP3 or WAV. ‘PCM’ is a raw audio format that’s slightly faster to process if your file already uses it. |