Video Lipsync - Parameters Reference

HeyGen

HeyGen Avatar 4

Model ID: model_heygen-avatar4-i2v

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
imageAvatar ImagefileNoneNoneNoneNoneRequired. Image to animate. Should contain a clear face.
promptScriptstringNoneNoneNoneNoneRequired. The text the avatar will speak.
backgroundBackgroundinputs_array[]None1NoneOptional background configuration (color, image, or video).
voiceVoicestringMelissaNoneNoneMelissa, Warm Pro Narrator, Chill Brian, Ivy, etc.Required. Name of the voice to use for the avatar.
resolutionResolutionstring720pNoneNone360p, 480p, 540p, 720p, 1080pVideo resolution preset.
talkingStyleTalking StylestringstableNoneNonestable, expressive'stable' for minimal movement, 'expressive' for more animation.
captionAdd CaptionsbooleanfalseNoneNoneNoneWhether to add captions to the video.

HeyGen Video Agent

Model ID: model_heygen-v2-video-agent

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
promptPromptstringNoneNoneNoneNoneRequired. Natural language prompt describing style, visual elements, and desired length.
durationDurationnumber305120NoneApproximate video duration in seconds. Suggested: 30, 60, or 90.
orientationOrientationstringportraitNoneNoneportrait, landscapeVideo orientation selection.
avatarAvatarstringAdriana SuitSofa FrontNoneNoneSee full list in documentationOptional avatar to use in the video; extensive list available.

HeyGen Video Translate Precision

Model ID: model_heygen-v2-translate-precision

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
videoVideofileNoneNoneNoneNoneRequired. URL of the video to translate.
outputLanguageOutput LanguagestringSpanishNoneNoneEnglish, Spanish, French, Hindi, Italian, German, Polish, Portuguese, Chinese, Japanese, Dutch, Turkish, Korean, and many others.The target language to translate the video into.
translateAudioOnlyTranslate Audio OnlybooleanfalseNoneNoneNoneTranslate only the audio, ignore faces and only translate the voice track.
enableDynamicDurationEnable Dynamic DurationbooleantrueNoneNoneNoneEnhances conversational fluidity between languages with different speaking rates.
speakerNumSpeaker NumnumberNoneNoneNoneNoneNumber of speakers in the video.

HeyGen Video Translate Speed

Model ID: model_heygen-v2-translate-speed

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
videoVideofileNoneNoneNoneNoneRequired. URL of the video to translate.
outputLanguageOutput LanguagestringSpanishNoneNoneEnglish, Spanish, French, Hindi, Italian, German, Polish, Portuguese, Chinese, Japanese, Dutch, Turkish, Korean, and many othersThe target language to translate the video into.
translateAudioOnlyTranslate Audio OnlybooleanfalseNoneNoneNoneTranslate only the audio, ignore faces and only translate the voice track.
enableDynamicDurationEnable Dynamic DurationbooleantrueNoneNoneNoneEnhances conversational fluidity between languages with different speaking rates.
speakerNumSpeaker NumnumberNoneNoneNoneNoneNumber of speakers in the video.

Kling

Kling Lipsync

Model ID: model_kling-lip-sync

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
videoUrlVideoassetId2-10s, <100MB, 720p–1080p
audioFileLip Sync Audio FileassetIdrequired if text not provided
textLip Sync Textstringrequired if audio not provided
voiceIdAI Voice Liststringen_AOT(40+ voices available)used when text provided
voiceSpeedVoice Speednumber10.82

Kling AI Avatar V2 Pro

Model ID: model_kling-video-ai-avatar-v2-pro

InputLabelTypeDefaultNotes
imageImageassetIdrequired avatar image
audioAudioassetIdrequired
textAdd Descriptionstringoptional cue

Bytedance

Bytedance OmniHuman 1.5

Model ID: model_bytedance-omni-human-1-5

InputLabelTypeDefaultNotes
imageImageassetIdrequired
audioAudioassetIdrequired (recommended ≤15s for best quality)

Bytedance OmniHuman

Model ID: model_bytedance-omni-human

InputLabelTypeDefaultNotes
imageImageassetIdrequired
audioAudioassetIdrequired (recommended ≤15s for best quality)

Pixverse

Pixverse Lipsync

Model ID: model_pixverse-lipsync

InputLabelTypeDefaultAllowed ValuesNotes
videoVideoassetIdrequired
audioAudioassetIdrequired if text not provided
textText To Speechstring""required if audio not provided
voiceIdVoicestringAutoEmily, James, Isabella, Liam, Chloe, Adrian, Harper, Ava, Sophia, Julia, Mason, Jack, Oliver, Ethan, Autoused when text provided

Creatify

Creatify Aurora (Avatar)

Model ID: model_creatify-aurora

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
imageAvatar ImageassetIdNoneNoneNoneNoneRequired. Input avatar image
audioAudioassetIdNoneNoneNoneNoneRequired. Input audio file
promptPromptstringNoneNone2048NoneOptional text prompt to guide generation
guidanceScalePrompt Guidancenumber105NoneHigher values follow the prompt more closely
audioGuidanceScaleAudio Guidancenumber205NoneHigher values follow the audio more closely
resolutionResolutionstring720pNoneNone480p, 720pOutput video resolution

Sync Labs

Sync Lipsync React 1

Model ID: model_sync-lipsync-react-1

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
videoVideoassetIdNoneNoneNoneNoneRequired. Input video file
audioAudioassetIdNoneNoneNoneNoneRequired. Input audio file. Maximum duration is 15 seconds.
emotionEmotionstringneutralNoneNonehappy, angry, sad, neutral, disgusted, surprisedEmotion prompt for generation
modelModeModel ModestringfaceNoneNonelips, face, headControls edit region and movement scope
lipsyncModeLipsync ModestringbounceNoneNoneloop, bounce, cut_off, silence, remapBehavior when audio and video durations differ
temperatureTemperaturenumber0.501NoneControls expressiveness of lipsync

Sync Lipsync v2 Pro

Model ID: model_sync-lipsync-v2-pro

InputLabelTypeDefaultMinMaxAllowed ValuesNotes
videoVideoassetIdrequired
audioAudioassetIdrequired
syncModeSync Modestringlooploop, bounce, cut_off, silence, remaplipsync behavior when lengths mismatch
temperatureTemperaturenumber0.501expressiveness
activeSpeakerActive Speakerbooleanfalsedetects active speaker