Alibaba
This page is auto-generated from model configurations. Last updated: 2026-03-13.
This reference lists all available Alibaba video generation models and their parameters. Use these parameter names when calling the Generation API.
- Wan 2.1 - 1.3b
- Wan 2.2 - I2V
- Wan 2.2 - T2V
- Wan 2.2 Animate - Move
- Wan 2.2 Animate - Replace
- Wan 2.2 Outpainting
- Wan 2.2 Reframe
- Wan 2.5 - I2V
- Wan 2.5 - T2V
- Wan 2.6 I2V
- Wan 2.6 T2V
Wan 2.1 - 1.3b
Wan 2.1 1.3b is a text-to-video model that ensures detail and accuracy in animations.
Model ID: model_wan-2-1-1-3b
Capabilities: txt2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-1-1-3b/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
prompt | string | Yes | - | - | - | - | Describe your video |
aspectRatio | string | No | 16:9 | - | - | 16:9, 9:16 | Video aspect ratio |
frameNum | number | No | 81 | - | - | 17, 33, 49, 65, 81 | Video duration in frames (based on standard 16fps playback) |
sampleSteps | number | No | 30 | 10 | 50 | - | Number of sampling steps (higher = better quality but slower) |
sampleGuideScale | number | No | 6 | 0 | 20 | - | Higher values follow the prompt more closely, lower values are more creative |
sampleShift | number | No | 8 | 0 | 20 | - | Sampling shift factor for flow matching (recommended range: 8-12) |
seed | number | No | - | - | - | - | Use a seed for reproducible results. Leave blank to use a random seed. |
Wan 2.2 - I2V
Wan 2.2 A14B is a image-to-video model at 720p and 480p resolutions
Model ID: model_wan-2-2-i2v-a14b
Capabilities: img2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-2-i2v-a14b/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
prompt | string | Yes | - | - | - | - | Describe your video |
image | file | No | - | - | - | - | Image used as the first frame of the video. Ideal images are 16:9 or 9:16 and 1280x720 or 720x1280, depending on the aspect ratio you choose. |
lastFrameImage | file | No | - | - | - | - | Input image for last frame generation. This only works if an image start frame is given too. |
resolution | string | No | 720p | - | - | 720p, 480p | Video resolution |
numFrames | number | No | 81 | 81 | 100 | - | Number of video frames. 81 frames give the best results |
framesPerSecond | number | No | 16 | 5 | 24 | - | Frames per second. |
sampleSteps | number | No | 30 | 1 | 50 | - | Number of generation steps. Fewer steps means faster generation, at the expensive of output quality. 30 steps is sufficient for most prompts |
sampleShift | number | No | 5 | 1 | 20 | - | Controls how much motion is added between video frames. Higher values create faster motion, lower values result in smoother, slower changes. |
seed | number | No | - | - | - | - | Use a seed for reproducible results. Leave blank to use a random seed. |
Wan 2.2 - T2V
Wan 2.2 A14B text-to-video
Model ID: model_wan-2-2-t2v
Capabilities: txt2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-2-t2v/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
prompt | string | Yes | - | - | - | - | Describe your video |
resolution | string | No | 720p | - | - | 720p, 480p | Video resolution |
numFrames | number | No | 81 | 81 | 121 | - | Number of video frames. 81 frames give the best results |
framesPerSecond | number | No | 16 | 5 | 30 | - | Frames per second. |
sampleShift | number | No | 5 | 1 | 20 | - | Controls how much motion is added between video frames. Higher values create faster motion, lower values result in smoother, slower changes. |
seed | number | No | - | - | - | - | Use a seed for reproducible results. Leave blank to use a random seed. |
Wan 2.2 Animate - Move
Wan-Animate is a video model that generates high-fidelity character videos by replicating the expressions and movements of characters from reference videos.
Model ID: model_wan-2-2-14b-animate-move
Capabilities: video2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-2-14b-animate-move/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
videoUrl | file | Yes | - | - | - | - | Input video |
imageUrl | file | Yes | - | - | - | - | Input image. If the input image does not match the chosen aspect ratio, it is resized and center cropped. |
resolution | string | No | 720p | - | - | 720p, 480p | Output video resolution |
mergeAudio | boolean | No | true | - | - | - | Merge audio from input video into output |
numInferenceSteps | number | No | 12 | 1 | 40 | - | Number of inference steps. Higher values improve quality but slow generation |
guidanceScale | number | No | 1 | 1 | 20 | - | Guidance scale for generation |
seed | number | No | - | - | - | - | Random seed for reproducibility. If None, a random seed is chosen. |
Wan 2.2 Animate - Replace
Wan-Animate Replace is a model that can integrate animated characters into reference videos, replacing the original character while preserving the scene's lighting and color tone for seamless environmental integration.
Model ID: model_wan-2-2-14b-animate-replace
Capabilities: video2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-2-14b-animate-replace/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
videoUrl | file | Yes | - | - | - | - | Input video |
imageUrl | file | Yes | - | - | - | - | Input image. If the input image does not match the chosen aspect ratio, it is resized and center cropped. |
resolution | string | No | 720p | - | - | 720p, 480p | Output video resolution |
mergeAudio | boolean | No | true | - | - | - | Merge audio from input video into output |
numInferenceSteps | number | No | 12 | 1 | 40 | - | Number of inference steps. Higher values improve quality but slow generation |
guidanceScale | number | No | 1 | 1 | 20 | - | Guidance scale for generation |
seed | number | No | - | - | - | - | Random seed for reproducibility. If None, a random seed is chosen. |
Wan 2.2 Outpainting
VACE Fun for Wan 2.2 A14B from Alibaba-PAI
Model ID: model_wan-2-2-vace-fun-a14b-outpainting
Capabilities: video2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-2-vace-fun-a14b-outpainting/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
prompt | string | Yes | - | - | - | - | Text prompt for video generation |
videoUrl | file | Yes | - | - | - | - | Input video for outpainting |
expandLeft | boolean | No | true | - | - | - | Expand video to the left |
expandRight | boolean | No | true | - | - | - | Expand video to the right |
expandTop | boolean | No | true | - | - | - | Expand video to the top |
expandBottom | boolean | No | true | - | - | - | Expand video to the bottom |
expandRatio | number | No | 0.25 | 0 | 1 | - | Amount of expansion. This is a float value between 0 and 1, where 0.25 adds 25% to the original video size on the specified sides. |
negativePrompt | string | No | - | - | - | - | Text negative prompt for video generation |
refImageUrls | file_array | No | `` | - | - | - | Reference images |
matchInputNumFrames | boolean | No | false | - | - | - | Match the number of frames from input video |
numFrames | number | No | 81 | 81 | 241 | - | Number of frames to generate |
matchInputFramesPerSecond | boolean | No | false | - | - | - | If true, the frames per second of the generated video will match the input video. If false, the frames per second will be determined by the Frames Per Seconds parameter. |
framesPerSecond | number | No | 16 | 5 | 30 | - | Frames per second of the generated video. Ignored if match_input_frames_per_second is true. Default value: 16 |
resolution | string | No | 720p | - | - | 720p, 580p, 480p | Output video resolution |
aspectRatio | string | No | auto | - | - | auto, 16:9, 1:1, 9:16 | Aspect ratio for output video |
numInferenceSteps | number | No | 30 | 2 | 50 | - | Number of inference steps |
guidanceScale | number | No | 5 | 1 | 10 | - | Guidance scale for generation |
enablePromptExpansion | boolean | No | false | - | - | - | Enable prompt expansion |
acceleration | string | No | regular | - | - | regular, none | Processing acceleration mode |
videoQuality | string | No | high | - | - | maximum, high, medium, low | Output video quality |
videoWriteMode | string | No | balanced | - | - | balanced, fast, small | Video writing mode |
numInterpolatedFrames | number | No | 1 | 0 | 5 | - | Number of frames to interpolate between the original frames. A value of 0 means no interpolation |
interpolatorModel | string | No | film | - | - | film, rife | Interpolator model to use |
seed | number | No | - | - | - | - | Random seed for reproducibility. If None, a random seed is chosen. |
Wan 2.2 Reframe
VACE Fun for Wan 2.2 A14B from Alibaba-PAI
Model ID: model_wan-2-2-vace-fun-a14b-reframe
Capabilities: video2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-2-vace-fun-a14b-reframe/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
videoUrl | file | Yes | - | - | - | - | Input video for reframe |
prompt | string | No | - | - | - | - | Text prompt for video generation |
negativePrompt | string | No | - | - | - | - | Text negative prompt for video generation |
matchInputNumFrames | boolean | No | false | - | - | - | Match the number of frames from input video |
numFrames | number | No | 81 | 81 | 241 | - | Number of frames to generate |
matchInputFramesPerSecond | boolean | No | false | - | - | - | If true, the frames per second of the generated video will match the input video. If false, the frames per second will be determined by the Frames Per Seconds parameter. |
framesPerSecond | number | No | 16 | 5 | 30 | - | Frames per second of the generated video. Ignored if match_input_frames_per_second is true. Default value: 16 |
resolution | string | No | 720p | - | - | 720p, 580p, 480p | Output video resolution |
aspectRatio | string | No | auto | - | - | auto, 16:9, 1:1, 9:16 | Aspect ratio for output video |
numInferenceSteps | number | No | 30 | 2 | 50 | - | Number of inference steps |
guidanceScale | number | No | 5 | 1 | 10 | - | Guidance scale for generation |
enablePromptExpansion | boolean | No | false | - | - | - | Enable prompt expansion |
acceleration | string | No | regular | - | - | regular, none | Processing acceleration mode |
videoQuality | string | No | high | - | - | maximum, high, medium, low | Output video quality |
videoWriteMode | string | No | balanced | - | - | balanced, fast, small | Video writing mode |
numInterpolatedFrames | number | No | 1 | 0 | 5 | - | Number of frames to interpolate between the original frames. A value of 0 means no interpolation |
interpolatorModel | string | No | film | - | - | film, rife | Interpolator model to use |
zoomFactor | number | No | 0 | 0 | 0.9 | - | Zoom factor for the video. When this value is greater than 0, the video will be zoomed in by this factor (in relation to the canvas size,) cutting off the edges of the video. A value of 0 means no zoom. |
trimBorders | boolean | No | true | - | - | - | Whether to trim borders from the video. |
temporalDownsampleFactor | number | No | 0 | 0 | 5 | - | Temporal downsample factor for the video. This is an integer value that determines how many frames to skip in the video. A value of 0 means no downsampling. For each downsample factor, one upsample factor will automatically be applied. |
seed | number | No | - | - | - | - | Random seed for reproducibility. If None, a random seed is chosen. |
Wan 2.5 - I2V
Wan 2.5 image-to-video model.
Model ID: model_wan-2-5-i2v
Capabilities: img2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-5-i2v/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
image | file | Yes | - | - | - | - | Image to use for your video |
prompt | string | Yes | - | - | - | - | A textual prompt to guide model generation. |
audio | file | No | - | - | - | - | Audio file for voice/music synchronization. 3-30s, ≤15MB. |
negativePrompt | string | No | - | - | - | - | Negative prompt used to guide the model away from undesirable features. |
resolution | string | No | 720p | - | - | 720p, 1080p | Video resolution. |
duration | number | No | 5 | - | - | 5, 10 | Duration of the generated video in seconds. |
enablePromptExpansion | boolean | No | true | - | - | - | Whether to enable prompt rewriting using LLM. |
seed | number | No | - | - | - | - | Random seed for reproducibility. If None, a random seed is chosen. |
Wan 2.5 - T2V
Wan 2.5 text-to-video model.
Model ID: model_wan-2-5-t2v
Capabilities: txt2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-5-t2v/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
prompt | string | Yes | - | - | - | - | A textual prompt to guide model generation. |
audio | file | No | - | - | - | - | Audio file for voice/music synchronization. 3-30s, ≤15MB. |
negativePrompt | string | No | - | - | - | - | Negative prompt used to guide the model away from undesirable features. |
size | string | No | 1280*720 | - | - | 1280*720, 720*1280, 1920*1080, 1080*1920 | Video resolution and aspect ratio. |
duration | number | No | 5 | - | - | 5, 10 | Duration of the generated video in seconds. |
enablePromptExpansion | boolean | No | true | - | - | - | Whether to enable prompt rewriting using LLM. |
seed | number | No | - | - | - | - | Random seed for reproducibility. If None, a random seed is chosen. |
Wan 2.6 I2V
Alibaba Wan 2.6 image to video generation model
Model ID: model_wan-2-6-i2v
Capabilities: img2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-6-i2v/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
image | file | Yes | - | - | - | - | Input image for video generation |
prompt | string | Yes | - | - | - | - | Text prompt for video generation |
audio | file | No | - | - | - | - | Audio file (3-30s, ≤15MB) for voice/music synchronization |
negativePrompt | string | No | - | - | - | - | Negative prompt to avoid certain elements |
resolution | string | No | 720p | - | - | 720p, 1080p | Video resolution |
duration | number | No | 5 | - | - | 5, 10, 15 | Duration of the generated video in seconds |
enablePromptExpansion | boolean | No | true | - | - | - | If set to true, the prompt optimizer will be enabled |
multiShots | boolean | No | true | - | - | - | Enable intelligent multi-shot segmentation (only active when 'Enable Prompt Expansion' is enabled). True enables multi-shot segmentation, false generates single-shot content. |
seed | number | No | - | - | - | - | Random seed for reproducible generation |
Wan 2.6 T2V
Alibaba Wan 2.6 text to video generation model
Model ID: model_wan-2-6-t2v
Capabilities: txt2video
LLM Markdown: https://app.scenario.com/api/models/model_wan-2-6-t2v/markdown
| Parameter | Type | Required | Default | Min | Max | Allowed Values | Description |
|---|---|---|---|---|---|---|---|
prompt | string | Yes | - | - | - | - | Text prompt for video generation |
audio | file | No | - | - | - | - | Audio file (3-30s, ≤15MB) for voice/music synchronization |
negativePrompt | string | No | - | - | - | - | Negative prompt to avoid certain elements |
size | string | No | 1280*720 | - | - | 1280*720, 720*1280, 1920*1080, 1080*1920 | Video resolution and aspect ratio |
duration | number | No | 5 | - | - | 5, 10, 15 | Duration of the generated video in seconds |
enablePromptExpansion | boolean | No | true | - | - | - | If set to true, the prompt optimizer will be enabled |
multiShots | boolean | No | true | - | - | - | Enable intelligent multi-shot segmentation (only active when 'Enable Prompt Expansion' is enabled). True enables multi-shot segmentation, false generates single-shot content. |
seed | number | No | - | - | - | - | Random seed for reproducible generation |
Updated about 4 hours ago