Academia

This page is auto-generated from model configurations. Last updated: 2026-04-25.

This reference lists all available Academia image generation models and their parameters. Use these parameter names when calling the Generation API.


ERNIE Image

Baidu ERNIE-Image (SFT) — strong instruction following, text rendering, and layout-aware generation; ~50 inference steps with optional prompt enhancer.

Model ID: model_ernie-image

Capabilities: txt2img

LLM Markdown: https://app.scenario.com/api/models/model_ernie-image/markdown

ParameterTypeRequiredDefaultMinMaxAllowed ValuesDescription
promptstringYes----Text prompt for image generation.
numOutputsnumberNo114-Number of images to generate.
widthnumberNo1024642048-Width of the generated image. Recommended sizes are listed in presets.
heightnumberNo1024642048-Height of the generated image. Recommended sizes are listed in presets.
numInferenceStepsnumberNo501100-Number of denoising steps. The SFT model is tuned for ~50 steps.
guidanceScalenumberNo4020-Classifier-free guidance scale.
usePebooleanNotrue---Expand short prompts into richer descriptions (recommended on the model card).
seednumberNo----Random seed for reproducible generation.

ERNIE Image Turbo

Baidu ERNIE-Image-Turbo — fast ~8-step generation with optional prompt enhancer (guidance fixed in the deployment).

Model ID: model_ernie-image-turbo

Capabilities: txt2img

LLM Markdown: https://app.scenario.com/api/models/model_ernie-image-turbo/markdown

ParameterTypeRequiredDefaultMinMaxAllowed ValuesDescription
promptstringYes----Text prompt for image generation.
numOutputsnumberNo114-Number of images to generate.
widthnumberNo1024642048-Width of the generated image. Recommended sizes are listed in presets.
heightnumberNo1024642048-Height of the generated image. Recommended sizes are listed in presets.
usePebooleanNotrue---Expand short prompts into richer descriptions (recommended on the model card).
seednumberNo----Random seed for reproducible generation.

Joyai Image Edit

All-in-one image editing model with strong visual understanding for precise natural language edits.

Model ID: model_joyai-image-edit

Capabilities: img2img

LLM Markdown: https://app.scenario.com/api/models/model_joyai-image-edit/markdown

ParameterTypeRequiredDefaultMinMaxAllowed ValuesDescription
promptstringYes----The edit instruction describing what changes to make to the image.
imagefileYes----Input image to edit.
negativePromptstringNo----Text describing what should not appear in the output image.
numInferenceStepsnumberNo30150-Number of denoising steps.
cfgScalenumberNo5115-Controls how closely the output follows the edit instruction.
seednumberNo-02147483647-Random seed for reproducibility.

Physic Edit

Physics-aware image editing with realistic refraction, material changes, and deformations.

Model ID: model_physic-edit

Capabilities: img2img

LLM Markdown: https://app.scenario.com/api/models/model_physic-edit/markdown

ParameterTypeRequiredDefaultMinMaxAllowed ValuesDescription
imagefileYes----Input image to edit with physics-aware transformations.
promptstringYes----Editing instructions describing the physical transformation to apply.
negativePromptstringNo----Description of unwanted elements in the output.
numInferenceStepsnumberNo40150-Number of denoising steps.
cfgScalenumberNo4120-Classifier-free guidance scale.
seednumberNo-02147483647-Random seed for reproducibility.

Z-Image

Z-Image - Highest quality text-to-image and image-to-image generation

Model ID: model_z-image

Capabilities: txt2img, img2img, controlnet

LLM Markdown: https://app.scenario.com/api/models/model_z-image/markdown

ParameterTypeRequiredDefaultMinMaxAllowed ValuesDescription
modelIdmodelNo----A single LoRA model ID. If provided, it will be added to the loras array.
lorasmodel_arrayNo``---List of one or more LoRA model IDs. If modelId is a LoRA and this is not provided, it will be automatically populated with the modelId.
lorasScalenumber_arrayNo-02-Scales for the LoRA weights
promptstringYes----Text prompt for image generation
negativePromptstringNo----Text prompt for what to avoid in the image
imagefileNo----Source image for img2img transformation
strengthnumberNo0.601-How much to transform the reference image. 0 means no change, 1 means complete transformation
controlImagefile_arrayNo----Control images to use as input (edge, depth, or pose)
controlnetConditioningScalenumberNo0.7501-The scale of the controlnet conditioning
numOutputsnumberNo414-Number of images to generate
numInferenceStepsnumberNo281100-Number of inference steps.
widthnumberNo-1282048-The width of the generated images, must be a multiple of 16
heightnumberNo-1282048-The height of the generated images, must be a multiple of 16
guidancenumberNo5110-Controls how closely the generated image follows the prompt
seednumberNo-02147483647-Used to reproduce previous results

Z-Image Turbo

Z-Image Turbo - Fast text-to-image and image-to-image generation model

Model ID: model_z-image-turbo

Capabilities: txt2img, img2img, controlnet

LLM Markdown: https://app.scenario.com/api/models/model_z-image-turbo/markdown

ParameterTypeRequiredDefaultMinMaxAllowed ValuesDescription
modelIdmodelNo----A single LoRA model ID. If provided, it will be added to the loras array.
lorasmodel_arrayNo``---List of one or more LoRA model IDs. If modelId is a LoRA and this is not provided, it will be automatically populated with the modelId.
lorasScalenumber_arrayNo-02-Scales for the LoRA weights
promptstringYes----Text prompt for image generation
negativePromptstringNo----Text prompt for what to avoid in the image
imagefileNo----Source image for img2img transformation. Cannot be used with ControlNet.
strengthnumberNo0.7501-How much to transform the reference image. 0 means no change, 1 means complete transformation
controlImagefile_arrayNo----Control images to use as input (edge, depth, or pose). Cannot be used with Reference Image.
controlnetConditioningScalenumberNo0.701-The scale of the controlnet conditioning
numOutputsnumberNo414-Number of images to generate
numInferenceStepsnumberNo91100-Number of inference steps.
widthnumberNo-1282048-Width of the generated image
heightnumberNo-1282048-Height of the generated image
seednumberNo-02147483647-Used to reproduce previous results