Skip to main content
AI

Wan 2.6 Video Generation

Wan 2.6 covers text-to-video, image-to-video, and reference-to-video model families. The source scope also includes a duration reference table for the Wan 2.6 public models.

Supported Models

ModelTypeDuration rules
wan2.6-video-720pText-to-video / image-to-videoRequired; 5, 10, or 15.
wan2.6-video-1080pText-to-video / image-to-videoRequired; 5, 10, or 15.
wan2.6-i2v-flash-720p-audioImage-to-video with audioRequired; any integer from 5 to 15.
wan2.6-i2v-flash-720p-silentImage-to-video silentRequired; any integer from 5 to 15.
wan2.6-i2v-flash-1080p-audioImage-to-video with audioRequired; any integer from 5 to 15.
wan2.6-i2v-flash-1080p-silentImage-to-video silentRequired; any integer from 5 to 15.
wan2.6-r2v-720pReference-to-videoRequired; see account model configuration.
wan2.6-r2v-1080pReference-to-videoRequired; see account model configuration.
wan2.6-r2v-flash-720p-audioReference-to-video with audioRequired; see account model configuration.
wan2.6-r2v-flash-720p-silentReference-to-video silentRequired; see account model configuration.
wan2.6-r2v-flash-1080p-audioReference-to-video with audioRequired; see account model configuration.
wan2.6-r2v-flash-1080p-silentReference-to-video silentRequired; see account model configuration.

Endpoint

POST /v1/video/generations
GET /v1/videos/{task_id}

Authentication

Authorization: Bearer sk-***
Content-Type: application/json

Request Parameters

ParameterTypeRequiredDescription
modelstringYesWan 2.6 model ID.
promptstringYesPrompt.
imagestringConditionalReference image for image-to-video.
imagesstring[]ConditionalReference images for R2V models.
videosstring[]ConditionalReference videos for R2V models.
durationintegerYesSee the supported model table.
sizestringNoOutput size. Text-to-video can use this to choose landscape or portrait. Image-to-video follows the reference image orientation.
extra_body.negative_promptstringNoContent to avoid.
extra_body.shot_typestringNosingle or multi.
extra_body.seedintegerNo-1 for random; fixed value for reproducibility.
extra_body.audiostringNoPublic audio URL.
extra_body.enable_prompt_expansionbooleanNoPrompt expansion switch for R2V models.

Common size values include 1280*720, 720*1280, 1920*1080, and 1080*1920.

Text-To-Video Example

curl -X POST "{BASE_URL}/v1/video/generations" \
-H "Authorization: Bearer sk-***" \
-H "Content-Type: application/json" \
-d '{
"model": "wan2.6-video-1080p",
"prompt": "sunrise",
"duration": 5,
"size": "1080*1920",
"extra_body": {
"negative_prompt": "blur, watermark",
"shot_type": "multi",
"seed": -1
}
}'

Reference Video Example

curl -X POST "{BASE_URL}/v1/video/generations" \
-H "Authorization: Bearer sk-***" \
-H "Content-Type: application/json" \
-d '{
"model": "wan2.6-r2v-flash-720p-audio",
"prompt": "Keep the same subject identity and cinematic style, generate a faster-paced new shot.",
"videos": [
"https://example.com/reference-shot-1.mp4"
],
"size": "1280*720",
"duration": 5,
"extra_body": {
"negative_prompt": "watermark, blur, flicker",
"audio": "https://example.com/guide-audio.mp3",
"shot_type": "single",
"enable_prompt_expansion": false,
"seed": -1
}
}'

Multi-Image Reference Example

curl -X POST "{BASE_URL}/v1/video/generations" \
-H "Authorization: Bearer sk-***" \
-H "Content-Type: application/json" \
-d '{
"model": "wan2.6-r2v-flash-1080p-silent",
"prompt": "Keep the same product identity and style, generate a clean marketing video.",
"images": [
"https://example.com/ref-1.png",
"https://example.com/ref-2.png"
],
"size": "1920*1080",
"duration": 10,
"extra_body": {
"negative_prompt": "watermark, blur",
"shot_type": "multi",
"enable_prompt_expansion": true,
"seed": 42
}
}'

Query Task Status

curl "{BASE_URL}/v1/videos/task_xxx" \
-H "Authorization: Bearer sk-***"
{
"id": "task_xxx",
"task_id": "task_xxx",
"object": "video.generation.job",
"model": "wan2.6-video-1080p",
"status": "completed",
"progress": 100,
"video_url": "https://example.com/output.mp4"
}

Billing Notes

Wan 2.6 billing depends on model family, output resolution, duration, audio mode, and whether the task is text/image/video reference generation. Use settlement records for final accounting.

Common Errors

  • Using a duration value outside the selected model's allowed range.
  • Expecting size to control portrait or landscape for image-to-video; image orientation follows the reference image.
  • Passing private media URLs.
  • Using R2V fields with a non-R2V model.
  • Using audio with a silent model.