Skip to main content
AI

Kling Video Generation

Kling supports text-to-video, image-to-video, first-last-frame generation, and advanced prompt controls. Select the public model ID for standard or pro quality and silent or audio output.

Supported Models

ModelTierAudio
kling-v3-std-silentStandardSilent output.
kling-v3-std-audioStandardAudio output.
kling-v3-pro-silentProSilent output.
kling-v3-pro-audioProAudio output.

Endpoint

POST /v1/video/generations
GET /v1/videos/{task_id}

Authentication

Authorization: Bearer sk-***
Content-Type: application/json

Request Parameters

ParameterTypeRequiredDescription
modelstringYesOne of the Kling v3 model IDs.
promptstringYesMain prompt. In image mode, do not pass it together with extra_body.multi_prompt.
imagestringConditionalReference image URL. If last_image is present, this is the first frame.
last_imagestringConditionalLast-frame image URL.
aspect_ratiostringNo16:9, 9:16, or 1:1; image-to-video may not guarantee this ratio.
durationintegerNoSource range: 3 to 15 seconds.
extra_body.negative_promptstringNoContent to avoid.
extra_body.cfg_scalenumberNoPrompt adherence strength. Higher values generally follow the prompt more closely.
extra_body.multi_promptarrayNoMulti-shot prompt list for advanced image mode.
extra_body.element_listarrayNoElement references.
extra_body.shot_typestringNocustomize or intelligent.

Text-To-Video Example

curl -X POST "{BASE_URL}/v1/video/generations" \
-H "Authorization: Bearer sk-***" \
-H "Content-Type: application/json" \
-d '{
"model": "kling-v3-std-silent",
"prompt": "A wide cinematic shot of a traveler walking through dusty desert ruins, cloak moving in the wind, slow pullback, natural motion.",
"aspect_ratio": "16:9",
"duration": 5,
"extra_body": {
"cfg_scale": 0.5,
"shot_type": "customize",
"negative_prompt": "blur, low detail"
}
}'

Image-To-Video Example

curl -X POST "{BASE_URL}/v1/video/generations" \
-H "Authorization: Bearer sk-***" \
-H "Content-Type: application/json" \
-d '{
"model": "kling-v3-pro-audio",
"prompt": "The person writes naturally in a notebook, subtle hand detail, small thoughtful pauses, soft classroom depth of field.",
"image": "https://example.com/start-frame.png",
"aspect_ratio": "16:9",
"duration": 6,
"extra_body": {
"cfg_scale": 0.6,
"shot_type": "intelligent",
"negative_prompt": "artifacts, distortion"
}
}'

First-Last-Frame Example

curl -X POST "{BASE_URL}/v1/video/generations" \
-H "Authorization: Bearer sk-***" \
-H "Content-Type: application/json" \
-d '{
"model": "kling-v3-pro-silent",
"prompt": "The portrait subject slowly turns toward the camera while keeping identity stable and motion natural.",
"image": "https://example.com/first-frame.png",
"last_image": "https://example.com/last-frame.png",
"duration": 8
}'

Query Task Status

curl "{BASE_URL}/v1/videos/task_xxx" \
-H "Authorization: Bearer sk-***"
{
"id": "task_xxx",
"task_id": "task_xxx",
"object": "video.generation.job",
"model": "kling-v3-pro-audio",
"status": "completed",
"progress": 100,
"video_url": "https://example.com/output.mp4"
}

Billing Notes

Kling billing depends on model tier, audio mode, output seconds, and generation mode. The task settlement record and current product pricing are the final billing source.

Common Errors

  • Passing both prompt and extra_body.multi_prompt in image mode.
  • Requesting a duration outside 3 to 15.
  • Assuming aspect_ratio is guaranteed in image-to-video mode.
  • Using an audio model when silent output is expected.
  • Passing private media URLs.