=== sound ===
ase refer to the capability 
map mentioned above.


video_list
	
array
	
Optional
	
None
	
Reference video, supplied as a link to the uploaded video.
●The video can serve as a feature reference or as the video to be edited; the default is the video to be edited. The video's original sound can optionally be retained.
○The refer_type parameter sets the reference video type: feature marks a feature reference video, and base marks the video to be edited.
○When the reference video is the video to be edited, the start and end frames of the video cannot be defined.
○The keep_original_sound parameter controls whether the video's original sound is kept: yes retains it and no discards it. This parameter also applies to feature reference videos.
●When a reference video is provided, the sound parameter can only be off.
●Pass the value as key:value pairs, as follows:
"video_list":[
  {
    "video_url":"video_url",
    "refer_type":"base",
    "keep_original_sound":"yes"
  }
]
●Only .mp4 and .mov formats are supported.
●The video duration must be at least 3 seconds and at most 10 seconds.
●Video width and height must each be between 720px and 2160px (inclusive).
●The input frame rate must be between 24 and 60 fps; the output is 24 fps.
●Only 1 video can be uploaded, and its size must not exceed 200MB.
●The video_url parameter must not be empty.
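The constraints above can be checked on the client before a task is submitted. The following Python sketch is illustrative only: the helper names build_video_entry and check_video_file are hypothetical, and the file properties (extension, duration, size, and so on) are assumed to have been measured by the caller.

```python
from typing import List


def build_video_entry(video_url: str, keep_original_sound: str,
                      refer_type: str = "base") -> dict:
    """Build one video_list entry (hypothetical helper, not part of the API)."""
    if not video_url:
        raise ValueError("video_url must not be empty")
    if refer_type not in ("feature", "base"):
        raise ValueError("refer_type must be 'feature' or 'base'")
    if keep_original_sound not in ("yes", "no"):
        raise ValueError("keep_original_sound must be 'yes' or 'no'")
    return {
        "video_url": video_url,
        "refer_type": refer_type,
        "keep_original_sound": keep_original_sound,
    }


def check_video_file(extension: str, duration_s: float, width: int,
                     height: int, fps: float, size_mb: float) -> List[str]:
    """Return a list of violated constraints; an empty list means none found."""
    problems = []
    if extension.lower() not in (".mp4", ".mov"):
        problems.append("format must be .mp4 or .mov")
    if not 3 <= duration_s <= 10:
        problems.append("duration must be between 3 and 10 seconds")
    if not (720 <= width <= 2160 and 720 <= height <= 2160):
        problems.append("width and height must each be 720px to 2160px")
    if not 24 <= fps <= 60:
        problems.append("frame rate must be between 24 and 60 fps")
    if size_mb > 200:
        problems.append("file size must not exceed 200MB")
    return problems


entry = build_video_entry("https://example.com/clip.mp4", "yes")
video_list = [entry]  # only 1 video can be supplied
```

A request body that carries this video_list would also need sound left at off, as noted above.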
Different model versions support diff

================================================================================

=== mode ===
KlingAI Series 3.0 Model API Specification
Table of Contents
Kling AI Series 3.0 Model API Specification
Capability Map
Video Generation
Image Generation
API
Omni-Video - Create Task
Request Header
Request Body
Response Body
Invocation examples
Image-to-video with multi-shot
Text-to-video with multi-shot
Omni-Video - Query Task (Single)
Request Header
Request Path Parameters
Request Body
Response Body
Omni-Video - Query Task (List)
Request Header
Query Parameters
Request Body
Response Body
Text to Video - Create Task
Request Header
Request Body
Response Body
Invocation examples
Text-to-video with multi-shot
Text to Video - Query Task (Single)
Request Header
Request Path Parameters
Request Body
Response Body
Text to Video - Query Task (List)
Request Header
Query Parameters
Request Body
Response Body
Image to Video - Create Task
Request Header
Request Body
Response Body
Invocation examples
Image-to-video with multi-shot
Image-to-video with voice of element
Image to Video - Query Task (Single)
Request Header
Request Path Parameters
Request Body
Response Body
Image to Video - Query Task (List)
Request Header
Query Parameters
Request Body
Response Body
Omni-Image - Create Task
Request Header
Request Body
Response Body

================================================================================

=== aspect_ratio ===
e
●Enum values: std, pro
●std: Standard Mode, generates 720P videos and is cost-effective
●pro: Professional Mode, generates 1080P videos with higher output quality
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.


aspect_ratio
	
string
	
Optional
	
None
	
The aspect ratio of the generated video frame (width:height)
●Enum values: 16:9, 9:16, 1:1
●This parameter is required when neither the first-frame reference nor the video editing feature is used.


duration
	
string
	
Optional
	
5
	
Video length, in seconds
●Enum values: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
○When using the video editing function ("refer_type": "base"), the output duration matches the duration of the input video and this parameter has no effect. Billing is calculated by rounding the input video duration to the nearest integer.
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.
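
As a rough illustration of the duration rules above, the Python sketch below returns the duration that would apply to a task. The function name effective_duration is hypothetical, and the handling of exact half-second ties (rounded up here) is an assumption, since the text only says "nearest integer".

```python
import math
from typing import Optional

# Allowed values of the duration parameter, as strings ("3" through "15").
ALLOWED_DURATIONS = {str(n) for n in range(3, 16)}


def effective_duration(duration: str = "5",
                       input_video_seconds: Optional[float] = None) -> int:
    """Return the duration in seconds that applies to the task.

    When a video to be edited is supplied ("refer_type": "base"), the
    duration parameter has no effect and the input video length is used,
    rounded to the nearest integer (ties rounded up: an assumption).
    """
    if input_video_seconds is not None:
        return math.floor(input_video_seconds + 0.5)
    if duration not in ALLOWED_DURATIONS:
        raise ValueError("duration must be one of '3' to '15'")
    return int(duration)
```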


================================================================================

=== duration ===
r the shot-type parameter is intelligence, the 
current parameter must not be empty.
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.


multi_prompt
	
array
	
Optional
	
None
	
Information about each storyboard (shot), such as its prompt and duration
●The index, prompt, and duration parameters define each shot's sequence number, prompt text, and duration, where:
○A minimum of 1 and a maximum of 6 storyboards are supported.
○The content of each storyboard has a maximum length of 512.
○The duration of each storyboard must be at least 1 and must not exceed the total duration of the current task.
○The sum of the durations of all storyboards must equal the total duration of the current task.
●Pass the value as key:value pairs, as follows:
"multi_prompt":[
  {
    "index":int,
    "prompt": "string",
    "duration": "5"
  },
  {
    "index":int,
    "prompt": "string",
    "duration": "5"
  }
]
●When the multi_shot parameter is set to true and the shot_type parameter is set to customize, the current parameter must not be empty.
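
The storyboard rules above lend themselves to a small pre-flight check. This Python sketch is one possible way to express them; validate_multi_prompt is a hypothetical helper, and measuring the 512 limit in characters is an assumption because the text does not name the unit.

```python
from typing import List


def validate_multi_prompt(multi_prompt: List[dict], total_duration: int) -> List[str]:
    """Return a list of rule violations; an empty list means none were found."""
    errors = []
    if not 1 <= len(multi_prompt) <= 6:
        errors.append("between 1 and 6 storyboards are required")
    for shot in multi_prompt:
        seconds = int(shot["duration"])  # duration is passed as a string
        # Assumption: the 512 limit is counted in characters.
        if len(shot["prompt"]) > 512:
            errors.append(f"shot {shot['index']}: prompt exceeds 512")
        if seconds < 1 or seconds > total_duration:
            errors.append(f"shot {shot['index']}: duration out of range")
    if sum(int(shot["duration"]) for shot in multi_prompt) != total_duration:
        errors.append("storyboard durations must sum to the total duration")
    return errors


shots = [
    {"index": 1, "prompt": "A wide shot of a harbor at dawn", "duration": "3"},
    {"index": 2, "prompt": "Close-up of a fishing boat leaving", "duration": "2"},
]
problems = validate_multi_prompt(shots, total_duration=5)
```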


image_list
	
array
	
Optional
	
None
	
Reference image list
●Includes reference images of the element, scene, style, etc. An image can also serve as the start or end frame of the generated video. When an image is used as the start or end frame:
○Use the type parameter to mark whether the image is the first or the last frame: 
first_frame is the s

================================================================================

=== refer_type ===
eo_list
	
array
	
Optional
	
None
	
Reference video, supplied as a link to the uploaded video.
●The video can serve as a feature reference or as the video to be edited; the default is the video to be edited. The video's original sound can optionally be retained.
○The refer_type parameter sets the reference video type: feature marks a feature reference video, and base marks the video to be edited.
○When the reference video is the video to be edited, the start and end frames of the video cannot be defined.
○The keep_original_sound parameter controls whether the video's original sound is kept: yes retains it and no discards it. This parameter also applies to feature reference videos.
●When a reference video is provided, the sound parameter can only be off.
●Pass the value as key:value pairs, as follows:
"video_list":[
  {
    "video_url":"video_url",
    "refer_type":"base",
    "keep_original_sound":"yes"
  }
]
●Only .mp4 and .mov formats are supported.
●The video duration must be at least 3 seconds and at most 10 seconds.
●Video width and height must each be between 720px and 2160px (inclusive).
●The input frame rate must be between 24 and 60 fps; the output is 24 fps.
●Only 1 video can be uploaded, and its size must not exceed 200MB.
●The video_url parameter must not be empty.
Different model versions support different scopes. For details, please refer to the capability 
m

================================================================================

=== keep_original_sound ===
The keep_original_sound parameter also applies to feature reference videos.
●When a reference video is provided, the sound parameter can only be off.
●Pass the value as key:value pairs, as follows:
"video_list":[
  {
    "video_url":"video_url",
    "refer_type":"base",
    "keep_original_sound":"yes"
  }
]
●Only .mp4 and .mov formats are supported.
●The video duration must be at least 3 seconds and at most 10 seconds.
●Video width and height must each be between 720px and 2160px (inclusive).
●The input frame rate must be between 24 and 60 fps; the output is 24 fps.
●Only 1 video can be uploaded, and its size must not exceed 200MB.
●The video_url parameter must not be empty.
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.


sound
	
string
	
Optional
	
off
	
Whether sound is generated together with the video
●Enum values: on, off
Only model versions V2.6 and later support this parameter.


mode
	
string
	
Optional
	
pro
	
Video generation mode
●Enum values: std, pro
●std: Standard Mode, generates 720P videos and is cost-effective
●pro: Professional Mode, generates 1080P videos with higher output quality
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.


aspect_ratio
	
string
	
Optional
	
None
	
The aspect ratio of the generated video frame (width:height

================================================================================

=== element_id ===
odel versions support different scopes. For details, please refer to the capability 
map mentioned above.


element_list
	
array
	
Optional
	
None
	
Reference element list
●Configured by element ID. Pass the value as key:value pairs, as follows:
"element_list":[
  {
    "element_id":long
  },
  {
    "element_id":long
  }
]
●The number of reference elements allowed depends on whether a reference video is present and on the number of reference images:
○When generating a video from the first frame, or from the first and last frames, a maximum of 3 elements is supported.
○When there is a reference video, the number of reference images plus the number of reference elements must not exceed 4, and video elements are not supported.
○When there is no reference video, the number of reference images plus the number of reference elements must not exceed 7.
●Elements are divided into video customization elements (named video character elements) and image customization elements (named Multi-Image Elements), each with a distinct scope of application. Take care to distinguish between them.
●For more details on elements, refer to the Kling Element Library User Guide.
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.
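
The count limits above can be summarized in a few lines of Python. This is a sketch under stated assumptions: check_reference_counts and its parameter names are hypothetical, and it covers only the numeric limits, not the distinction between video character elements and Multi-Image Elements.

```python
from typing import List


def check_reference_counts(num_images: int, num_elements: int,
                           has_reference_video: bool,
                           generating_from_frames: bool = False) -> List[str]:
    """Check the image and element count limits described above.

    generating_from_frames marks the case where the video is generated
    from a first frame, or from first and last frames.
    """
    errors = []
    limit = 4 if has_reference_video else 7
    if num_images + num_elements > limit:
        errors.append(f"images plus elements must not exceed {limit}")
    if generating_from_frames and num_elements > 3:
        errors.append("at most 3 elements when generating from frame images")
    return errors


# Two reference images and two elements alongside a reference video.
result = check_reference_counts(2, 2, has_reference_video=True)
```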


video_list
	
array
	
Optional
	
None
	
Reference Video, get link for uploaded video.
●It can be used 

================================================================================

=== image_url ===
ame, which means that an end frame image requires a first frame image as well.
○Video editing functions cannot be used when generating a video from the first frame or from the first and last frames.
●Pass the value as key:value pairs, as follows:
"image_list":[
  {
    "image_url":"image_url",
    "type":"first_frame"
  },
  {
    "image_url":"image_url",
    "type":"end_frame"
  }
]
●Images can be supplied as Base64-encoded data or as an image URL (the URL must be accessible).
●Supported image formats: .jpg / .jpeg / .png.
●The image file size must not exceed 10MB, the width and height must each be at least 300px, and the aspect ratio must be between 1:2.5 and 2.5:1.
●The number of reference images allowed depends on whether a reference video is present and on the number of reference elements:
○When there is a reference video, the number of reference images plus the number of reference elements must not exceed 4.
○When there is no reference video, the number of reference images plus the number of reference elements must not exceed 7.
○Setting an end frame is not supported when there are more than 2 images.
●The image_url parameter must not be empty.
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.
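
The frame-related rules for image_list can be checked structurally before sending a request. The Python sketch below is illustrative; validate_image_list is a hypothetical helper, and treating the type key as absent for ordinary reference images is an assumption.

```python
from typing import List


def validate_image_list(image_list: List[dict]) -> List[str]:
    """Return rule violations for an image_list value; empty means none found."""
    errors = []
    # Assumption: ordinary reference images carry no "type" key.
    types = [image.get("type") for image in image_list]
    if any(not image.get("image_url") for image in image_list):
        errors.append("image_url must not be empty")
    if "end_frame" in types and "first_frame" not in types:
        errors.append("an end frame requires a first frame")
    if "end_frame" in types and len(image_list) > 2:
        errors.append("an end frame is not supported with more than 2 images")
    return errors


frames = [
    {"image_url": "https://example.com/start.png", "type": "first_frame"},
    {"image_url": "https://example.com/end.png", "type": "end_frame"},
]
frame_errors = validate_image_list(frames)
```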


element_list
	
array
	
Optional
	
None
	
Reference Element List
●Based on element ID configuration. Load with key:value, details as 

================================================================================

=== video_url ===
yes indicating retention and no indicating non-retention. The keep_original_sound parameter also applies to feature reference videos.
●When a reference video is provided, the sound parameter can only be off.
●Pass the value as key:value pairs, as follows:
"video_list":[
  {
    "video_url":"video_url",
    "refer_type":"base",
    "keep_original_sound":"yes"
  }
]
●Only .mp4 and .mov formats are supported.
●The video duration must be at least 3 seconds and at most 10 seconds.
●Video width and height must each be between 720px and 2160px (inclusive).
●The input frame rate must be between 24 and 60 fps; the output is 24 fps.
●Only 1 video can be uploaded, and its size must not exceed 200MB.
●The video_url parameter must not be empty.
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.


sound
	
string
	
Optional
	
off
	
Whether sound is generated together with the video
●Enum values: on, off
Only model versions V2.6 and later support this parameter.


mode
	
string
	
Optional
	
pro
	
Video generation mode
●Enum values: std, pro
●std: Standard Mode, generates 720P videos and is cost-effective
●pro: Professional Mode, generates 1080P videos with higher output quality
Different model versions support different scopes. For details, please refer to the capability 
map mentioned above.


aspect_ratio
	
string
	
Optional


================================================================================

=== first_frame ===
al
	
None
	
Reference image list
●Includes reference images of the element, scene, style, etc. An image can also serve as the start or end frame of the generated video. When an image is used as the start or end frame:
○Use the type parameter to mark whether the image is the first or the last frame: first_frame is the start frame, end_frame is the end frame.
■Supplying only an end frame is not currently supported: an end frame image requires a first frame image as well.
○Video editing functions cannot be used when generating a video from the first frame or from the first and last frames.
●Pass the value as key:value pairs, as follows:
"image_list":[
  {
    "image_url":"image_url",
    "type":"first_frame"
  },
  {
    "image_url":"image_url",
    "type":"end_frame"
  }
]
●Images can be supplied as Base64-encoded data or as an image URL (the URL must be accessible).
●Supported image formats: .jpg / .jpeg / .png.
●The image file size must not exceed 10MB, the width and height must each be at least 300px, and the aspect ratio must be between 1:2.5 and 2.5:1.
●The number of reference images allowed depends on whether a reference video is present and on the number of reference elements:
○When there is a reference video, the number of reference images plus the number of reference elements must not exceed 4.
○When there is no reference video, the sum of the amount of reference image and the 
amount of reference eleme

================================================================================

=== end_frame ===

●Includes reference images of the element, scene, style, etc. An image can also serve as the start or end frame of the generated video. When an image is used as the start or end frame:
○Use the type parameter to mark whether the image is the first or the last frame: first_frame is the start frame, end_frame is the end frame.
■Supplying only an end frame is not currently supported: an end frame image requires a first frame image as well.
○Video editing functions cannot be used when generating a video from the first frame or from the first and last frames.
●Pass the value as key:value pairs, as follows:
"image_list":[
  {
    "image_url":"image_url",
    "type":"first_frame"
  },
  {
    "image_url":"image_url",
    "type":"end_frame"
  }
]
●Images can be supplied as Base64-encoded data or as an image URL (the URL must be accessible).
●Supported image formats: .jpg / .jpeg / .png.
●The image file size must not exceed 10MB, the width and height must each be at least 300px, and the aspect ratio must be between 1:2.5 and 2.5:1.
●The number of reference images allowed depends on whether a reference video is present and on the number of reference elements:
○When there is a reference video, the number of reference images plus the number of reference elements must not exceed 4.
○When there is no reference video, the number of reference images plus the number of reference elements must not exceed 7.
○Settin

================================================================================

=== Content-Type ===
-


element control
(only multi-image elements)
	
✅
	
-


others
	
-
	
-
API
Omni-Video - Create Task
Protocol
	
https


Request URL
	
/v1/videos/omni-video


Request Method
	
POST


Request Format
	
application/json


Response Format
	
application/json
Request Header
Field
	
Value
	
Description 


Content-Type
	
application/json
	
Data Exchange Format


Authorization
	
Authentication information, refer 
to API authentication
	
Authentication information, refer 
to API authentication
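
To make the protocol, path, method, and headers above concrete, the Python sketch below assembles a Create Task request without sending it. The host api.example.com is a placeholder, the Bearer scheme in the Authorization header is an assumption (the page defers to the API authentication document), and build_create_task_request is a hypothetical helper name.

```python
import json
import urllib.request

# Placeholder host: the real API domain is not given in this section.
BASE_URL = "https://api.example.com"


def build_create_task_request(api_token: str, body: dict) -> urllib.request.Request:
    """Assemble (but do not send) a POST request to /v1/videos/omni-video."""
    return urllib.request.Request(
        url=BASE_URL + "/v1/videos/omni-video",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: the token is passed with the Bearer scheme.
            "Authorization": f"Bearer {api_token}",
        },
        method="POST",
    )


request = build_create_task_request(
    "YOUR_API_TOKEN",
    {"model_name": "kling-v3-omni", "prompt": "A lighthouse in a storm"},
)
```

Passing the resulting object to urllib.request.urlopen would send it; that step is left out here because the host and token are placeholders.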
Request Body
Field
	
Type
	
Required Field
	
Default Value
	
Description 


model_name
	
string
	
Optional
	
kling-video-o1
	
Model Name
●Enum values: kling-video-o1, kling-v3-omni


multi_shot
	
boolean
	
Optional
	
false
	
Whether to generate a multi-shot video
●true: the prompt parameter is ignored
●false: the shot_type and multi_prompt parameters are ignored


shot_type
	
string
	
Optional
	
None
	
Storyboard method
●Enum values: customize
●This parameter is required when the multi_shot parameter is set to true.
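
The interaction between multi_shot, shot_type, prompt, and multi_prompt can be sketched as a small consistency check. The Python below is illustrative only; check_shot_settings and ignored_fields are hypothetical helper names, and the multi_prompt rule is taken from that parameter's own description.

```python
from typing import List


def check_shot_settings(body: dict) -> List[str]:
    """Return missing-parameter errors for the multi-shot settings."""
    errors = []
    if body.get("multi_shot", False):
        if not body.get("shot_type"):
            errors.append("shot_type is required when multi_shot is true")
        if body.get("shot_type") == "customize" and not body.get("multi_prompt"):
            errors.append("multi_prompt must not be empty for customize")
    return errors


def ignored_fields(body: dict) -> List[str]:
    """List supplied parameters that have no effect for this multi_shot value."""
    if body.get("multi_shot", False):
        candidates = ["prompt"]
    else:
        candidates = ["shot_type", "multi_prompt"]
    return [name for name in candidates if name in body]


body = {"multi_shot": True, "shot_type": "customize", "prompt": "unused text"}
errors = check_shot_settings(body)
unused = ignored_fields(body)
```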


prompt
	
string
	
Optional
	
None
	
Text prompt, which can include positive and negative descriptions
●The prompt can be templated to meet different video generation needs.
📌
The Omni model can achieve various capabilities through a prompt that references elements, images, videos, and other content.
1.Specify an element, image, or video in the format <<<>>>, such as <<<element_1>>>, <<<image_1>>>, <<<video_1>>>.
2.For more information, please refer to: Kling VIDEO 3.0 Omni Model 

================================================================================