KlingAI Series 3.0 Model API Specification

Kling AI Series 3.0 Model API Specification

Capability Map
- Video Generation
- Image Generation

API
- Omni-Video - Create Task
  - Request Header
  - Request Body
  - Response Body
  - Invocation examples
    - Image-to-video with multi-shot
    - Text-to-video with multi-shot
- Omni-Video - Query Task (Single)
  - Request Header
  - Request Path Parameters
  - Request Body
  - Response Body
- Omni-Video - Query Task (List)
  - Request Header
  - Query Parameters
  - Request Body
  - Response Body
- Text to Video - Create Task
  - Request Header
  - Request Body
  - Response Body
  - Invocation examples
    - Text-to-video with multi-shot
- Text to Video - Query Task (Single)
  - Request Header
  - Request Path Parameters
  - Request Body
  - Response Body
- Text to Video - Query Task (List)
  - Request Header
  - Query Parameters
  - Request Body
  - Response Body
- Image to Video - Create Task
  - Request Header
  - Request Body
  - Response Body
  - Invocation examples
    - Image-to-video with multi-shot
    - Image-to-video with voice of element
- Image to Video - Query Task (Single)
  - Request Header
  - Request Path Parameters
  - Request Body
  - Response Body
- Image to Video - Query Task (List)
  - Request Header
  - Query Parameters
  - Request Body
  - Response Body
- Omni-Image - Create Task
  - Request Header
  - Request Body
  - Response Body
  - Invocation examples
    - Image generation with element
- Omni-Image - Query Task (Single)
  - Request Header
  - Request Path Parameters
  - Request Body
  - Response Body
- Omni-Image - Query Task (List)
  - Request Header
  - Query Parameters
  - Request Body
  - Response Body
- Image Generation - Create task
  - Request Header
  - Request Body
  - Response Body
  - Invocation examples
    - Image generation with element
- Image Generation - Query Task (Single)
  - Request Header
  - Request Path Parameters
  - Request Body
  - Response Body
- Image Generation - Query Task (List)
  - Request Header
  - Query Parameters
  - Request Body
  - Response Body
- General - Create Element
  - Request Header
  - Request Body
  - Response Body
  - Invocation examples
    - Create Multi-Image Elements
    - Create Video Character Elements
- General - Query Custom Element (Single)
  - Request Header
  - Request Path Parameters
  - Request Body
  - Response Body
  - Invocation examples
    - Query a specific custom entity
- General - Query Custom Element (List)
  - Request Header
  - Request Body
  - Query Parameters
  - Response Body
  - Invocation examples
    - List query of custom elements
- General - Query Presets Element (List)
  - Request Header
  - Query Parameters
  - Request Body
  - Response Body
- General - Delete Custom Element
  - Request Header
  - Request Body
  - Response Body

Prepaid Resource Packs

text to video	single-shot-video generation	✅	✅
		multi-shot-video generation	✅	✅
		voice control	❌	❌
		others	-	-
image to video	single-shot-video generation （only start frame）	✅	✅
		multi-shot-video generation	✅	✅
		start & end frame	✅	✅
		element control （video character elements & multi-image elements）	✅	✅
		motion control	（coming soon）	（coming soon）
		voice control	❌	❌
		others	-	-

Image Generation

kling-v3-omni			custom aspect ratio（1K/2K/4K）	intelligent aspect ratio
text to image	single-image generation	✅	✅
text to image		others	-	-
image to image	single-image generation	✅	✅
		series-image generation	✅	✅
		element control （only multi-image elements）	✅	✅
		others	-	-

kling-image-o1			custom aspect ratio（1K/2K）	intelligent aspect ratio
text to image	single-image generation	✅	-
text to image		others	-	-
image to image	single-image generation	✅	✅
		element control （only multi-image elements）	✅	✅
		others	-	-

kling-v3			custom aspect ratio（1K/2K）	intelligent aspect ratio
text to image	single-image generation	✅	-
text to image		others	-	-
image to image	single-image generation	✅	-
		element control （only multi-image elements）	✅	-
		others	-	-

API

Omni-Video - Create Task

Protocol	https
Request URL	/v1/videos/omni-video
Request Method	POST
Request Format	application/json
Response Format	application/json

Request Header

Field	Value	Description
Content-Type	application/json	Data Exchange Format
Authorization	Authentication information, refer to API authentication	Authentication information, refer to API authentication

Request Body

Field	Type	Required Field	Default Value	Description
model_name	string	Optional	kling-video-o1	Model Name ●Enum values：kling-video-o1, kling-v3-omni
multi_shot	boolea n	Optional	false	Whether to generate multi-shot video ●true: the prompt parameter is invalid ●false: the shot_type parameter and the multi_prompt parameter are invalid
shot_type	string	Optional	None	Storyboard method ●Enum values：customize ●When the multi_shot parameter is set to true, the current parameter is required.
prompt	string	Optional	None	Text prompt words, which can include positive and negative descriptions ●The prompt words can be templated to meet different video generation needs. 📌 The Omni model can achieve various capabilities through Prompt with elements, images, videos, and other content. 1.Specify a element, image, or video in the format of<<<>>, such as<<element_1>>>,<<<image_1>>>,<<<video_1>>>. 2.For more information, please refer to: Kling VIDEO 3.0 Omni Model User Guide ●Cannot exceed 2500 characters. ●When the multi_shot parameter is false or the shot-type parameter is intelligence, the current parameter must not be empty. Different model versions support different scopes. For details, please refer to the capability map mentioned above.
multi_prompt	array	Optional	None	Information about each storyboard, such as prompts and duration ●Define the shot sequence number, corresponding prompt word, and duration through the index, prompt, and duration parameters, where: ○Supports up to 6 storyboards, with a minimum of 1 storyboard. ○The maximum length of the content for each storyboard 512. ○The duration of each storyboard should not exceed the total duration of the current task, and it must not be less than 1. ○The sum of the durations of all storyboards equals the total duration of the current task. ●Load with key:value, details as follows: JSON 自动换行折叠 xxxxxxxxxx 1 "multi_prompt":[ 2 { 3 "index":int, 4 "prompt": "string", 5 "duration": "5" 6 }, 7 { 8 "index":int, 9 "prompt": "string", 10 "duration": "5" 11 } 12 ] ●When the mult_shot parameter is set to true and the shot_type parameter is set to customize, the current parameter must not be empty.
image_list	array	Optional	None	Reference Image List ●Including reference images of the element, scene, style, etc., it can also be used as the start or end frame to generate videos; When generating a video as the start or end frame: ○Define whether the image is in the first and last frames using the type parameter: first_frame is the start frame, end_frame is the end frame. ■Currently does not support only the end frame, which means that when there is a end frame image, there must be a first frame image. ○When generating a video using the first frame or the first and last frames, video editing functions cannot be used. ●Load with key:value, details as follows: Plain Text 自动换行折叠 xxxxxxxxxx 1 "image_list":[ 2 { 3 "image_url":"image_url", 4 "type":"first_frame" 5 }, 6 { 7 "image_url":"image_url", 8 "type":"end_frame" 9 } 10 ] ●Supports inputting image Base64 encoding or image URL (ensure accessibility). ●Supported image formats include.jpg / .jpeg / .png. ●The image file size cannot exceed 10MB, and the width and height dimensions of the image shall not be less than 300px, and the aspect ratio of the image should be between 1:2.5 ~ 2.5:1. ●The amount of reference image is related to whether there is refrence video and the amount of reference element: ○When there are reference video, the sum of the amount of reference image and the amount of reference element shall not exceed 4. ○When there is no reference video, the sum of the amount of reference image and the amount of reference element shall not exceed 7. ○Setting an end frame is not supported when there are more than 2 images. ●The value of image_url parameter must not be empty. Different model versions support different scopes. For details, please refer to the capability map mentioned above.
element_list	array	Optional	None	Reference Element List ●Based on element ID configuration. Load with key:value, details as follows: Plain Text 自动换行折叠 xxxxxxxxxx 1 "element_list":[ 2 { 3 "element_id":long 4 }, 5 { 6 "element_id":long 7 } 8 ] ●The amount of reference element is related to whether there is reference video and the amount of reference image: ○ When using raw video from the first frame or raw video from the first and last frames, a maximum of 3 subjects is supported. ○ When there is a reference video, the sum of the number of reference images and the number of reference subjects must not exceed 4, and the use of video subjects is not supported. ○ When there is no reference video, the sum of the number of reference images and the number of reference subjects must not exceed 7. ●The elements are categorized into video customization element (named as video character elements) and image customization elements (named as Multi-Image Elements), each with distinct scopes of application. Please exercise caution in distinguishing between them. ●For more detailed information on the subject, please refer to: Kling Element Library User Guide. Different model versions support different scopes. For details, please refer to the capability map mentioned above.
video_list	array	Optional	None	Reference Video, get link for uploaded video. ●It can be used as a reference video for feature or as a video to be edited, with the default being the video to be edited; Selective retention of video original sound. ○Distinguish reference video types based on the refer_type parameter: feature is the feature reference video, base is the video to be edited. ○When the reference video is a video to be edited, the start and end frames of the video cannot be defined. ○Select whether to keep the video original sound through the parameter keep_original_stound, with yes indicating retention and no indicating non retention; The current parameters also apply to the feature reference video. ●When there is a reference video, the value of the sound parameter can only be off. ●Load with key:value, details as follows: Plain Text 自动换行折叠 xxxxxxxxxx 1 "video_list":[ 2 { 3 "video_url":"video_url", 4 "refer_type":"base", 5 "keep_original_sound":"yes" 6 } 7 ] ●Only .mp4/.mov formats are supported. ●Only supports videos with a duration of ≥ 3 seconds and ≤ 10 seconds. ●Video resolution must be between 720px and 2160px (inclusive) in both width and height. ●Only supports videos with frame rates of 24 ~ 60 fps, the output result is 24 fps. ●Only supports 1 video can be uploaded, with a video size not exceeding 200MB. ●The value of video_url parameter must not be empty. Different model versions support different scopes. For details, please refer to the capability map mentioned above.
sound	string	Optional	off	Is sound generated simultaneously when generating videos ●Enum values: on, off Only V2.6 and subsequent versions of the model supports the current parameters
mode	string	Optional	pro	Video generation mode ●Enum values: std, pro ●std: Standard Mode, generating 720P videos, which is cost-effective ●pro: Professional Mode, generating 1080P videos, which is higher quality video output Different model versions support different scopes. For details, please refer to the capability map mentioned above.
aspect_ratio	string	Optional	None	The aspect ratio of the generated video frame (width:height) ●Enum values：16:9, 9:16, 1:1 ●This parameter is required when the first-frame reference or video editing features are not used.
duration	string	Optional	5	Video Length, unit: s (seconds) ●Enum values: 3，4，5，6，7，8，9，10，11，12，13，14，15: ○When using the video editing function ("refer_date": "base"), the output result is the same as the duration of the incoming video, and the current parameter is invalid. Calculate billing by rounding the input video duration to the nearest integer. Different model versions support different scopes. For details, please refer to the capability map mentioned above.

页眉

页脚