The Kling 3.0 series models API is now fully available
Learn More
Get Started
Overview
Quick Start
Changelog
API Reference
General Info
Rate Limits
Callback Schema
Video Generation
Models
Video Omni
Text to Video
Image to Video
Reference to Video
Motion Control
Multi-elements to video
Extend Video
Lip Sync
Avatar
Text to Audio
Video to Audio
Text to Speech
Voice Clone
Image Recognize
Element
Effects
Effect Templates
NEW
Video Effects
Image Generation
Models
Image Omni
Image Generation
Reference to Image
Extend Image
AI Multi-Shot
Virtual Try-On
Others
Query user info
Pricing
Billing Info
Prepaid Resource Packs
Protocols
Privacy Policy of API Service
Terms of API Service
API Service Level Agreement
Lip-Sync
Identify Face
POST
/v1/videos/identify-face
cURL
Copy
Collapse
curl --request POST \
  --url https://api-singapore.klingai.com/v1/videos/identify-face \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
    "video_url": "https://p1-kling.klingai.com/kcdn/cdn-kcdn112452/kling-qa-test/kling20260206mp4.mp4",
    "video_id": ""
  }'
200
Copy
Collapse
{
  "code": 0, // Error codes; Specific definitions can be found in "Error Code"
  "message": "string", // Error information
  "request_id": "string", // Request ID, generated by the system, used to track requests and troubleshoot problems
  "data": {
    "session_id": "id", // Session ID
    "final_unit_deduction": "string", // The deduction units of task
    "face_data": [ //Face data list
      {
        "face_id": "string", // Face ID
        "face_image": "url", // Face image URL
        "start_time": 0, // Face appearance start time, unit: ms
        "end_time": 5200 //Face appearance end time, unit: ms
      }
    ]
  }
}

Identify faces in the video for lip-sync processing.

Request Header
Content-Type
string
Required
Default to application/json

Data Exchange Format

Authorization
string
Required

Authentication information, refer to API authentication

Request Body
video_id
string
Optional

The ID of the video generated by Kling AI

Used to specify the video and determine whether it can be used for lip-sync services.
This parameter and 'video_url' are mutually exclusive—only one can be filled, and neither can be left empty.
Only supports videos generated within the last 30 days with a duration of no more than 60 seconds.
video_url
string
Optional

The URL of the video

Used to specify the video and determine whether it can be used for lip-sync services.
This parameter and 'video_id' are mutually exclusive—only one can be filled, and neither can be left empty.
Supported video formats: .mp4/.mov, file size ≤100MB, duration 2s–60s, resolution 720p or 1080p, with both width and height between 512px–2160px. If validation fails, an error code will be returned.
The system checks video content—if issues are detected, an error code will be returned.
Create Task
POST
/v1/videos/advanced-lip-sync
cURL
Copy
Collapse
curl --request POST \
  --url https://api-singapore.klingai.com/v1/videos/advanced-lip-sync \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
    "session_id": "850508686686064678",
    "face_choose": [
      {
        "face_id": "0",
        "sound_file": "https://p1-kling.klingai.com/kcdn/cdn-kcdn112452/kling-qa-test/go-to-world.mp3",
        "sound_insert_time": 1000,
        "sound_start_time": 0,
        "sound_end_time": 3000,
        "sound_volume": 2,
        "original_audio_volume": 2
      }
    ],
    "external_task_id": "",
    "callback_url": ""
  }'
200
Copy
Collapse
{
  "code": 0, // Error codes; Specific definitions can be found in "Error Code"
  "message": "string", // Error information
  "request_id": "string", // Request ID, generated by the system, used to track requests and troubleshoot problems
  "data": {
    "task_id": "string", // Task ID, generated by the system
    "task_info": { //Task creation parameters
      "external_task_id": "string" //User-defined task ID
    },
    "task_status": "string", // Task status, Enum values: submitted, processing, succeed, failed
    "created_at": 1722769557708, // Task creation time, Unix timestamp, unit: ms
    "updated_at": 1722769557708 //Task update time, Unix timestamp, unit: ms
  }
}
Request Header
Content-Type
string
Required
Default to application/json

Data Exchange Format

Authorization
string
Required

Authentication information, refer to API authentication

Request Body
session_id
string
Required

Session ID generated during the identify face API. It remains unchanged during the selection/editing process.

face_choose
array
Required

Specified Face for Lip-Sync

Includes Face ID, lip movement reference data, etc.
Currently only supports one person lip-sync.
▾
Hide child attributes
face_id
string
Required

Face ID

Returned by the facial recognition interface.
audio_id
string
Optional

Sound ID Generated via TTS API

Only supports audio generated within the last 30 days with a duration of no less than 2 seconds and no more than 60 seconds.
Either audio_id or sound_file must be provided (mutually exclusive; cannot be empty or both populated).
sound_file
string
Optional

Sound File

Supports Base64-encoded audio or accessible audio URL.
Accepted formats: .mp3/.wav/.m4a/.aac (max 5MB). Format mismatches or oversized files will return error codes.
Only supports audio with a duration of no less than 2 seconds and no more than 60 seconds.
Either audio_id or sound_file must be provided (mutually exclusive; cannot be empty or both populated).
The system will verify the audio content and return error codes if there are any problems.
sound_start_time
long
Required

Time point to start cropping sound

Based on the original sound start time, the start time is 0'0", units: ms
The sound before the starting point will be cropped, and the cropped sound must not be shorter than 2 seconds.
sound_end_time
long
Required

Time point to stop cropping sound

Based on the original sound start time, the start time is 0'0", units: ms
The sound after the end point will be cropped, and the cropped sound must not be shorter than 2 seconds.
The end point time shouldn't be later than the total duration of the original sound.
sound_insert_time
long
Required

The time for inserting cropped sound

Based on the original video start time, the start time is 0'0", units: ms
The time range for inserting sound should overlap with the face's lip-sync time interval for at least 2 seconds.
The start time for inserting sound must not be earlier than the start time of the video, and the end time for inserting sound must not be later than the end time of the video.
sound_volume
float
Optional
Default to 1

Volume Controls (higher values = louder)

Value range: [0, 2]
original_audio_volume
float
Optional
Default to 1

Original video volume (higher values = louder)

Value range: [0, 2]
No effect if source video is silent.
watermark_info
object
Optional

Whether to generate watermarked results simultaneously

Defined by the enabled parameter, format:
  "watermark_info": { "enabled": boolean } 
true: generate watermarked result, false: do not generate
Custom watermarks are not currently supported
external_task_id
string
Optional

Custom Task ID

User-defined task ID. It will not override the system-generated task ID, but supports querying tasks by this ID.
Please note that uniqueness must be ensured for each user.
callback_url
string
Optional

The callback notification address for the result of this task. If configured, the server will actively notify when the task status changes.

For specific message schema, see Callback Protocol
Query Task (Single)
GET
/v1/videos/advanced-lip-sync/{id}
cURL
Copy
Collapse
curl --request GET \
  --url https://api-singapore.klingai.com/v1/videos/advanced-lip-sync/{task_id} \
  --header 'Authorization: Bearer <token>'
200
Copy
Collapse
{
  "code": 0, // Error codes; Specific definitions can be found in "Error Code"
  "message": "string", // Error information
  "request_id": "string", // Request ID, generated by the system, used to track requests and troubleshoot problems
  "data": {
    "task_id": "string", // Task ID, generated by the system
    "task_status": "string", // Task status, Enum values: submitted, processing, succeed, failed
    "task_status_msg": "string", // Task status message, displaying the failure reason when the task fails (such as triggering the content risk control of the platform, etc.)
    "task_info": { //Task creation parameters
      "parent_video": { //Original video information
        "id": "string", // Original video ID
        "url": "string", // Original video URL
        "duration": "string" //Original video duration, unit: s
      }
    },
    "task_result": { //Task result
      "videos": [ //Generated video list
        {
          "id": "string", // Generated video ID; globally unique
          "url": "string", // URL for generating videos (Please note that for security purposes, generated images/videos will be deleted after 30 days. Please save them promptly.)
          "watermark_url": "string", // Watermarked video download URL, anti-hotlinking format
          "duration": "string" //Total video duration, unit: s
        }
      ]
    },
    "watermark_info": {
      "enabled": boolean //Whether watermark is enabled
    },
    "final_unit_deduction": "string", // The deduction units of task
    "created_at": 1722769557708, // Task creation time, Unix timestamp, unit: ms
    "updated_at": 1722769557708 //Task update time, Unix timestamp, unit: ms
  }
}
Request Header
Content-Type
string
Required
Default to application/json

Data Exchange Format

Authorization
string
Required

Authentication information, refer to API authentication

Path Parameters
task_id
string
Optional

Task ID for Video Generation - Lip-Sync. Fill the value directly in the request path.

Query Task (List)
GET
/v1/videos/advanced-lip-sync
cURL
Copy
Collapse
curl --request GET \
  --url 'https://api-singapore.klingai.com/v1/videos/advanced-lip-sync?pageNum=1&pageSize=30' \
  --header 'Authorization: Bearer <token>'
200
Copy
Collapse
{
  "code": 0, // Error codes; Specific definitions can be found in "Error Code"
  "message": "string", // Error information
  "request_id": "string", // Request ID, generated by the system, used to track requests and troubleshoot problems
  "data": [
    {
      "task_id": "string", // Task ID, generated by the system
      "task_status": "string", // Task status, Enum values: submitted, processing, succeed, failed
      "task_status_msg": "string", // Task status message, displaying the failure reason when the task fails (such as triggering the content risk control of the platform, etc.)
      "task_info": { //Task creation parameters
        "parent_video": { //Original video information
          "id": "string", // Original video ID
          "url": "string", // Original video URL
          "duration": "string" //Original video duration, unit: s
        }
      },
      "task_result": { //Task result
        "videos": [ //Generated video list
          {
            "id": "string", // Generated video ID; globally unique
            "url": "string", // URL for generating videos (Please note that for security purposes, generated images/videos will be deleted after 30 days. Please save them promptly.)
            "watermark_url": "string", // Watermarked video download URL, anti-hotlinking format
            "duration": "string" //Total video duration, unit: s
          }
        ]
      },
      "watermark_info": {
        "enabled": boolean //Whether watermark is enabled
      },
      "final_unit_deduction": "string", // The deduction units of task
      "created_at": 1722769557708, // Task creation time, Unix timestamp, unit: ms
      "updated_at": 1722769557708 //Task update time, Unix timestamp, unit: ms
    }
  ]
}
Request Header
Content-Type
string
Required
Default to application/json

Data Exchange Format

Authorization
string
Required

Authentication information, refer to API authentication

Query Parameters
pageNum
int
Optional
Default to 1

Page number

Value range: [1, 1000]
pageSize
int
Optional
Default to 30

Number of items per page

Value range: [1, 500]
Previous chapter：Extend Video
Next chapter：Avatar
Identify Face
Create Task
Query Task (Single)
Query Task (List)
The Kling 3.0 Series Models API is Now Fully Available
– All in One, One for All！

Models Available in This Release

Kling 3.0 Motion Control, Kling Video 3.0, Kling Video 3.0 Omni, Kling Image 3.0, Kling Image 3.0 Omni

Refer to <Kling AI Series 3.0 Model API Specification>

Key Highlights of the Models

3.0 All-in-One: A unified model for multi-modal input and output.

Most powerful consistency across the universe: Subject consistency (supports cameo, subject with voice control, i2v + subject) and text consistency.
Narrative control at your fingertips: More freedom, precision, and control—up to 15 seconds long, video scene cuts, ultra-high-definition storyboards/images, custom seconds.
Upgraded native audio-visual output: Supports multiple speakers and languages (with accents).

Kling 3.0 Motion Control

Consistent Facial Identity from any angle
Complex Emotions faithfully reproduced
High fidelity Restoration, Even with Face Occlusions
Consistent Facial Clarity Across Dynamic Framing

User Guide ->

Kling Video 3.0

Compared to 2.6, expected improvements:

Supports subject upload in I2V scenarios for enhanced consistency
Significant improvement in multi-character referencing, especially for three-person scenarios
Supports Japanese, Korean, and Spanish in addition to Chinese and English
Capable of generating certain dialects and accents
Better distinction and control over different types of audio (speech, sound effects, BGM)
Improved text retention in I2V scenarios
Supports scene transitions, with up to 6 shots and customizable storyboarding

User Guide ->

Kling Video 3.0 Omni

Compared to O1, expected improvements:

Native audio-visual synchronization
Supports video subject creation
Further improved consistency in reference-based tasks, especially for characters and products
Combined capabilities of reference + storyboarding + audio-visual sync significantly enhance usability
Supports scene transitions, with up to 6 shots
Extended generation duration up to 15 seconds

User Guide ->

Kling Image 3.0

Highly consistent feature retention
Precise response to detail modifications
Accurate control over style and tone
Rich imaginative capabilities

User Guide ->

Kling Image 3.0 Omni

Enhanced narrative sense
New storyboard image set generation, retaining reference image features with scene relevance
Direct output of 2K/4K ultra-high-definition images
Further improved detail consistency

User Guide ->

Thank you for your support and understanding!

I Got It