The Kling 3.0 series models API is now fully available Learn More Get Started Overview Quick Start Changelog API Reference General Info Rate Limits Callback Schema Video Generation Models Video Omni Text to Video Image to Video Reference to Video Motion Control Multi-elements to video Extend Video Lip Sync Avatar Text to Audio Video to Audio Text to Speech Voice Clone Image Recognize Element Effects Effect Templates NEW Video Effects Image Generation Models Image Omni Image Generation Reference to Image Extend Image AI Multi-Shot Virtual Try-On Others Query user info Pricing Billing Info Prepaid Resource Packs Protocols Privacy Policy of API Service Terms of API Service API Service Level Agreement Video to Audio Create Task POST /v1/audio/video-to-audio cURL Copy Collapse curl --request POST \ --url https://api-singapore.klingai.com/v1/audio/video-to-audio \ --header 'Authorization: Bearer ' \ --header 'Content-Type: application/json' \ --data '{ "video_url": "https://p1-kling.klingai.com/kcdn/cdn-kcdn112452/kling-qa-test/20fps-7s.mov", "external_task_id": "", "callback_url": "" }' 200 Copy Collapse { "code": 0, // Error codes; Specific definitions can be found in "Error Code" "message": "string", // Error information "request_id": "string", // Request ID, generated by the system, is used to track requests and troubleshoot problems "data": { "task_id": "string", // Task ID, generated by the system "task_info": { // Task creation parameters "external_task_id": "string" // Customer-defined task ID }, "task_status": "string", // Task status, Enum values: submitted, processing, succeed, failed "created_at": 1722769557708, // Task creation time, Unix timestamp, unit: ms "updated_at": 1722769557708 // Task update time, Unix timestamp, unit: ms } } Request Header Content-Type string Required Default to application/json Data Exchange Format Authorization string Required Authentication information, refer to API authentication Request Body video_id string Optional The ID of the video generated by the Kling AI Either the video_id parameter or the video_url parameter, cannot be empty or have a value at the same time. Only supports videos generated within 30 days and with a duration between 3.0s and 20.0s. video_url string Optional Link for uploaded video Either the video_id parameter or the video_url parameter, cannot be empty or have a value at the same time. Only .mp4/.mov formats are supported. File size does not exceed 100MB. Video duration between 3.0s and 20.0s. sound_effect_prompt string Optional Sound effect prompt Cannot exceed 200 characters bgm_prompt string Optional BGM prompt Cannot exceed 200 characters asmr_mode boolean Optional Default to false Enable ASMR mode; This mode enhances detailed sound effects and is suitable for highly immersive content scenarios true means enabled, false means disabled (default) external_task_id string Optional Customized Task ID Users can provide a customized task ID, which will not overwrite the system-generated task ID but can be used for task queries. Please note that the customized task ID must be unique within a single user account. callback_url string Optional The callback notification address for the result of this task. If configured, the server will actively notify when the task status changes The specific message schema of the notification can be found in Callback Protocol Query Task (Single) GET /v1/audio/video-to-audio/{id} cURL Copy Collapse curl --request GET \ --url https://api-singapore.klingai.com/v1/audio/video-to-audio/{task_id} \ --header 'Authorization: Bearer ' 200 Copy Collapse { "code": 0, // Error codes; Specific definitions can be found in "Error Code" "message": "string", // Error information "request_id": "string", // Request ID, generated by the system, is used to track requests and troubleshoot problems "data": { "task_id": "string", // Task ID, generated by the system "task_status": "string", // Task status, Enum values: submitted, processing, succeed, failed "task_status_msg": "string", // Task status information, displaying the failure reason when the task fails (such as triggering the content risk control of the platform, etc.) "task_info": { // Task creation parameters "external_task_id": "string", // Customer-defined task ID "parent_video": { // Original video information "id": "string", // Original video ID "url": "string", // Original video URL "duration": "string" // Original video duration, unit: s (seconds) } }, "task_result": { "videos": [ { "id": "string", // Generated video ID; globally unique "url": "string", // URL for generating videos (To ensure information security, generated images/videos will be cleared after 30 days. Please make sure to save them promptly.) "duration": "string" // Total video duration, unit: s (seconds) } ], "audios": [ { "id": "string", // Generated audio ID; globally unique, will be cleared after 30 days "url_mp3": "string", // URL for generating audio in MP3 format (To ensure information security, generated audio will be cleared after 30 days. Please make sure to save them promptly.) "url_wav": "string", // URL for generating audio in WAV format (To ensure information security, generated audio will be cleared after 30 days. Please make sure to save them promptly.) "duration_mp3": "string", // Total MP3 audio duration, unit: s (seconds) "duration_wav": "string" // Total WAV audio duration, unit: s (seconds) } ] }, "final_unit_deduction": "string", // The deduction units of task "created_at": 1722769557708, // Task creation time, Unix timestamp, unit: ms "updated_at": 1722769557708 // Task update time, Unix timestamp, unit: ms } } Request Header Content-Type string Required Default to application/json Data Exchange Format Authorization string Required Authentication information, refer to API authentication Path Parameters task_id string Optional The task ID for audio generation Request path parameter, fill the value directly in the request path You can choose to query by external_task_id or task_id external_task_id string Optional Customized Task ID for audio generation The external_task_id filled in when creating the task. You can choose to query by external_task_id or task_id Query Task (List) GET /v1/audio/video-to-audio cURL Copy Collapse curl --request GET \ --url 'https://api-singapore.klingai.com/v1/audio/video-to-audio?pageNum=1&pageSize=30' \ --header 'Authorization: Bearer ' 200 Copy Collapse { "code": 0, // Error codes; Specific definitions can be found in Error codes "message": "string", // Error information "request_id": "string", // Request ID, generated by the system, to track requests and troubleshoot problems "data": [ { "task_id": "string", // Task ID, generated by the system "task_status": "string", // Task status, Enum values: submitted, processing, succeed, failed "task_status_msg": "string", // Task status information, displaying the failure reason when the task fails (such as triggering the content risk control of the platform, etc.) "task_info": { // Task creation parameters "external_task_id": "string", // Customer-defined task ID "parent_video": { // Original video information "id": "string", // Original video ID "url": "string", // Original video URL "duration": "string" // Original video duration, unit: s (seconds) } }, "task_result": { "videos": [ { "id": "string", // Generated video ID; globally unique "url": "string", // URL for generating videos (To ensure information security, generated images/videos will be cleared after 30 days. Please make sure to save them promptly.) "duration": "string" // Total video duration, unit: s (seconds) } ], "audios": [ { "id": "string", // Generated audio ID; globally unique, will be cleared after 30 days "url_mp3": "string", // URL for generating audio in MP3 format (To ensure information security, generated audio will be cleared after 30 days. Please make sure to save them promptly.) "url_wav": "string", // URL for generating audio in WAV format (To ensure information security, generated audio will be cleared after 30 days. Please make sure to save them promptly.) "duration_mp3": "string", // Total MP3 audio duration, unit: s (seconds) "duration_wav": "string" // Total WAV audio duration, unit: s (seconds) } ] }, "final_unit_deduction": "string", // The deduction units of task "created_at": 1722769557708, // Task creation time, Unix timestamp, unit: ms "updated_at": 1722769557708 // Task update time, Unix timestamp, unit: ms } ] } Request Header Content-Type string Required Default to application/json Data Exchange Format Authorization string Required Authentication information, refer to API authentication Query Parameters pageNum int Optional Default to 1 Page number Value range: [1, 1000] pageSize int Optional Default to 30 Number of items per page Value range: [1, 500] Previous chapter:Text to Audio Next chapter:Text to Speech Create Task Query Task (Single) Query Task (List) The Kling 3.0 Series Models API is Now Fully Available – All in One, One for All! Models Available in This Release Kling 3.0 Motion Control, Kling Video 3.0, Kling Video 3.0 Omni, Kling Image 3.0, Kling Image 3.0 Omni Refer to Key Highlights of the Models 3.0 All-in-One: A unified model for multi-modal input and output. Most powerful consistency across the universe: Subject consistency (supports cameo, subject with voice control, i2v + subject) and text consistency. Narrative control at your fingertips: More freedom, precision, and control—up to 15 seconds long, video scene cuts, ultra-high-definition storyboards/images, custom seconds. Upgraded native audio-visual output: Supports multiple speakers and languages (with accents). Kling 3.0 Motion Control Consistent Facial Identity from any angle Complex Emotions faithfully reproduced High fidelity Restoration, Even with Face Occlusions Consistent Facial Clarity Across Dynamic Framing User Guide -> Kling Video 3.0 Compared to 2.6, expected improvements: Supports subject upload in I2V scenarios for enhanced consistency Significant improvement in multi-character referencing, especially for three-person scenarios Supports Japanese, Korean, and Spanish in addition to Chinese and English Capable of generating certain dialects and accents Better distinction and control over different types of audio (speech, sound effects, BGM) Improved text retention in I2V scenarios Supports scene transitions, with up to 6 shots and customizable storyboarding User Guide -> Kling Video 3.0 Omni Compared to O1, expected improvements: Native audio-visual synchronization Supports video subject creation Further improved consistency in reference-based tasks, especially for characters and products Combined capabilities of reference + storyboarding + audio-visual sync significantly enhance usability Supports scene transitions, with up to 6 shots Extended generation duration up to 15 seconds User Guide -> Kling Image 3.0 Highly consistent feature retention Precise response to detail modifications Accurate control over style and tone Rich imaginative capabilities User Guide -> Kling Image 3.0 Omni Enhanced narrative sense New storyboard image set generation, retaining reference image features with scene relevance Direct output of 2K/4K ultra-high-definition images Further improved detail consistency User Guide -> Thank you for your support and understanding! I Got It