{
  "Parameters": "Please note, if you use the Base64 method, make sure all image data parameters you pass are in Base64 encoding format. When submitting data, do not add any prefixes to the Base64-encoded string, such as data:image/png;base64,. The correct parameter format should be the Base64-encoded string itself. Please provide only the Base64-encoded string portion so that the system can correctly process and parse your data.\nSupported image formats: .jpg / .jpeg / .png. The image file size cannot exceed 10MB, and the width and height dimensions of the image shall not be less than 300px, and the aspect ratio of the image should be between 1:2.5 ~ 2.5:1.\nPrevious chapter：Voice Clone\nNext chapter：Element\nThe Kling 3.0 Series Models API is Now Fully Available\n– All in One, One for All！\nModels Available in This Release\nKling 3.0 Motion Control, Kling Video 3.0, Kling Video 3.0 Omni, Kling Image 3.0, Kling Image 3.0 Omni\nRefer to <Kling AI Series 3.0 Model API Specification>\nKey Highlights of the Models\n3.0 All-in-One: A unified model for multi-modal input and output.\nMost powerful consistency across the universe: Subject consistency (supports cameo, subject with voice control, i2v + subject) and text consistency.\nNarrative control at your fingertips: More freedom, precision, and control—up to 15 seconds long, video scene cuts, ultra-high-definition storyboards/images, custom seconds.\nUpgraded native audio-visual output: Supports multiple speakers and languages (with accents).\nKling 3.0 Motion Control\nConsistent Facial Identity from any angle\nComplex Emotions faithfully reproduced\nHigh fidelity Restoration, Even with Face Occlusions\nConsistent Facial Clarity Across Dynamic Framing\nUser Guide ->\nKling Video 3.0\nCompared to 2.6, expected improvements:\nSupports subject upload in I2V scenarios for enhanced consistency\nSignificant improvement in multi-character referencing, especially for three-person scenarios\nSupports Japanese, Korean, and Spanish in addition to Chinese and English\nCapable of generating certain dialects and accents\nBetter distinction and control over different types of audio (speech, sound effects, BGM)\nImproved text retention in I2V scenarios\nSupports scene transitions, with up to 6 shots and customizable storyboarding\nUser Guide ->\nKling Video 3.0 Omni\nCompared to O1, expected improvements:\nNative audio-visual synchronization\nSupports video subject creation\nFurther improved consistency in reference-based tasks, especially for characters and products\nCombined capabilities of reference + storyboarding + audio-visual sync significantly enhance usability\nSupports scene transitions, with up to 6 shots\nExtended generation duration up to 15 seconds\nUser Guide ->\nKling Image 3.0\nHighly consistent feature retention\nPrecise response to detail modifications\nAccurate control over style and tone\nRich imaginative capabilities\nUser Guide ->\nKling Image 3.0 Omni\nEnhanced narrative sense\nNew storyboard image set generation, retaining reference image features with scene relevance\nDirect output of 2K/4K ultra-high-definition images\nFurther improved detail consistency\nUser Guide ->\nThank you for your support and understanding!\nI Got It"
}