Video Models
kling-video-o1 | std(3s~10s) | pro(3s~10s) | |
|---|---|---|---|
text to video | single-shot-video generation | ✅(only 5s、10s) | ✅(only 5s、10s) |
voice control | ❌ | ❌ | |
others | - | - | |
image to video | single-shot-video generation (only start frame) | ✅(only 5s、10s) | ✅(only 5s、10s) |
start & end frame | ✅ | ✅ | |
element control (only multi-image elements) | ✅ | ✅ | |
cideo reference (including multi-image elements) | ✅ | ✅ | |
voice control | ❌ | ❌ | |
others | - | - | |
kling-v3-omni | std(3s~15s) | pro(3s~15s) | |
|---|---|---|---|
text to video | single-shot-video generation | ✅ | ✅ |
multi-shot-video generation | ✅ | ✅ | |
voice control | ❌ | ❌ | |
others | - | - | |
image to video | single-shot-video generation | ✅ | ✅ |
multi-shot-video generation | ✅ | ✅ | |
start & end frame | ✅ | ✅ | |
element control (video character elements & multi-image elements) | ✅ | ✅ | |
reference video | ✅(only 3s~10s) | ✅(only 3s~10s) | |
voice control | ❌ | ❌ | |
others | - | - | |
kling-v1 | std 5s | std 10s | pro 5s | pro10s | |
|---|---|---|---|---|---|
text to video | video generation | ✅ | ✅ | ✅ | ✅ |
camera control | ✅ | - | - | - | |
image to video | video generation | ✅ | ✅ | ✅ | ✅ |
start/end frame | ✅ | - | ✅ | - | |
motion brush | ✅ | - | ✅ | - | |
others | - | - | - | - | |
video extension (Not supported negative_prompt and cfg_scale) | ✅ | ✅ | ✅ | ✅ | |
video effects Dual-character: Hug, Kiss, heart_gesture | ✅ | ✅ | ✅ | ✅ | |
others | - | - | - | - | |
kling-v1-5 | std 5s | std 10s | pro 5s | pro10s | |
|---|---|---|---|---|---|
text to video | video generation | - | - | - | - |
others | - | - | - | - | |
image to video | video generation | ✅ | ✅ | ✅ | ✅ |
start/end frame | - | - | ✅ | ✅ | |
end frame | - | - | ✅ | ✅ | |
motion brush | - | - | ✅ | - | |
camera control (simple only) | - | - | ✅ | - | |
others | - | - | - | - | |
video extension | ✅ | ✅ | ✅ | ✅ | |
video effects Dual-character: Hug, Kiss, heart_gesture | ✅ | ✅ | ✅ | ✅ | |
others | - | - | - | - | |
kling-v1-6 | std 5s | std 10s | pro 5s | pro10s | |
|---|---|---|---|---|---|
text to video | video generation | ✅ | ✅ | ✅ | ✅ |
others | - | - | - | - | |
image to video | video generation | ✅ | ✅ | ✅ | ✅ |
start/end frame | - | - | ✅ | ✅ | |
end frame | - | - | ✅ | ✅ | |
others | - | - | - | - | |
multi-image2video | ✅ | ✅ | ✅ | ✅ | |
multi-elements | ✅ | ✅ | ✅ | ✅ | |
video extension | ✅ | ✅ | ✅ | ✅ | |
video effects Dual-character: Hug, Kiss, heart_gesture | ✅ | ✅ | ✅ | ✅ | |
kling-v2-master | 5s | 10s | |
|---|---|---|---|
text to video | video generation | ✅ | ✅ |
others | - | - | |
image to video | video generation | ✅ | ✅ |
others | - | - | |
others | - | - | |
kling-v2-1 | std 5s | std 10s | pro 5s | pro10s | |
|---|---|---|---|---|---|
text to video | all | - | - | - | - |
image to video | video generation | ✅ | ✅ | ✅ | ✅ |
start/end frame | - | - | ✅ | ✅ | |
others | - | - | - | - | |
others | - | - | - | - | |
kling-v2-1-master | 5s | 10s | |
|---|---|---|---|
text to video | video generation | ✅ | ✅ |
others | - | - | |
image to video | video generation | ✅ | ✅ |
others | - | - | |
others | - | - | |
kling-v2-5-turbo | std 5s | std 10s | pro 5s | pro10s | |
|---|---|---|---|---|---|
text to video | video generation | ✅ | ✅ | ✅ | ✅ |
others | - | - | - | - | |
image to video | video generation | ✅ | ✅ | ✅ | ✅ |
start/end frame | - | - | ✅ | ✅ | |
others | - | - | - | - | |
others | - | - | - | - | |
kling-v2-6 | std 5s | std 10s | std x other duration | pro 5s | pro10s | pro x other duration | |
|---|---|---|---|---|---|---|---|
text to video | video generation | ✅ (only no audio) | ✅ (only no audio) | - | ✅ | ✅ | - |
others | - | - | - | - | - | - | |
image to video | video generation | ✅ (only no audio) | ✅ (only no audio) | - | ✅ | ✅ | - |
start/end frame | - | - | - | ✅ (only no audio) | ✅ (only no audio) | - | |
voice control | - | - | - | ✅ | ✅ | - | |
motion control | - | - | ✅ | - | - | ✅ | |
others | - | - | - | - | - | - | |
kling-v3 | std(3~15s) | pro(3~15s) | |
|---|---|---|---|
text to video | single-shot-video generation | ✅ | ✅ |
multi-shot-video generation | ✅ | ✅ | |
voice control | ❌ | ❌ | |
others | - | - | |
image to video | single-shot-video generation (only start frame) | ✅ | ✅ |
multi-shot-video generation | ✅ | ✅ | |
start & end frame | ✅ | ✅ | |
element control (video character elements & multi-image elements) | ✅ | ✅ | |
motion control | ✅ | ✅ | |
voice control | ❌ | ❌ | |
others | - | - | |
no related of model | support or not | description |
|---|---|---|
avatar | ✅ | Generate digital human broadcast-style videos with just one photo |
lip sync | ✅ | Can be combined with text or audio to drive the mouth shape of characters in the video |
video to audio | ✅ | Supports adding audio to all videos generated by Kling models and user-uploaded videos |
text to audio | - | Supports generating audio by text prompts |
others | - | - |
Model | kling-v1 | kling-v1-5 | kling-v1-6 Image to Video | kling-v1-6 Text to Video | kling-v2 Master | ||||
|---|---|---|---|---|---|---|---|---|---|
Mode | STD | PRO | STD | PRO | STD | PRO | STD | PRO | - |
Resolution | 720p | 720p | 720p | 1080p | 720p | 1080p | 720p | 1080p | 720p |
Frame Rate | 30fps | 30fps | 30fps | 30fps | 30fps | 30fps | 24fps | 24fps | 24fps |
Model | kling-v2-1 Image to Video | kling-v2-1 Master | kling-v2-5 Image to Video | kling-v2-5 Text to Video | |
|---|---|---|---|---|---|
Mode | STD | PRO | - | PRO | PRO |
Resolution | 720p | 1080p | 1080p | 1080p | 1080p |
Frame Rate | 24fps | 24fps | 24fps | 24fps | 24fps |