Kling O1
Instruction-driven Kling model for transforming existing video content or generating new clips with optional multimodal references.
Model Overview
Kling O1 focuses on natural-language video transformation and controlled generation. It can operate with prompt-only generation, image guidance, or reference-video workflows.
Best At
- Rewriting scene style and content from natural-language instructions.
- Video variation workflows with optional reference images and video.
- Maintaining motion continuity while changing visual context.
Limitations / Not Good At
- Strong edits can drift from original composition intent.
- Reference-heavy runs require careful prompt and asset alignment.
- Results can vary when prompts are underspecified.
Ideal Use Cases
- Iterative video editing with text-driven direction.
- Creative adaptation of existing clips into new visual styles.
- Prompt-led generation workflows with optional reference assets.
Input & Output Format
- Input: required prompt; optional start_image, end_image, reference_images, reference_video, mode, video_reference_type, aspect_ratio, duration, and keep_original_sound.
- Output: generated video URI returned on response.
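As a rough illustration of the input list above, the sketch below collects the required prompt and any supplied optional fields into a plain mapping. The helper name build_inputs and the dict shape are assumptions made for this example, not a documented Nodespell or Kling interface; only the field names come from the list above.

```python
from typing import Any

# Optional field names as listed under "Input & Output Format".
OPTIONAL_FIELDS = (
    "start_image", "end_image", "reference_images", "reference_video",
    "mode", "video_reference_type", "aspect_ratio", "duration",
    "keep_original_sound",
)


def build_inputs(prompt: str, **optional: Any) -> dict[str, Any]:
    """Hypothetical helper: gather node inputs into a plain dict."""
    # prompt is the only required input.
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    # Reject names that are not in the documented input list.
    unknown = set(optional) - set(OPTIONAL_FIELDS)
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")
    inputs: dict[str, Any] = {"prompt": prompt}
    # Keep only the optional fields the caller actually set.
    inputs.update({k: v for k, v in optional.items() if v is not None})
    return inputs
```

Calling build_inputs("Turn the street scene into a snowy night", mode="std") would yield a mapping with just prompt and mode, leaving every other optional field unset.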
Performance Notes
- mode changes quality/cost behavior (std vs pro).
- Video-reference usage can improve continuity for edit-style tasks.
Prompt
String. Text instructions describing desired visual transformation or generation.
Start Image
String. Optional first frame reference image.
End Image
String. Optional final frame reference image. Requires start image.
Reference Images
String. Optional one-to-many image references for scene, style, or subject guidance.
Reference Video
String. Optional reference video for style/camera guidance or base-video editing.
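The End Image entry above depends on Start Image being present. A minimal sketch of checking that rule before a run follows; the function name check_frame_inputs is hypothetical, and whether the node itself rejects such input or silently ignores the end frame is not stated here.

```python
from typing import Optional


def check_frame_inputs(start_image: Optional[str],
                       end_image: Optional[str]) -> None:
    """Hypothetical pre-flight check for the start/end frame inputs."""
    # An end frame is only meaningful when a start frame is also given.
    if end_image is not None and start_image is None:
        raise ValueError("end_image requires start_image")
```

Running the check early keeps a misconfigured end frame from being discovered only after a generation has been submitted.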
Mode
String. Generation quality mode. Default: pro
Video Reference Type
String. Controls whether reference video is used as guidance or as base footage for editing. Default: feature
Aspect Ratio
String. Output aspect ratio when reference framing does not override it. Default: 16:9
Duration
Number. Target output duration in seconds. Default: 5
Keep Original Sound
Boolean. Keep original audio from reference video when available. Default: true
Output
Inferred. Output
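The values shown alongside the parameters above (pro, feature, 16:9, 5, true) can be gathered in one place so that a run overrides only what it needs. The sketch below treats those values as defaults, which is an assumption, and the class name KlingO1Settings is hypothetical rather than part of any Nodespell package.

```python
from dataclasses import asdict, dataclass, replace


@dataclass(frozen=True)
class KlingO1Settings:
    """Hypothetical settings container; values mirror the parameter list."""
    mode: str = "pro"
    video_reference_type: str = "feature"
    aspect_ratio: str = "16:9"
    duration: float = 5
    keep_original_sound: bool = True


# Start from the listed values and override a single field for a draft pass.
defaults = KlingO1Settings()
draft = replace(defaults, mode="std")

# asdict gives a plain mapping that can be merged with prompt and references.
draft_fields = asdict(draft)
```

Keeping the settings immutable makes it easy to compare a std draft and a pro run that differ in exactly one field.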
Nodespell Team
Type: Node
Status: Official
Package: Nodespell AI
Category: AI / Video / Kwaivgi