Kandinsky
Kandinsky 5.0 is a fast and high-quality text-to-video generation model. It converts text prompts into videos with customizable resolution and length.
Kandinsky 5.0 is a fast and high-quality text-to-video generation model. It converts text prompts into videos with customizable resolution and length.
Model Overview
A text-to-video diffusion model that generates high-quality videos from text descriptions, supporting multiple aspect ratios and durations.
Best At
The sweet spot is creating short, stylized videos (5-10 seconds) from vivid text prompts with diverse artistic styles.
Limitations / Not Good At
May occasionally exhibit motion inconsistencies or struggle with complex narratives beyond simple moments. Longer durations or intricate instructions might reduce quality.
Ideal Use Cases
- Conceptual demos or storyboards
- Social media shorts (likes TikTok/Reels)
- Artistic experiments & abstract sequences
- Quick explainer animations
Input & Output Format
- Input: Text prompt + optional parameters (resolution, video length, inference steps)
- Output: Generated video file (MP4)
Performance Notes
Fast for typical 5-10s outputs. Higher inference steps add quality but increase generation time. Best suited for quick, single-shot video creation.
Prompt
StringThe text prompt to guide video generation.
Prompt
StringThe text prompt to guide video generation.
Resolution
StringResolution of the generated video in W:H format. One of (768x512, 512x768, 512x512).
768x512Num Inference Steps
NumberThe number of inference steps.
30video_length
String5sOutput
InferredOutput
Type
Node
Status
Official
Package
Nodespell AI
Category
AI / Video / Open SourceInput
Output