Kling Text To Speech
Generate high-quality speech from text using AI, with multiple voice options and speed control.
Model Overview
Kling TTS is a text-to-speech model that leverages advanced AI techniques to create high-quality synthetic speech.
Best At
Generates natural-sounding speech across a wide range of voices and languages. It works well for narration, voiceovers, and other applications that need synthetic voices.
Limitations / Not Good At
May struggle with rare or complex prosody, such as highly emotional or dramatic readings. Quality may also vary with extremely long texts or very unusual phrasing.
Ideal Use Cases
- Audiobook narrations
- Voiceovers for videos and presentations
- Accessibility features (e.g., screen readers with customizable voices)
- Interactive voice response systems
- Voice acting for games and animations
Input & Output Format
Input: Text prompt and optional parameters (voice_id for voice selection and voice_speed for speech rate adjustment) via an HTTP POST request.
Output: Generated audio in MP3 format with a download URL provided in the response.
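The request described above can be sketched in Python using only the standard library. This is a minimal illustration, not the official client: the endpoint URL, the JSON request encoding, and the "url" key in the response are all assumptions, since the source names only the text, voice_id, and voice_speed parameters and says the response carries a download URL.

```python
import json
import urllib.request

# Hypothetical endpoint: the source does not give a URL, so replace this.
ENDPOINT = "https://example.invalid/kling-tts"


def build_payload(text, voice_id=None, voice_speed=None):
    """Build the request body; optional parameters are omitted when not set."""
    if not text or not text.strip():
        raise ValueError("text must be a non-empty string")
    payload = {"text": text}
    if voice_id is not None:
        payload["voice_id"] = voice_id
    if voice_speed is not None:
        payload["voice_speed"] = float(voice_speed)
    return payload


def extract_audio_url(response_body):
    """Pull the MP3 download URL out of a decoded JSON response.

    The "url" key is an assumption; adjust it to the real response schema.
    """
    url = response_body.get("url")
    if not url:
        raise KeyError("response did not contain a download URL")
    return url


def synthesize(text, voice_id=None, voice_speed=None, endpoint=ENDPOINT):
    """POST the payload (assumed to be JSON) and return the MP3 download URL."""
    data = json.dumps(build_payload(text, voice_id, voice_speed)).encode("utf-8")
    request = urllib.request.Request(
        endpoint,
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return extract_audio_url(json.load(response))
```

With a real endpoint in place, a call such as synthesize("Hello there", voice_id="genshin_vindi2", voice_speed=1) would return the URL of the generated MP3. Any authentication the service requires would need to be added to the request headers.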
Performance Notes
The model is efficient and designed for quick generation, making it suitable for real-time applications. However, generation time may increase slightly with longer text inputs.
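Since generation time grows with input length, one possible approach for long scripts is to split the text at sentence boundaries and submit each piece as a separate request. The sketch below shows only the splitting step. The function name and the 500-character default are illustrative assumptions, not documented limits of the model.

```python
import re


def split_into_chunks(text, max_chars=500):
    """Group sentences into chunks of roughly max_chars characters.

    Chunks break only after sentence-ending punctuation. A single sentence
    longer than max_chars is kept whole rather than cut mid-word.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars or not current:
            # Either the sentence fits, or it starts a new (possibly oversized) chunk.
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be sent as its own text value, and the resulting MP3 files concatenated afterwards. Whether prosody stays consistent across chunk boundaries would need to be checked by listening to the output.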
Text
String. The text to be converted to speech.
Voice Id
String. The voice ID to use for speech synthesis. Default: genshin_vindi2
Voice Speed
Number. Rate of speech. Default: 1
Output
Inferred
Type
Node
Status
Official
Package
Nodespell AI
Category
AI / Audio / Kuaishou