Back to Nodes
Kling Text To Speech

Kling Text To Speech

Official

Generate speech from text using advanced AI techniques. Offers multiple voice options and speed control. High-quality text-to-speech.

Nodespell AI
AI / Audio / Kuaishou

Generate speech from text using advanced AI techniques. Offers multiple voice options and speed control. High-quality text-to-speech.

Model Overview

Kling TTS is a text-to-speech model that leverages advanced AI techniques to create high-quality synthetic speech.

Best At

Excelling at generating natural-sounding speech across a wide range of voices and languages. It performs well for narrations, voiceovers, and any application requiring synthetic voices.

Limitations / Not Good At

May struggle with very rare or complex prosody in highly emotional or dramatic readings. Also, the quality might vary with extremely long texts or very unusual phrasings.

Ideal Use Cases

  • Audiobook narrations
  • Voiceovers for videos and presentations
  • Accessibility features (e.g., screen readers with customizable voices)
  • Interactive voice response systems
  • Voice acting for games and animations

Input & Output Format

Input: Text prompt and optional parameters (voice_id for voice selection and voice_speed for speech rate adjustment) via an HTTP POST request.
Output: Generated audio in MP3 format with a download URL provided in the response.

Performance Notes

The model is efficient and designed for quick generation, making it suitable for real-time applications. However, generation time may increase slightly with longer text inputs.

Model Examples (3)

Example Index01 / 03
Example 01

Urgent voicemail

In-world dramatic message for a thriller or mystery scene.

Source Inputs01
Text

If you get this before sunrise, don't come through the front entrance. The keypad is dead, the cameras are looping, and somebody is already inside.

Parameters03
Text
If you get this before sunrise, don't come through the front entrance. The keypad is dead, the cameras are looping, and somebody is already inside.
Voice Id
oversea_male1
Voice Speed
0.98
ttscharacter
Response
Inputs (1)

Text

String

The text to be converted to speech

Multi InputMin: 0Max: 100
Parameters (3)

Text

String

The text to be converted to speech

Default:

Voice Id

String

The voice ID to use for speech synthesis

Default: genshin_vindi2

Voice Speed

Number

Rate of speech

Default: 1
Outputs (1)

Output

Inferred

Output

Nodespell

Nodespell

London

Building the future. Join us!

Creator profile

Type

Node

Status

Official

Package

Nodespell AI

Category

AI / Audio / Kuaishou

Input

Text

Output

Audio

Keywords

Image GenerationLength ControlStructured Output
Use in Workflow