Kling Text To Speech

Official

Generate speech from text using advanced AI techniques. Offers multiple voice options and speed control. High-quality text-to-speech.

Nodespell AI

AI / Audio / Kuaishou

Generate speech from text using advanced AI techniques. Offers multiple voice options and speed control. High-quality text-to-speech.

Model Overview

Kling TTS is a text-to-speech model that leverages advanced AI techniques to create high-quality synthetic speech.

Best At

Excelling at generating natural-sounding speech across a wide range of voices and languages. It performs well for narrations, voiceovers, and any application requiring synthetic voices.

Limitations / Not Good At

May struggle with very rare or complex prosody in highly emotional or dramatic readings. Also, the quality might vary with extremely long texts or very unusual phrasings.

Ideal Use Cases

Audiobook narrations
Voiceovers for videos and presentations
Accessibility features (e.g., screen readers with customizable voices)
Interactive voice response systems
Voice acting for games and animations

Input & Output Format

Input: Text prompt and optional parameters (voice_id for voice selection and voice_speed for speech rate adjustment) via an HTTP POST request.
Output: Generated audio in MP3 format with a download URL provided in the response.

Performance Notes

The model is efficient and designed for quick generation, making it suitable for real-time applications. However, generation time may increase slightly with longer text inputs.

Model Examples (3)

Example Index01 / 03

Example 01

Urgent voicemail

In-world dramatic message for a thriller or mystery scene.

Open

Source Inputs01

Text

If you get this before sunrise, don't come through the front entrance. The keypad is dead, the cameras are looping, and somebody is already inside.

Parameters03

Text

If you get this before sunrise, don't come through the front entrance. The keypad is dead, the cameras are looping, and somebody is already inside.

Voice Id

oversea_male1

Voice Speed

0.98

ttscharacter

Response

Inputs (1)

Text

String

The text to be converted to speech

Multi InputMin: 0Max: 100

Parameters (3)

Text

String

The text to be converted to speech

Default:

Voice Id

String

The voice ID to use for speech synthesis

Default: genshin_vindi2

Voice Speed

Number

Rate of speech

Default: 1

Outputs (1)

Output

Inferred

Output

Nodespell

London

Building the future. Join us!

nodespell.com nodespell.app NodespellAI

Creator profile

Type

Node

Status

Official

Package

Nodespell AI

Keywords

Image GenerationLength ControlStructured Output

Use in Workflow