Back to Nodes
Kling Text To Speech

Kling Text To Speech

Official

Generate speech from text using advanced AI techniques. Offers multiple voice options and speed control. High-quality text-to-speech.

Nodespell AI
AI / Audio / Kuaishou

Generate speech from text using advanced AI techniques. Offers multiple voice options and speed control. High-quality text-to-speech.

Model Overview

Kling TTS is a text-to-speech model that leverages advanced AI techniques to create high-quality synthetic speech.

Best At

Excelling at generating natural-sounding speech across a wide range of voices and languages. It performs well for narrations, voiceovers, and any application requiring synthetic voices.

Limitations / Not Good At

May struggle with very rare or complex prosody in highly emotional or dramatic readings. Also, the quality might vary with extremely long texts or very unusual phrasings.

Ideal Use Cases

  • Audiobook narrations
  • Voiceovers for videos and presentations
  • Accessibility features (e.g., screen readers with customizable voices)
  • Interactive voice response systems
  • Voice acting for games and animations

Input & Output Format

Input: Text prompt and optional parameters (voice_id for voice selection and voice_speed for speech rate adjustment) via an HTTP POST request.
Output: Generated audio in MP3 format with a download URL provided in the response.

Performance Notes

The model is efficient and designed for quick generation, making it suitable for real-time applications. However, generation time may increase slightly with longer text inputs.

Inputs (1)

Text

String

The text to be converted to speech

Multi InputMin: 0Max: 100
Parameters (3)

Text

String

The text to be converted to speech

Default:

Voice Id

String

The voice ID to use for speech synthesis

Default: genshin_vindi2

Voice Speed

Number

Rate of speech

Default: 1
Outputs (1)

Output

Inferred

Output

Nodespell

Nodespell

📍 London

Building the future. Join us!

Type

Node

Status

Official

Package

Nodespell AI

Category

AI / Audio / Kuaishou

Input

Text

Output

Audio

Keywords

Image GenerationLength ControlStructured Output
Use in Workflow