Overview
Destined Voice uses advanced neural TTS models to generate natural-sounding speech from any speaker in our library.Single Synthesis
Generate audio for a single text:Batch Synthesis
Generate multiple audio files in one request:Audio Format
Generated audio uses these specifications:| Property | Value |
|---|---|
| Format | WAV |
| Sample Rate | 24,000 Hz |
| Bit Depth | 16-bit |
| Channels | Mono |
Character Limits
| Tier | Characters/Request | Characters/Month |
|---|---|---|
| Starter | 500 | 1,000 |
| Pro | 2,000 | 100,000 |
| Enterprise | 5,000 | 1,000,000 |
Usage Tracking
Monitor your character usage:Best Practices
Batch similar requests
Batch similar requests
Group multiple synthesis requests into batch jobs for better performance.
Preprocess text
Preprocess text
Clean and normalize text before synthesis. Remove special characters and format numbers as words.
Cache audio
Cache audio
Store generated audio URLs. Re-synthesis of the same text with the same speaker produces identical audio.
Handle long text
Handle long text
For text longer than the character limit, split into sentences and combine audio files.