Files that you have generated using Text to Speech or Voice Changer can be downloaded either as MP3 or WAV files. WAV files ...
You can download the generated files in two ways: You can download a generated file immediately by clicking the download butt...
In Speech Synthesis, using the website, you can generate up to 5,000 characters in a single generation on any paid plan and ...
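For longer scripts, one option is to split the text into pieces that each fit within a single generation. The following is a minimal sketch, assuming the 5,000-character figure quoted above for paid plans; `split_for_generation` and `MAX_CHARS` are hypothetical names used for illustration, not part of any official tool.

```python
import re

# Assumption: per-generation limit quoted above for paid plans on the website.
MAX_CHARS = 5000


def split_for_generation(text: str, max_chars: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most max_chars, preferring sentence boundaries."""
    # Split after sentence-ending punctuation that is followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-split any single sentence that is itself longer than the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip() if current else sentence
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # Adding this sentence would exceed the limit, so start a new chunk.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Splitting on sentence boundaries, rather than at a fixed character offset, keeps each chunk readable on its own, which matters because the model relies on surrounding context when deciding how to deliver a line.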
The way the AI is currently trained doesn't allow for selecting the specific language you want the AI to speak via a tag or m...
Any voice can speak any language currently supported by the AI; however, if you do not use a voice that is native to the lang...
If you want to force a certain pronunciation, you can use SSML phoneme tags with our English v1 and Turbo v2 models. You can ...
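As an illustration of the phoneme tags mentioned above, here is a minimal sketch that builds a standard SSML `<phoneme>` element in Python. The helper name `phoneme` is hypothetical, and the `"ipa"` alphabet value is an assumption taken from the W3C SSML specification; check which alphabets your chosen model accepts before relying on it.

```python
from xml.sax.saxutils import escape, quoteattr


def phoneme(word: str, ph: str, alphabet: str = "ipa") -> str:
    """Wrap a word in an SSML phoneme tag carrying the desired pronunciation.

    Assumption: `alphabet` defaults to "ipa" as defined by the SSML spec;
    the set of alphabets a given model honors is not confirmed here.
    """
    # quoteattr adds the surrounding quotes and escapes unsafe characters.
    return (
        f"<phoneme alphabet={quoteattr(alphabet)} ph={quoteattr(ph)}>"
        f"{escape(word)}</phoneme>"
    )


text = f"I would like a {phoneme('tomato', 'təˈmɑːtəʊ')} sandwich."
```

The resulting string can be pasted into the text field or sent as the input text in place of the plain word.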
Mispronunciations can happen for a few different reasons. The most common one is that the word is just misspelled. The AI wil...
There are a few ways to introduce a pause or break and influence the rhythm and cadence of the speaker. The most consistent w...
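If you are preparing text programmatically, pauses can be inserted with a small helper. This sketch assumes the standard SSML `<break>` element is accepted in the input text; the helper names `pause` and `with_pauses` are hypothetical, and the maximum pause length a model honors is not stated here.

```python
def pause(seconds: float) -> str:
    """Return an SSML break tag for the given duration in seconds."""
    if seconds <= 0:
        raise ValueError("pause duration must be positive")
    # The :g format drops trailing zeros, so 1.0 becomes "1" and 1.5 stays "1.5".
    return f'<break time="{seconds:g}s" />'


def with_pauses(parts: list[str], seconds: float = 1.0) -> str:
    """Join text fragments with the same pause between each pair."""
    return f" {pause(seconds)} ".join(parts)


script = with_pauses(["Hold on.", "Let me think."], seconds=1.5)
```

Very frequent or very long breaks may affect output stability, so it is worth testing a short passage before processing a whole script.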
In most cases, however, we strongly recommend writing out numbers, symbols, and acronyms fully to ensure that the AI has the ...
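Writing things out by hand works for short texts; for larger batches, a small preprocessing step can do the same job. The following is a rough sketch, not an official normalizer: the lookup tables `SYMBOLS` and `ACRONYMS` are hypothetical examples you would replace with your own, and only whole numbers from 0 to 999 are converted.

```python
import re

ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
    "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
    "sixteen", "seventeen", "eighteen", "nineteen",
]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]

# Hypothetical lookup tables; extend them to match your own content.
SYMBOLS = {"%": " percent", "&": " and "}
ACRONYMS = {"FAQ": "F A Q", "TTS": "text to speech"}


def number_to_words(n: int) -> str:
    """Spell out an integer between 0 and 999 in English."""
    if not 0 <= n <= 999:
        raise ValueError("only 0-999 is supported in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + (f"-{ONES[ones]}" if ones else "")
    hundreds, rest = divmod(n, 100)
    words = f"{ONES[hundreds]} hundred"
    return words + (f" {number_to_words(rest)}" if rest else "")


def spell_out(text: str) -> str:
    """Replace known symbols, acronyms, and small whole numbers with words."""
    for symbol, spoken in SYMBOLS.items():
        text = text.replace(symbol, spoken)
    acronym_pattern = r"\b(" + "|".join(map(re.escape, ACRONYMS)) + r")\b"
    text = re.sub(acronym_pattern, lambda m: ACRONYMS[m.group(1)], text)
    # Skip decimals, thousands-separated numbers, and anything over 3 digits.
    text = re.sub(
        r"(?<![\d.,])\b\d{1,3}\b(?![.,]?\d)",
        lambda m: number_to_words(int(m.group())),
        text,
    )
    # Collapse the extra spaces introduced by the symbol replacements.
    return re.sub(r"\s+", " ", text).strip()
```

Numbers the sketch does not recognize, such as decimals or years, are left untouched so they can be reviewed and written out manually.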
Unfortunately, at this time, we do not offer download-based deduction as an alternative to generation-based deduction. There ...
Our latest model, Turbo v2.5, is a highly optimized model, specifically tailored for low-latency applications without sacrif...
All pre-made voices and generated voices are English. This means that they might not have the correct accent or pronunciation...
Any voice can speak any of the supported languages. The way the current model works is that you don't select a specific langu...
This is currently not possible.
The model is sensitive to the wider situation surrounding each utterance - it assesses whether something makes sense by how i...
We plan to introduce features that support emotional expressions, such as laughter, in the future.
We are working on features that will allow for speed optimization.