We recently launched realtime ultra-realistic voices using a redesigned version of Sesame’s CSM-1B model. To learn more about using Sesame voices, see the Sesame page.
- Price Tiers: Voices are grouped into two price tiers: Standard and Premium. Premium voices come with an additional per-minute charge.
- Spelling: Voices with the Spelling tag are optimized to spell words and numbers naturally.
Can’t find a voice that fits your needs? Contact us or let us know on Discord and we can explore adding a voice that works for you.
Our Enterprise tier comes with custom-trained voices.