What is Zyphra Zonos?

Zyphra is announcing the beta release of Zonos-v0.1: two real-time text-to-speech models with high-fidelity voice cloning. The release includes a 1.6B-parameter transformer model and a 1.6B-parameter hybrid model, both distributed under the Apache 2.0 license. Audio quality is hard to measure quantitatively, but we believe the output of Zonos matches or exceeds that of leading proprietary TTS systems, and that open access to models of this quality will accelerate TTS research. The model weights are available on Hugging Face, with sample inference code in our GitHub repository. Zonos can also be used through our model playground and API, which offer simple, competitive flat-rate pricing. To illustrate its performance, we have compiled sample comparisons against existing proprietary models.

Pricing

Price Starts At: $0.02 per minute
Free Trial Offered: Yes
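
As a rough illustration of what per-minute pricing means in practice, the sketch below estimates a cost from the listed starting rate of $0.02 per minute. It assumes billing is linear in minutes of generated audio and that the starting rate applies to the whole job; neither is confirmed by this listing, and the function name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Starting rate taken from this listing; actual plans may differ.
STARTING_RATE_USD_PER_MINUTE = Decimal("0.02")


def estimate_cost_usd(audio_minutes, rate=STARTING_RATE_USD_PER_MINUTE):
    """Estimate cost in USD, assuming linear per-minute billing (an assumption)."""
    if audio_minutes < 0:
        raise ValueError("audio_minutes must be non-negative")
    # Decimal avoids binary floating-point rounding surprises with currency.
    cost = Decimal(str(audio_minutes)) * rate
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    print(estimate_cost_usd(90))  # 90 minutes of audio -> 1.80
```

Treat the result as a lower-bound estimate, since the listed figure is a starting price.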

Integrations

Offers API: Yes

Company Facts

Company Name: Zyphra
Company Location: United States
Company Website: www.zyphra.com/post/beta-release-of-zonos-v0-1

Product Details

Deployment: SaaS
Training Options: Documentation Hub
Support: Web-Based Support

Target Company Sizes: Individual, 1-10, 11-50, 51-200, 201-500, 501-1000, 1001-5000, 5001-10000, 10001+
Target Organization Types: Mid Size Business, Small Business, Enterprise, Freelance, Nonprofit, Government, Startup
Supported Languages: English

Zyphra Zonos Categories and Features

Text to Speech Software

API
Adjust Speaking Rate / Pitch
Audio Optimization
Custom Lexicons
Different Voice Choices
Multi-Language Support
Synchronize Speech