Kokoro 82M is an 82-million-parameter text-to-speech model that beats many TTS APIs while running locally on CPUs, including Apple Silicon ...
Tether’s new toolkit lets developers build AI applications that run entirely on-device, marking an expanded push into ...
Open source TTS models Kokoro, Orpheus, and Piper are tested on symbols, abbreviations, and prosody with CER and MOS results.
Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.
I’m not a tech expert or a content creator. I’m just a regular person who sometimes needs to turn MP4 videos into text.
All core features are implemented and functional.
French AI company Mistral released a new open source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support. The model, which lets ...
This repository provides executables (CPU and GPU versions) that run without Python or any other packages installed. They behave like the original PaddleOCR installed, for example, via pip. The ...
Abstract: Speech synthesis, the technology that converts text into spoken words, has advanced significantly for high-resource languages like English, Spanish, and Mandarin. However, many languages ...