Speech Technology
https://x.com/LiuXub/status/1863622470709690575
TAAE: the first Transformer-based Audio AutoEncoder scaled to 1B parameters for neural speech coding!
TAAE achieves state-of-the-art speech quality at ultra-low bitrates of 400 or 700 bits-per-second, delivering reconstruction quality remarkably close to real audio. It sets a new benchmark for efficient and high-quality speech tokenization.
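To put those bitrates in perspective, here is a rough back-of-the-envelope sketch. The comparison baseline (uncompressed 16 kHz, 16-bit mono PCM) and the helper name `bytes_for_duration` are assumptions for illustration and do not come from the post or the paper.

```python
def bytes_for_duration(bitrate_bps: int, seconds: float) -> float:
    """Payload size in bytes for a constant-bitrate stream."""
    return bitrate_bps * seconds / 8

# Assumed reference: uncompressed 16 kHz, 16-bit mono PCM (not stated in the post).
PCM_BPS = 16_000 * 16  # 256,000 bits per second

for bitrate in (400, 700):
    size = bytes_for_duration(bitrate, 60)
    ratio = PCM_BPS / bitrate
    print(f"{bitrate} bps: {size:.0f} bytes per minute, {ratio:.0f}x smaller than PCM")
```

Under that assumption, one minute of speech takes about 3 KB at 400 bps and a little over 5 KB at 700 bps.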
Paper:
https://arxiv.org/abs/2411.19842v1
Demos:
https://stability-ai.github.io/stable-codec-demo/
GitHub:
https://github.com/Stability-AI/stable-codec
Code and pre-trained models will be released to empower the community!
arXiv.org
Scaling Transformers for Low-Bitrate High-Quality Speech Coding
The tokenization of speech with neural audio codec models is a vital part of modern AI pipelines for the generation or understanding of speech, alone or in a multimodal context. Traditionally such...