Close Menu
  • Home
  • Opinion
  • Region
    • Africa
    • Asia
    • Europe
    • Middle East
    • North America
    • Oceania
    • South America
  • AI & Machine Learning
  • Robotics & Automation
  • Space & Deep Tech
  • Web3 & Digital Economies
  • Climate & Sustainability Tech
  • Biotech & Future Health
  • Mobility & Smart Cities
  • Global Tech Pulse
  • Cybersecurity & Digital Rights
  • Future of Work & Education
  • Trend Radar & Startup Watch
  • Creator Economy & Culture
What's Hot

The Studio Show XDR Is Apple’s Boldest Show Improve Ever

March 5, 2026

Qwen {Hardware} Head: "One-Sentence Activity Completion" to Drive AI Glasses Demand

March 5, 2026

AI startup Intron expands speech recognition to 57 languages

March 5, 2026
Facebook X (Twitter) Instagram LinkedIn RSS
NextTech NewsNextTech News
Facebook X (Twitter) Instagram LinkedIn RSS
  • Home
  • Africa
  • Asia
  • Europe
  • Middle East
  • North America
  • Oceania
  • South America
  • Opinion
Trending
  • The Studio Show XDR Is Apple’s Boldest Show Improve Ever
  • Qwen {Hardware} Head: "One-Sentence Activity Completion" to Drive AI Glasses Demand
  • AI startup Intron expands speech recognition to 57 languages
  • YuanLab AI Releases Yuan 3.0 Extremely: A Flagship Multimodal MoE Basis Mannequin, Constructed for Stronger Intelligence and Unequalled Effectivity
  • Novari referral tech deployed at Niagara Well being
  • 👨🏿‍🚀TechCabal Each day – Africa’s first Tesla dealership is open
  • DoorDash expands retail class, presents Joe Contemporary as first-ever companion
  • Alifor launches partnership with Piat, research in Nigeria
Thursday, March 5
NextTech NewsNextTech News
Home - Asia - Alibaba Tongyi Unveils Enjoyable-CosyVoice3.5 and Enjoyable-AudioGen-VD With FreeStyle Voice Technology
Asia

Alibaba Tongyi Unveils Enjoyable-CosyVoice3.5 and Enjoyable-AudioGen-VD With FreeStyle Voice Technology

NextTechBy NextTechMarch 3, 2026No Comments3 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
Alibaba Tongyi Unveils Enjoyable-CosyVoice3.5 and Enjoyable-AudioGen-VD With FreeStyle Voice Technology
Share
Facebook Twitter LinkedIn Pinterest Email



Alibaba’s Tongyi Lab speech group has launched two new fashions—Enjoyable-CosyVoice3.5 and Enjoyable-AudioGen-VD—each supporting “FreeStyle” instruction-based voice era by way of pure language instructions.

In keeping with Alibaba Group, the fashions permit customers to generate and management voice output straight via textual content prompts—whether or not fine-tuning vocal expression or designing completely new timbres and soundscapes from scratch. Whereas each fashions assist pure language-controlled speech synthesis, they aim completely different use instances: Enjoyable-CosyVoice3.5 focuses on multilingual voice cloning and fine-grained expressive management, whereas Enjoyable-AudioGen-VD facilities on voice design and immersive scene-based audio era.

Enjoyable-CosyVoice3.5 upgrades the corporate’s Instruct-TTS capabilities, enabling customers to generate speech freely with a single sentence of instruction. Customers can describe supply type in pure language—akin to “sound extra decided,” “decrease the pitch barely and gradual the tempo,” or “add delicate emotional variation”—and the mannequin interprets and renders the specified impact.

The mannequin now provides assist for Thai, Indonesian, Portuguese, and Vietnamese. Alibaba claims that throughout 13 languages, Enjoyable-CosyVoice3.5 maintains industry-leading efficiency in Phrase Error Fee (WER) and speaker similarity (SpkSim) benchmarks. It has additionally been optimized for uncommon characters and complicated sentences, decreasing mispronunciation charges for unusual characters from 15.2% to five.3%, whereas delivering extra steady efficiency on long-form textual content.

Fun-CosyVoice3.5-1.avif Fun-CosyVoice3.5-2.avif Fun-CosyVoice3.5-3.avif

By way of reinforcement learning-based fine-tuning, the mannequin improves total naturalness and expressive layering. On the efficiency aspect, its tokenizer body price has been halved, and first-packet latency diminished by 35%, enabling sooner responses and smoother experiences in real-time interplay situations.

Enjoyable-AudioGen-VD, in the meantime, permits customers to generate not solely voices however full auditory scenes primarily based on pure language descriptions—integrating character and setting right into a unified output.

The mannequin helps detailed management over:

  • Fundamental attributes: gender, age, accent, pitch, speech price
  • Timbre qualities: husky, vivid, deep, magnetic
  • Feelings: anger, unhappiness, pleasure, willpower
  • Position simulation: customer support agent, veteran, baby, AI assistant, broadcaster
  • Complicated psychological states: nuanced expressions akin to “calm on the floor however trembling inside”

Fun-AudioGen-VD.avif

Past voice era, Enjoyable-AudioGen-VD can create immersive sound environments, together with layered background noise (city streets, cafés, battlefields), spatial reverb results (cathedrals, metallic cells, underwater acoustics), device-style audio filters (classic radio, walkie-talkie, respiratory masks), and dynamic environmental interactions akin to fluctuating wind noise or shifting echoes.

Collectively, the 2 fashions sign Alibaba’s continued push into controllable, high-fidelity speech and audio era—increasing the boundaries of AI-driven voice interplay and immersive media creation.

Supply: IT House

Elevate your perspective with NextTech Information, the place innovation meets perception.
Uncover the most recent breakthroughs, get unique updates, and join with a world community of future-focused thinkers.
Unlock tomorrow’s traits immediately: learn extra, subscribe to our e-newsletter, and grow to be a part of the NextTech group at NextTech-news.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
NextTech
  • Website

Related Posts

Qwen {Hardware} Head: "One-Sentence Activity Completion" to Drive AI Glasses Demand

March 5, 2026

Moneyboxx Finance Raises ₹33.4 Crore in Fairness to Speed up Progress and Strengthen Capital Base

March 5, 2026

Reworking early studying in India; Captain Contemporary acquires Frime

March 5, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

The Studio Show XDR Is Apple’s Boldest Show Improve Ever

By NextTechMarch 5, 2026

Apple has formally launched the Studio Show XDR, and people who count on the most…

Qwen {Hardware} Head: "One-Sentence Activity Completion" to Drive AI Glasses Demand

March 5, 2026

AI startup Intron expands speech recognition to 57 languages

March 5, 2026
Top Trending

The Studio Show XDR Is Apple’s Boldest Show Improve Ever

By NextTechMarch 5, 2026

Apple has formally launched the Studio Show XDR, and people who count…

Qwen {Hardware} Head: "One-Sentence Activity Completion" to Drive AI Glasses Demand

By NextTechMarch 5, 2026

Track Gang, head of Qwen's AI {hardware} division, mentioned in a latest…

AI startup Intron expands speech recognition to 57 languages

By NextTechMarch 5, 2026

Intron, a Nigerian AI startup that gives speech-to-text and text-to-speech transcription instruments…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

NEXTTECH-LOGO
Facebook X (Twitter) Instagram YouTube

AI & Machine Learning

Robotics & Automation

Space & Deep Tech

Web3 & Digital Economies

Climate & Sustainability Tech

Biotech & Future Health

Mobility & Smart Cities

Global Tech Pulse

Cybersecurity & Digital Rights

Future of Work & Education

Creator Economy & Culture

Trend Radar & Startup Watch

News By Region

Africa

Asia

Europe

Middle East

North America

Oceania

South America

2025 © NextTech-News. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Advertise With Us
  • Write For Us
  • Submit Article & Press Release

Type above and press Enter to search. Press Esc to cancel.

Subscribe For Latest Updates

Sign up to best of Tech news, informed analysis and opinions on what matters to you.

Invalid email address
 We respect your inbox and never send spam. You can unsubscribe from our newsletter at any time.     
Thanks for subscribing!