Close Menu
  • Home
  • Opinion
  • Region
    • Africa
    • Asia
    • Europe
    • Middle East
    • North America
    • Oceania
    • South America
  • AI & Machine Learning
  • Robotics & Automation
  • Space & Deep Tech
  • Web3 & Digital Economies
  • Climate & Sustainability Tech
  • Biotech & Future Health
  • Mobility & Smart Cities
  • Global Tech Pulse
  • Cybersecurity & Digital Rights
  • Future of Work & Education
  • Trend Radar & Startup Watch
  • Creator Economy & Culture
What's Hot

What is going to increased oil costs do to Canada’s financial system?

March 12, 2026

Zoox now testing robotaxis in 10 cities

March 12, 2026

Restoring surgeons’ sense of contact with robotic fingertips

March 12, 2026
Facebook X (Twitter) Instagram LinkedIn RSS
NextTech NewsNextTech News
Facebook X (Twitter) Instagram LinkedIn RSS
  • Home
  • Africa
  • Asia
  • Europe
  • Middle East
  • North America
  • Oceania
  • South America
  • Opinion
Trending
  • What is going to increased oil costs do to Canada’s financial system?
  • Zoox now testing robotaxis in 10 cities
  • Restoring surgeons’ sense of contact with robotic fingertips
  • What to Do in Dumbo If You’re Right here for Enterprise (2026)
  • Why an LMS for Manufacturing Firms Is Important for Workforce Upskilling
  • From Forklifts to Freight: Truck1 Helps IntraLogisteX 2026 as a Media Associate
  • French insurtech Alan surges to €5bn valuation mark
  • Burning Plastic Isn’t Renewable: Rethinking Waste & Energy In Hawaii
Thursday, March 12
NextTech NewsNextTech News
Home - Space & Deep Tech - Baidu unveils proprietary ERNIE 5 beating GPT-5 efficiency on charts, doc understanding and extra
Space & Deep Tech

Baidu unveils proprietary ERNIE 5 beating GPT-5 efficiency on charts, doc understanding and extra

NextTechBy NextTechNovember 13, 2025No Comments8 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
Baidu unveils proprietary ERNIE 5 beating GPT-5 efficiency on charts, doc understanding and extra
Share
Facebook Twitter LinkedIn Pinterest Email



Mere hours after OpenAI up to date its flagship basis mannequin GPT-5 to GPT-5.1, promising decreased token utilization general and a extra nice character with extra preset choices, Chinese language search big Baidu unveiled its next-generation basis mannequin, ERNIE 5.0, alongside a collection of AI product upgrades and strategic worldwide expansions.

The objective: to place as a worldwide contender within the more and more aggressive enterprise AI market.

Introduced on the firm's Baidu World 2025 occasion, ERNIE 5.0 is a proprietary, natively omni-modal mannequin designed to collectively course of and generate content material throughout textual content, photos, audio, and video.

Not like Baidu’s lately launched ERNIE-4.5-VL-28B-A3B-Considering, which is open supply beneath an enterprise-friendly and permissive Apache 2.0 license, ERNIE 5.0 is a proprietary mannequin and is accessible solely by way of Baidu’s ERNIE Bot web site (I wanted to pick out it manuallyu from the mannequin picker dropdown) and the Qianfan cloud platform software programming interface (API) for enterprise prospects.

Alongside the mannequin launch, Baidu launched main updates to its digital human platform, no-code instruments, and general-purpose AI brokers — all focused at increasing its AI footprint past China.

The corporate additionally launched ERNIE 5.0 Preview 1022, a variant optimized for text-intensive duties, alongside the overall preview mannequin that balances throughout modalities.

Baidu emphasised that ERNIE 5.0 represents a shift in how intelligence is deployed at scale, with CEO Robin Li stating: “While you internalize AI, it turns into a local functionality and transforms intelligence from a price right into a supply of productiveness.”

The place ERNIE 5.0 outshines GPT-5 and Gemini 2.5 Professional

ERNIE 5.0’s benchmark outcomes recommend that Baidu has achieved parity—or near-parity—with the highest Western basis fashions throughout a large spectrum of duties.

In public benchmark slides shared in the course of the Baidu World 2025 occasion, ERNIE 5.0 Preview outperformed or matched OpenAI’s GPT-5-Excessive and Google’s Gemini 2.5 Professional in multimodal reasoning, doc understanding, and image-based QA, whereas additionally demonstrating sturdy language modeling and code execution talents.

The corporate emphasised its potential to deal with joint inputs and outputs throughout modalities, quite than counting on post-hoc modality fusion, which it framed as a technical differentiator.

On visible duties, ERNIE 5.0 achieved main scores on OCRBench, DocVQA, and ChartQA, three benchmarks that check doc recognition, comprehension, and structured information reasoning.

Baidu claims the mannequin beat each GPT-5-Excessive and Gemini 2.5 Professional on these doc and chart-based benchmarks, areas it describes as core to enterprise purposes like automated doc processing and monetary evaluation.

In picture technology, ERNIE 5.0 tied or exceeded Google’s Veo3 throughout classes together with semantic alignment and picture high quality, in response to Baidu’s inner GenEval-based analysis. Baidu claimed that the mannequin’s multimodal integration permits it to generate and interpret visible content material with better contextual consciousness than fashions counting on modality-specific encoders.

For audio and speech duties, ERNIE 5.0 demonstrated aggressive outcomes on MM-AU and TUT2017 audio understanding benchmarks, in addition to query answering from spoken language inputs. Its audio efficiency, whereas not as closely emphasised as imaginative and prescient or textual content, suggests a broad functionality footprint supposed to assist full-spectrum multimodal purposes.

In language duties, the mannequin confirmed sturdy outcomes on instruction following, factual query answering, and mathematical reasoning—core areas that outline the enterprise utility of huge language fashions.

The Preview 1022 variant of ERNIE 5.0, tailor-made for textual efficiency, confirmed even stronger language-specific ends in early developer entry. Whereas Baidu doesn’t declare broad superiority basically language reasoning, its inner evaluations recommend that ERNIE 5.0 Preview 1022 closes the hole with top-tier English-language fashions and outperforms them in Chinese language-language efficiency.

Whereas Baidu didn’t launch full benchmark particulars or uncooked scores publicly, its efficiency positioning suggests a deliberate try to border ERNIE 5.0 not as a distinct segment multimodal system however as a flagship mannequin aggressive with the most important closed fashions in general-purpose reasoning.

The place Baidu claims a transparent lead is in structured doc understanding, visible chart reasoning, and integration of a number of modalities right into a single, native modeling structure. Unbiased verification of those outcomes stays pending, however the breadth of claimed capabilities positions ERNIE 5.0 as a critical different within the multimodal basis mannequin panorama.

Enterprise Pricing Technique

ERNIE 5.0 is positioned on the premium finish of Baidu’s mannequin pricing construction. The corporate has launched particular pricing for API utilization on its Qianfan platform, aligning the price with different top-tier choices from Chinese language opponents like Alibaba.

Mannequin

Enter Value (per 1K tokens)

Output Value (per 1K tokens)

Supply

ERNIE 5.0

$0.00085 (¥0.006)

$0.0034 (¥0.024)

Qianfan

ERNIE 4.5 Turbo (ex.)

$0.00011 (¥0.0008)

$0.00045 (¥0.0032)

Qianfan

Qwen3 (Coder ex.)

$0.00085 (¥0.006)

$0.0034 (¥0.024)

Qianfan

The distinction in price between ERNIE 5.0 and earlier fashions similar to ERNIE 4.5 Turbo underscores Baidu’s technique to differentiate between high-volume, low-cost fashions and high-capability fashions designed for advanced duties and multimodal reasoning.

In comparison with different U.S. alternate options, it stays mid-range in pricing:

Mannequin

Enter (/1 M tokens)

Output (/1 M tokens)

Supply

GPT-5.1

$1.25

$10.00

OpenAI

ERNIE 5.0

$0.85

$3.40

Qianfan

ERNIE 4.5 Turbo (ex.)

$0.11

$0.45

Qianfan

Claude Opus 4.1

$15.00

$75.00

Anthropic

Gemini 2.5 Professional

$1.25 (≤200k) / $2.50 (>200k)

$10.00 (≤200k) / $15.00 (>200k)

Google Vertex AI Pricing

Grok 4 (grok-4-0709)

$3.00

$15.00

xAI API

International Enlargement: Merchandise and Platforms

In tandem with the mannequin launch, Baidu is increasing internationally:

  • GenFlow 3.0, now with 20M+ customers, is the corporate’s largest general-purpose AI agent and options enhanced reminiscence and multimodal activity dealing with.

  • Famou, a self-evolving agent able to dynamically fixing advanced issues, is now commercially accessible by way of invite.

  • MeDo, the worldwide model of Baidu’s no-code builder Miaoda, is stay globally by way of medo.dev.

  • Oreate, a productiveness workspace with doc, slide, picture, video, and podcast assist, has reached over 1.2M customers worldwide.

Baidu’s digital human platform, already rolled out in Brazil, can also be a part of the worldwide push. In accordance with firm information, 83% of livestreamers throughout this yr’s “Double 11” purchasing occasion in China used Baidu’s digital human tech, contributing to a 91% improve in GMV.

In the meantime, Baidu’s autonomous ride-hailing service Apollo Go has surpassed 17 million rides, working driverless fleets in 22 cities and claiming the title of the world’s largest robotaxi community.

Open-Supply Imaginative and prescient-Language Mannequin Garners Business Consideration

Two days earlier than the flagship ERNIE 5.0 occasion, Baidu additionally launched an open-source multimodal mannequin beneath the Apache 2.0 license: ERNIE-4.5-VL-28B-A3B-Considering.

As reported by my colleague Michael Nuñez at VentureBeat, the mannequin prompts simply 3 billion parameters whereas sustaining a complete of 28 billion, utilizing a Combination-of-Specialists (MoE) structure for environment friendly inference.

Key technical improvements embody:

  • “Considering with Pictures”, which allows dynamic zoom-based visible evaluation

  • Assist for chart interpretation, doc understanding, visible grounding, and temporal consciousness in video

  • Runtime on a single 80GB GPU, making it accessible to mid-sized organizations

  • Full compatibility with Transformers, vLLM, and Baidu’s FastDeploy toolkits

This launch provides stress on closed-source opponents. With Apache 2.0 licensing, ERNIE-4.5-VL-28B-A3B-Considering turns into a viable basis mannequin for business purposes with out licensing restrictions — one thing few high-performing fashions on this class provide.

Group Suggestions and Baidu’s Response

Following the launch of ERNIE 5.0, developer and AI evaluator Lisan al Gaib (@scaling01) posted a combined evaluation on X. Whereas initially impressed by the mannequin’s benchmark efficiency, they reported a persistent challenge the place ERNIE 5.0 would repeatedly invoke instruments — even when explicitly instructed to not — throughout SVG technology duties.

“ERNIE 5.0 benchmarks appeared insane till I examined it… sadly it’s RL braindamaged or they’ve a critical challenge with their chat platform / system immediate,” Lisan wrote.

In a matter of hours, Baidu’s developer-focused assist account, @ErnieforDevs, responded:

“Thanks for the suggestions! It’s a identified bug — sure syntax can constantly set off it. We’re engaged on a repair. You’ll be able to attempt rephrasing or altering the immediate to keep away from it for now.”

The fast turnaround displays Baidu’s growing emphasis on developer communication, particularly because it courts worldwide customers by means of each proprietary and open-source choices.

Outlook for Baidu and its ERNIE foundational LLM household

Baidu’s ERNIE 5.0 marks a strategic escalation within the world basis mannequin race. With efficiency claims that put it on par with probably the most superior techniques from OpenAI and Google, and a mixture of premium pricing and open-access alternate options, Baidu is signaling its ambition to change into not only a home AI chief, however a reputable world infrastructure supplier.

At a time when enterprise AI customers are more and more demanding multimodal efficiency, versatile licensing, and deployment effectivity, Baidu’s two-track strategy—premium hosted APIs and open-source releases—might broaden its enchantment throughout each company and developer communities.

Whether or not the corporate’s efficiency claims maintain up beneath third-party testing stays to be seen. However in a panorama formed by rising prices, mannequin complexity, and compute bottlenecks, ERNIE 5.0 and its supporting ecosystem give Baidu a aggressive place within the subsequent wave of AI deployment.

Elevate your perspective with NextTech Information, the place innovation meets perception.
Uncover the most recent breakthroughs, get unique updates, and join with a worldwide community of future-focused thinkers.
Unlock tomorrow’s tendencies at the moment: learn extra, subscribe to our publication, and change into a part of the NextTech neighborhood at NextTech-news.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
NextTech
  • Website

Related Posts

What to Do in Dumbo If You’re Right here for Enterprise (2026)

March 12, 2026

Searching for Supermassive Black Gap Binaries with a Flash of Starlight

March 11, 2026

Jupiter’s moons go away chilly ‘footprints’ within the planet’s auroras, James Webb House Telescope finds

March 11, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

What is going to increased oil costs do to Canada’s financial system?

By NextTechMarch 12, 2026

A current surge in oil costs tied to geopolitical tensions within the Center East is…

Zoox now testing robotaxis in 10 cities

March 12, 2026

Restoring surgeons’ sense of contact with robotic fingertips

March 12, 2026
Top Trending

What is going to increased oil costs do to Canada’s financial system?

By NextTechMarch 12, 2026

A current surge in oil costs tied to geopolitical tensions within the…

Zoox now testing robotaxis in 10 cities

By NextTechMarch 12, 2026

The Amazon subsidiary added Dallas and Phoenix and is utilizing “a whole…

Restoring surgeons’ sense of contact with robotic fingertips

By NextTechMarch 12, 2026

By Anthony King Fashionable surgical procedure has gone from lengthy incisions to…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

NEXTTECH-LOGO
Facebook X (Twitter) Instagram YouTube

AI & Machine Learning

Robotics & Automation

Space & Deep Tech

Web3 & Digital Economies

Climate & Sustainability Tech

Biotech & Future Health

Mobility & Smart Cities

Global Tech Pulse

Cybersecurity & Digital Rights

Future of Work & Education

Creator Economy & Culture

Trend Radar & Startup Watch

News By Region

Africa

Asia

Europe

Middle East

North America

Oceania

South America

2025 © NextTech-News. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Advertise With Us
  • Write For Us
  • Submit Article & Press Release

Type above and press Enter to search. Press Esc to cancel.

Subscribe For Latest Updates

Sign up to best of Tech news, informed analysis and opinions on what matters to you.

Invalid email address
 We respect your inbox and never send spam. You can unsubscribe from our newsletter at any time.     
Thanks for subscribing!