Close Menu
  • Home
  • Opinion
  • Region
    • Africa
    • Asia
    • Europe
    • Middle East
    • North America
    • Oceania
    • South America
  • AI & Machine Learning
  • Robotics & Automation
  • Space & Deep Tech
  • Web3 & Digital Economies
  • Climate & Sustainability Tech
  • Biotech & Future Health
  • Mobility & Smart Cities
  • Global Tech Pulse
  • Cybersecurity & Digital Rights
  • Future of Work & Education
  • Trend Radar & Startup Watch
  • Creator Economy & Culture
What's Hot

Egypt Made a Shock Cameo in Dangerous Bunny’s Tremendous Bowl Present

February 12, 2026

Payd, Noah launch stablecoin funds for African freelancers

February 12, 2026

405,000 Singaporeans earn S$10K per 30 days or extra

February 12, 2026
Facebook X (Twitter) Instagram LinkedIn RSS
NextTech NewsNextTech News
Facebook X (Twitter) Instagram LinkedIn RSS
  • Home
  • Africa
  • Asia
  • Europe
  • Middle East
  • North America
  • Oceania
  • South America
  • Opinion
Trending
  • Egypt Made a Shock Cameo in Dangerous Bunny’s Tremendous Bowl Present
  • Payd, Noah launch stablecoin funds for African freelancers
  • 405,000 Singaporeans earn S$10K per 30 days or extra
  • Motorola Razr FIFA World Cup 26 Version now accessible in Canada
  • Voice is Africa’s gateway to AI, and Google needs to guide it
  • How a 17-year-old is utilizing OpenAI instruments to make animal healthcare smarter
  • Organigram. Purchase, Promote or Maintain?
  • z.ai's open supply GLM-5 achieves document low hallucination price and leverages new RL 'slime' approach
Thursday, February 12
NextTech NewsNextTech News
Home - Space & Deep Tech - z.ai's open supply GLM-5 achieves document low hallucination price and leverages new RL 'slime' approach
Space & Deep Tech

z.ai's open supply GLM-5 achieves document low hallucination price and leverages new RL 'slime' approach

NextTechBy NextTechFebruary 12, 2026No Comments7 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
z.ai's open supply GLM-5 achieves document low hallucination price and leverages new RL 'slime' approach
Share
Facebook Twitter LinkedIn Pinterest Email



Chinese language AI startup Zhupai aka z.ai is again this week with an eye-popping new frontier massive language mannequin: GLM-5.

The newest in z.ai's ongoing and regularly spectacular GLM sequence, it retains an open supply MIT License — good for enterprise deployment – and, in considered one of a number of notable achievements, achieves a record-low hallucination price on the impartial Synthetic Evaluation Intelligence Index v4.0.

With a rating of -1 on the AA-Omniscience Index—representing an enormous 35-point enchancment over its predecessor—GLM-5 now leads your entire AI business, together with U.S. rivals like Google, OpenAI and Anthropic, in information reliability by understanding when to abstain slightly than fabricate data.

Past its reasoning prowess, GLM-5 is constructed for high-utility information work. It options native "Agent Mode" capabilities that permit it to show uncooked prompts or supply supplies immediately into skilled workplace paperwork, together with ready-to-use .docx, .pdf, and .xlsx recordsdata.

Whether or not producing detailed monetary stories, highschool sponsorship proposals, or complicated spreadsheets, GLM-5 delivers ends in real-world codecs that combine immediately into enterprise workflows.

It’s also disruptively priced at roughly $0.80 per million enter tokens and $2.56 per million output tokens, roughly 6x cheaper than proprietary rivals like Claude Opus 4.6, making state-of-the-art agentic engineering less expensive than ever earlier than. Right here's what else enterprise determination makers ought to know in regards to the mannequin and its coaching.

Know-how: scaling for agentic effectivity

On the coronary heart of GLM-5 is an enormous leap in uncooked parameters. The mannequin scales from the 355B parameters of GLM-4.5 to a staggering 744B parameters, with 40B lively per token in its Combination-of-Specialists (MoE) structure. This development is supported by a rise in pre-training knowledge to twenty-eight.5T tokens.

To handle coaching inefficiencies at this magnitude, Zai developed "slime," a novel asynchronous reinforcement studying (RL) infrastructure.

Conventional RL usually suffers from "long-tail" bottlenecks; Slime breaks this lockstep by permitting trajectories to be generated independently, enabling the fine-grained iterations vital for complicated agentic conduct.

By integrating system-level optimizations like Lively Partial Rollouts (APRIL), slime addresses the era bottlenecks that usually eat over 90% of RL coaching time, considerably accelerating the iteration cycle for complicated agentic duties.

The framework’s design is centered on a tripartite modular system: a high-performance coaching module powered by Megatron-LM, a rollout module using SGLang and customized routers for high-throughput knowledge era, and a centralized Knowledge Buffer that manages immediate initialization and rollout storage.

By enabling adaptive verifiable environments and multi-turn compilation suggestions loops, slime gives the strong, high-throughput basis required to transition AI from easy chat interactions towards rigorous, long-horizon techniques engineering.

To maintain deployment manageable, GLM-5 integrates DeepSeek Sparse Consideration (DSA), preserving a 200K context capability whereas drastically decreasing prices.

Finish-to-end information work

Zai is framing GLM-5 as an "workplace" instrument for the AGI period. Whereas earlier fashions centered on snippets, GLM-5 is constructed to ship ready-to-use paperwork.

It could autonomously remodel prompts into formatted .docx, .pdf, and .xlsx recordsdata—starting from monetary stories to sponsorship proposals.

In observe, this implies the mannequin can decompose high-level objectives into actionable subtasks and carry out "Agentic Engineering," the place people outline high quality gates whereas the AI handles execution.

Excessive efficiency

GLM-5’s benchmarks make it the brand new strongest open supply mannequin on this planet, in line with Synthetic Evaluation, surpassing Chinese language rival Moonshot's new Kimi K2.5 launched simply two weeks in the past, exhibiting that Chinese language AI firms are practically caught up with much better resourced proprietary Western rivals.

Based on z.ai's personal supplies shared right now, GLM-5 ranks close to state-of-the-art on a number of key benchmarks:

SWE-bench Verified: GLM-5 achieved a rating of 77.8, outperforming Gemini 3 Professional (76.2) and approaching Claude Opus 4.6 (80.9).

Merchandising Bench 2: In a simulation of working a enterprise, GLM-5 ranked #1 amongst open-source fashions with a closing stability of $4,432.12.

Past efficiency, GLM-5 is aggressively undercutting the market. Stay on OpenRouter as of February 11, 2026, it’s priced at roughly $0.80–$1.00 per million enter tokens and $2.56–$3.20 per million output tokens. It falls within the mid-range in comparison with different main LLMs, however based mostly on its top-tier bechmarking efficiency, it's what one would possibly name a "steal."

Mannequin

Enter (per 1M tokens)

Output (per 1M tokens)

Complete Value (1M in + 1M out)

Supply

Qwen 3 Turbo

$0.05

$0.20

$0.25

Alibaba Cloud

Grok 4.1 Quick (reasoning)

$0.20

$0.50

$0.70

xAI

Grok 4.1 Quick (non-reasoning)

$0.20

$0.50

$0.70

xAI

deepseek-chat (V3.2-Exp)

$0.28

$0.42

$0.70

DeepSeek

deepseek-reasoner (V3.2-Exp)

$0.28

$0.42

$0.70

DeepSeek

Gemini 3 Flash Preview

$0.50

$3.00

$3.50

Google

Kimi-k2.5

$0.60

$3.00

$3.60

Moonshot

GLM-5

$1.00

$3.20

$4.20

Z.ai

ERNIE 5.0

$0.85

$3.40

$4.25

Qianfan

Claude Haiku 4.5

$1.00

$5.00

$6.00

Anthropic

Qwen3-Max (2026-01-23)

$1.20

$6.00

$7.20

Alibaba Cloud

Gemini 3 Professional (≤200K)

$2.00

$12.00

$14.00

Google

GPT-5.2

$1.75

$14.00

$15.75

OpenAI

Claude Sonnet 4.5

$3.00

$15.00

$18.00

Anthropic

Gemini 3 Professional (>200K)

$4.00

$18.00

$22.00

Google

Claude Opus 4.6

$5.00

$25.00

$30.00

Anthropic

GPT-5.2 Professional

$21.00

$168.00

$189.00

OpenAI

That is roughly 6x cheaper on enter and practically 10x cheaper on output than Claude Opus 4.6 ($5/$25). This launch confirms rumors that Zhipu AI was behind "Pony Alpha," a stealth mannequin that beforehand crushed coding benchmarks on OpenRouter.

Nevertheless, regardless of the excessive benchmarks and low price, not all early customers are enthusiastic in regards to the mannequin, noting its excessive efficiency doesn't inform the entire story.

Lukas Petersson, co-founder of the safety-focused autonomous AI protocol startup Andon Labs, remarked on X: "After hours of studying GLM-5 traces: an extremely efficient mannequin, however far much less situationally conscious. Achieves objectives by way of aggressive ways however doesn't motive about its scenario or leverage expertise. That is scary. That is the way you get a paperclip maximizer."

The "paperclip maximizer" refers to a hypothetical scenario described by Oxford thinker Nick Bostrom again in 2003, during which an AI or different autonomous creation unintentionally results in an apocalyptic situation or human extinction by following a seemingly benign instruction — like maximizing the variety of paperclips produced — to an excessive diploma, redirecting all sources vital for human (or different life) or in any other case making life unimaginable by means of its dedication to fulfilling the seemingly benign goal.

Ought to your enterprise undertake GLM-5?

Enterprises in search of to flee vendor lock-in will discover GLM-5’s MIT License and open-weights availability a big strategic benefit. Not like closed-source rivals that maintain intelligence behind proprietary partitions, GLM-5 permits organizations to host their very own frontier-level intelligence.

Adoption will not be with out friction. The sheer scale of GLM-5—744B parameters—requires an enormous {hardware} ground which may be out of attain for smaller corporations with out vital cloud or on-premise GPU clusters.

Safety leaders should weigh the geopolitical implications of a flagship mannequin from a China-based lab, particularly in regulated industries the place knowledge residency and provenance are strictly audited.

Moreover, the shift towards extra autonomous AI brokers introduces new governance dangers. As fashions transfer from "chat" to "work," they start to function throughout apps and recordsdata autonomously. With out the strong agent-specific permissions and human-in-the-loop high quality gates established by enterprise knowledge leaders, the danger of autonomous error will increase exponentially.

In the end, GLM-5 is a "purchase" for organizations which have outgrown easy copilots and are able to construct a very autonomous workplace.

It’s for engineers who must refactor a legacy backend or requires a "self-healing" pipeline that doesn't sleep.

Whereas Western labs proceed to optimize for "Considering" and reasoning depth, Zai is optimizing for execution and scale.

Enterprises that undertake GLM-5 right now aren’t simply shopping for a less expensive mannequin; they’re betting on a future the place essentially the most useful AI is the one that may end the mission with out being requested twice.

Elevate your perspective with NextTech Information, the place innovation meets perception.
Uncover the newest breakthroughs, get unique updates, and join with a world community of future-focused thinkers.
Unlock tomorrow’s developments right now: learn extra, subscribe to our e-newsletter, and change into a part of the NextTech neighborhood at NextTech-news.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
NextTech
  • Website

Related Posts

CareerSprinter Professional combines résumé and interview instruments for $49.99

February 12, 2026

How one can Set Up an Apple Look ahead to Your Children (2026)

February 11, 2026

Astrophotography Improve: Stepping As much as CMOS

February 11, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

Egypt Made a Shock Cameo in Dangerous Bunny’s Tremendous Bowl Present

By NextTechFebruary 12, 2026

  Dangerous Bunny’s headline efficiency on the Tremendous Bowl LX halftime present has obtained widespread…

Payd, Noah launch stablecoin funds for African freelancers

February 12, 2026

405,000 Singaporeans earn S$10K per 30 days or extra

February 12, 2026
Top Trending

Egypt Made a Shock Cameo in Dangerous Bunny’s Tremendous Bowl Present

By NextTechFebruary 12, 2026

  Dangerous Bunny’s headline efficiency on the Tremendous Bowl LX halftime present…

Payd, Noah launch stablecoin funds for African freelancers

By NextTechFebruary 12, 2026

Payd, a Kenyan-born pan-African fintech startup, has partnered with UK-based funds infrastructure…

405,000 Singaporeans earn S$10K per 30 days or extra

By NextTechFebruary 12, 2026

Disclaimer: Except in any other case said, any opinions expressed under belong…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

NEXTTECH-LOGO
Facebook X (Twitter) Instagram YouTube

AI & Machine Learning

Robotics & Automation

Space & Deep Tech

Web3 & Digital Economies

Climate & Sustainability Tech

Biotech & Future Health

Mobility & Smart Cities

Global Tech Pulse

Cybersecurity & Digital Rights

Future of Work & Education

Creator Economy & Culture

Trend Radar & Startup Watch

News By Region

Africa

Asia

Europe

Middle East

North America

Oceania

South America

2025 © NextTech-News. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Advertise With Us
  • Write For Us
  • Submit Article & Press Release

Type above and press Enter to search. Press Esc to cancel.

Subscribe For Latest Updates

Sign up to best of Tech news, informed analysis and opinions on what matters to you.

Invalid email address
 We respect your inbox and never send spam. You can unsubscribe from our newsletter at any time.     
Thanks for subscribing!