Close Menu
  • Home
  • Opinion
  • Region
    • Africa
    • Asia
    • Europe
    • Middle East
    • North America
    • Oceania
    • South America
  • AI & Machine Learning
  • Robotics & Automation
  • Space & Deep Tech
  • Web3 & Digital Economies
  • Climate & Sustainability Tech
  • Biotech & Future Health
  • Mobility & Smart Cities
  • Global Tech Pulse
  • Cybersecurity & Digital Rights
  • Future of Work & Education
  • Trend Radar & Startup Watch
  • Creator Economy & Culture
What's Hot

Logitech G325 LIGHTSPEED Wi-fi Gaming Headset Evaluate: Light-weight Consolation

April 1, 2026

Elgato’s Stream Deck Plus Lever Defined

April 1, 2026

Samsung Galaxy A57 5G and Galaxy A37 5G arrive in Australia with Enhanced AI Options and Professional-Grade Cameras

April 1, 2026
Facebook X (Twitter) Instagram LinkedIn RSS
NextTech NewsNextTech News
Facebook X (Twitter) Instagram LinkedIn RSS
  • Home
  • Africa
  • Asia
  • Europe
  • Middle East
  • North America
  • Oceania
  • South America
  • Opinion
Trending
  • Logitech G325 LIGHTSPEED Wi-fi Gaming Headset Evaluate: Light-weight Consolation
  • Elgato’s Stream Deck Plus Lever Defined
  • Samsung Galaxy A57 5G and Galaxy A37 5G arrive in Australia with Enhanced AI Options and Professional-Grade Cameras
  • Flow into Capital raises $220M as first shut of Fund II
  • 33 banks elevate ₦3.37 trillion from Nigerians
  • Google’s reported screenless Fitbit band noticed on Stephen Curry
  • The best way to Construct a Manufacturing-Prepared Gemma 3 1B Instruct Technology AI Pipeline with Hugging Face Transformers, Chat Templates, and Colab Inference
  • CORSAIR unveils all-new MM Mousepad Household with first-ever Glass Gaming Mousepad
Wednesday, April 1
NextTech NewsNextTech News
Home - AI & Machine Learning - Liquid AI Launched LFM2.5-350M: A Compact 350M Parameter Mannequin Educated on 28T Tokens with Scaled Reinforcement Studying
AI & Machine Learning

Liquid AI Launched LFM2.5-350M: A Compact 350M Parameter Mannequin Educated on 28T Tokens with Scaled Reinforcement Studying

NextTechBy NextTechApril 1, 2026No Comments4 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
Liquid AI Launched LFM2.5-350M: A Compact 350M Parameter Mannequin Educated on 28T Tokens with Scaled Reinforcement Studying
Share
Facebook Twitter LinkedIn Pinterest Email


Within the present panorama of generative AI, the ‘scaling legal guidelines’ have typically dictated that extra parameters equal extra intelligence. Nevertheless, Liquid AI is difficult this conference with the discharge of LFM2.5-350M. This mannequin is definitely a technical case research in intelligence density with extra pre-training (from 10T to 28T tokens) and large-scale reinforcement studying

The importance of LFM2.5-350M lies in its structure and coaching effectivity. Whereas essentially the most AI firms has been targeted on frontier fashions, Liquid AI is focusing on the ‘edge’—units with restricted reminiscence and compute—by proving {that a} 350-million parameter mannequin can outperform fashions greater than twice its dimension on a number of evaluated benchmarks.

69ca8b787a45a5bfb5a222b8 LFM2.5 350M Benchmarks 1
https://www.liquid.ai/weblog/lfm2-5-350m-no-size-left-behind

Structure: The Hybrid LIV Spine

The core technical differentiator of the LFM2.5-350M is its departure from the pure Transformer structure. It makes use of a hybrid construction constructed on Linear Enter-Various Programs (LIVs).

Conventional Transformers rely totally on self-attention mechanisms, which undergo from quadratic scaling points: because the context window grows, the reminiscence and computational necessities for the Key-Worth (KV) cache improve. Liquid AI addresses this through the use of a hybrid spine consisting of:

  • 10 Double-Gated LIV Convolution Blocks: These deal with nearly all of the sequence processing. LIVs operate equally to superior Recurrent Neural Networks (RNNs) however are designed to be extra parallelizable and secure throughout coaching. They keep a constant-state reminiscence, decreasing the I/O overhead.
  • 6 Grouped Question Consideration (GQA) Blocks: By integrating a small variety of consideration blocks, the mannequin retains high-precision retrieval and long-range context dealing with with out the total reminiscence overhead of an ordinary Transformer.

This hybrid method permits the LFM2.5-350M to help a 32k context window (32,768 tokens) whereas sustaining a particularly lean reminiscence footprint.

Efficiency and Intelligence Density

The LFM2.5-350M was pre-trained on 28 trillion tokens with a particularly excessive training-to-parameter ratio. This ensures that the mannequin’s restricted parameter rely is utilized to its most potential, leading to excessive ‘intelligence density.’

Benchmarks and Use Instances

The LFM2.5-350M is a specialist mannequin designed for high-speed, agentic duties moderately than general-purpose reasoning.

Benchmark Rating
IFEval (Instruction Following) 76.96
GPQA Diamond 30.64
MMLU-Professional 20.01

The excessive IFEval rating signifies the mannequin is environment friendly at following complicated, structured directions, making it appropriate for software use, operate calling, and structured knowledge extraction (e.g., JSON). Nevertheless, the documentation explicitly states that LFM2.5-350M will not be beneficial for arithmetic, complicated coding, or inventive writing. For these duties, the reasoning capabilities of bigger parameter counts stay obligatory.

Screenshot 2026 03 31 at 10.01.19 PM 1Screenshot 2026 03 31 at 10.01.19 PM 1
https://www.liquid.ai/weblog/lfm2-5-350m-no-size-left-behind

{Hardware} Optimization and Inference Effectivity

A serious hurdle for AI devs is the ‘reminiscence wall’—the bottleneck created by transferring knowledge between the processor and reminiscence. As a result of the LFM2.5-350M makes use of LIVs and GQA, it drastically reduces KV cache dimension, boosting throughput. On a single NVIDIA H100 GPU, the mannequin can attain a throughput of 40.4K output tokens per second at excessive concurrency.

Liquid AI crew stories device-specific low-memory inference outcomes that make native deployment viable:

  • Snapdragon 8 Elite NPU: 169MB peak reminiscence utilizing RunAnywhere This autumn.
  • Snapdragon GPU: 81MB peak reminiscence utilizing RunAnywhere This autumn.
  • Raspberry Pi 5: 300MB utilizing Cactus Engine int8.

Key Takeaways

  • Excessive Intelligence Density: By coaching a 350M parameter mannequin on 28 trillion tokens, Liquid AI crew achieved an tremendous excessive 80,000:1 token-to-parameter ratio, permitting it to outperform fashions greater than twice its dimension on a number of benchmarks.
  • Hybrid LIV Structure: The mannequin departs from pure Transformers through the use of Linear Enter-Various Programs (LIVs) mixed with a small variety of Grouped Question Consideration (GQA) blocks, considerably decreasing the reminiscence overhead of the KV cache.
  • Edge-First Effectivity: It’s designed for native deployment with a 32k context window and a remarkably low reminiscence footprint—reaching as little as 81MB on cellular GPUs and 169MB on NPUs through specialised inference engines.
  • Specialised Agentic Functionality: The mannequin is extremely optimized for instruction following (IFEval: 76.96) and power use, although it’s explicitly not beneficial for complicated coding, arithmetic, or inventive writing.
  • Huge Throughput: The architectural effectivity permits high-speed utility, processing as much as 40.4K output tokens per second on a single H100, making it very best for high-volume knowledge extraction and real-time classification.

Try the Technical particulars and Mannequin Weight. Additionally, be at liberty to observe us on Twitter and don’t overlook to affix our 120k+ ML SubReddit and Subscribe to our Publication. Wait! are you on telegram? now you’ll be able to be a part of us on telegram as effectively.


Elevate your perspective with NextTech Information, the place innovation meets perception.
Uncover the newest breakthroughs, get unique updates, and join with a worldwide community of future-focused thinkers.
Unlock tomorrow’s tendencies immediately: learn extra, subscribe to our publication, and change into a part of the NextTech neighborhood at NextTech-news.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
NextTech
  • Website

Related Posts

The best way to Construct a Manufacturing-Prepared Gemma 3 1B Instruct Technology AI Pipeline with Hugging Face Transformers, Chat Templates, and Colab Inference

April 1, 2026

Google AI Releases Veo 3.1 Lite: Giving Builders Low Value Excessive Velocity Video Era by way of The Gemini API

April 1, 2026

Hugging Face Releases TRL v1.0: A Unified Put up-Coaching Stack for SFT, Reward Modeling, DPO, and GRPO Workflows

April 1, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

Logitech G325 LIGHTSPEED Wi-fi Gaming Headset Evaluate: Light-weight Consolation

By NextTechApril 1, 2026

The Logitech G325 LIGHTSPEED Wi-fi Gaming Headset delivers good audio high quality for gaming and…

Elgato’s Stream Deck Plus Lever Defined

April 1, 2026

Samsung Galaxy A57 5G and Galaxy A37 5G arrive in Australia with Enhanced AI Options and Professional-Grade Cameras

April 1, 2026
Top Trending

Logitech G325 LIGHTSPEED Wi-fi Gaming Headset Evaluate: Light-weight Consolation

By NextTechApril 1, 2026

The Logitech G325 LIGHTSPEED Wi-fi Gaming Headset delivers good audio high quality…

Elgato’s Stream Deck Plus Lever Defined

By NextTechApril 1, 2026

Stream Deck units have grow to be lifesavers for all of you…

Samsung Galaxy A57 5G and Galaxy A37 5G arrive in Australia with Enhanced AI Options and Professional-Grade Cameras

By NextTechApril 1, 2026

Samsung has simply introduced the Galaxy A57 5G and Galaxy A37 5G,…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

NEXTTECH-LOGO
Facebook X (Twitter) Instagram YouTube

AI & Machine Learning

Robotics & Automation

Space & Deep Tech

Web3 & Digital Economies

Climate & Sustainability Tech

Biotech & Future Health

Mobility & Smart Cities

Global Tech Pulse

Cybersecurity & Digital Rights

Future of Work & Education

Creator Economy & Culture

Trend Radar & Startup Watch

News By Region

Africa

Asia

Europe

Middle East

North America

Oceania

South America

2025 © NextTech-News. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Advertise With Us
  • Write For Us
  • Submit Article & Press Release

Type above and press Enter to search. Press Esc to cancel.

Subscribe For Latest Updates

Sign up to best of Tech news, informed analysis and opinions on what matters to you.

Invalid email address
 We respect your inbox and never send spam. You can unsubscribe from our newsletter at any time.     
Thanks for subscribing!