Close Menu
  • Home
  • Opinion
  • Region
    • Africa
    • Asia
    • Europe
    • Middle East
    • North America
    • Oceania
    • South America
  • AI & Machine Learning
  • Robotics & Automation
  • Space & Deep Tech
  • Web3 & Digital Economies
  • Climate & Sustainability Tech
  • Biotech & Future Health
  • Mobility & Smart Cities
  • Global Tech Pulse
  • Cybersecurity & Digital Rights
  • Future of Work & Education
  • Trend Radar & Startup Watch
  • Creator Economy & Culture
What's Hot

The whole lot Lenovo introduced at MWC 2026, together with foldables and modular laptops

March 3, 2026

Cartridge After Cartridge, Pokémon’s Tiny Sport Boy Jukebox Revives Kanto Tunes

March 3, 2026

Alibaba simply launched Qwen 3.5 Small fashions: a household of 0.8B to 9B parameters constructed for on-device purposes

March 3, 2026
Facebook X (Twitter) Instagram LinkedIn RSS
NextTech NewsNextTech News
Facebook X (Twitter) Instagram LinkedIn RSS
  • Home
  • Africa
  • Asia
  • Europe
  • Middle East
  • North America
  • Oceania
  • South America
  • Opinion
Trending
  • The whole lot Lenovo introduced at MWC 2026, together with foldables and modular laptops
  • Cartridge After Cartridge, Pokémon’s Tiny Sport Boy Jukebox Revives Kanto Tunes
  • Alibaba simply launched Qwen 3.5 Small fashions: a household of 0.8B to 9B parameters constructed for on-device purposes
  • ICAI–RBI MoU Ushers in Actual-Time UDIN Verification, Boosting Transparency and Regulatory Confidence
  • Cursor has reportedly surpassed $2B in annualized income
  • SUPERCentral Launches Enhanced SMSF Options to Assist Australians Take Higher Management of Their Retirement Planning
  • GSK’s next-gen pulmonary hypertension drug
  • Sustainable Spring Cleanup in Ithaca: A Sensible, Native Information to Retaining Renovation Waste Out of Landfills
Tuesday, March 3
NextTech NewsNextTech News
Home - AI & Machine Learning - Interview: From CUDA to Tile-Based mostly Programming: NVIDIA’s Stephen Jones on Constructing the Way forward for AI
AI & Machine Learning

Interview: From CUDA to Tile-Based mostly Programming: NVIDIA’s Stephen Jones on Constructing the Way forward for AI

NextTechBy NextTechDecember 8, 2025No Comments5 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
Interview: From CUDA to Tile-Based mostly Programming: NVIDIA’s Stephen Jones on Constructing the Way forward for AI
Share
Facebook Twitter LinkedIn Pinterest Email


As AI fashions develop in complexity and {hardware} evolves to satisfy the demand, the software program layer connecting the 2 should additionally adapt. We lately sat down with Stephen Jones, a Distinguished Engineer at NVIDIA and one of many unique architects of CUDA.

Jones, whose background spans from fluid mechanics to aerospace engineering, provided deep insights into NVIDIA’s newest software program improvements, together with the shift towards tile-based programming, the introduction of “Inexperienced Contexts,” and the way AI is rewriting the principles of code growth.

Listed below are the important thing takeaways from our dialog.

The Shift to Tile-Based mostly Abstraction

For years, CUDA programming has revolved round a hierarchy of grids, blocks, and threads. With the newest updates, NVIDIA is introducing the next degree of abstraction: CUDA Tile.

In keeping with Jones, this new method permits builders to program on to arrays and tensors slightly than managing particular person threads. “It extends the prevailing CUDA,” Jones defined. “What we’ve completed is we’ve added a approach to speak about and program on to arrays, tensors, vectors of knowledge… permitting the language and the compiler to see what the high-level information was that you simply’re working on opened up a complete realm of latest optimizations”.

This shift is partly a response to the speedy evolution of {hardware}. As Tensor Cores change into bigger and denser to fight the slowing of Moore’s Regulation, the mapping of code to silicon turns into more and more advanced.

  • Future-Proofing: Jones famous that by expressing packages as vector operations (e.g., Tensor A occasions Tensor B), the compiler takes on the heavy lifting of mapping information to the precise {hardware} technology.
  • Stability: This ensures that program construction stays secure even because the underlying GPU structure modifications from Ampere to Hopper to Blackwell.

Python First, However Not Python Solely

Recognizing that Python has change into the lingua franca of Synthetic Intelligence, NVIDIA launched CUDA Tile assist with Python first. “Python’s the language of AI,” Jones acknowledged, including that an array-based illustration is “rather more pure to Python programmers” who’re accustomed to NumPy.

Nonetheless, efficiency purists needn’t fear. C++ assist is arriving subsequent yr, sustaining NVIDIA’s philosophy that builders ought to be capable to speed up their code whatever the language they select.

“Inexperienced Contexts” and Lowering Latency

For engineers deploying Massive Language Fashions (LLMs) in manufacturing, latency and jitter are vital considerations. Jones highlighted a brand new function known as Inexperienced Contexts, which permits for exact partitioning of the GPU.

“Inexperienced contexts helps you to partition the GPU… into completely different sections,” Jones stated. This enables builders to dedicate particular fractions of the GPU to completely different duties, akin to operating pre-fill and decode operations concurrently with out them competing for assets. This micro-level specialization inside a single GPU mirrors the disaggregation seen on the information heart scale.

No Black Bins: The Significance of Tooling

One of many pervasive fears relating to high-level abstractions is the lack of management. Jones, drawing on his expertise as a CUDA consumer within the aerospace business, emphasised that NVIDIA instruments won’t ever be black bins.

“I actually consider that crucial a part of CUDA is the developer instruments,” Jones affirmed. He assured builders that even when utilizing tile-based abstractions, instruments like Nsight Compute will permit inspection right down to the person machine language directions and registers. “You’ve bought to have the ability to tune and debug and optimize… it can’t be a black field,” he added.

Accelerating Time-to-Outcome

In the end, the purpose of those updates is productiveness. Jones described the target as “left shifting” the efficiency curve, enabling builders to achieve 80% of potential efficiency in a fraction of the time.

“If you happen to can come to market [with] 80% of efficiency in every week as a substitute of a month… then you definitely’re spending the remainder of your time simply optimizing,” Jones defined. Crucially, this ease of use doesn’t come at the price of energy; the brand new mannequin nonetheless gives a path to 100% of the height efficiency the silicon can provide.

Conclusion

As AI algorithms and scientific computing converge, NVIDIA is positioning CUDA not simply as a low-level instrument for {hardware} consultants, however as a versatile platform that adapts to the wants of Python builders and HPC researchers alike. With assist extending from Ampere to the upcoming Blackwell and Rubin architectures, these updates promise to streamline growth throughout your complete GPU ecosystem.

For the complete technical particulars on CUDA Tile and Inexperienced Contexts, go to the NVIDIA developer portal.


Jean-marc is a profitable AI enterprise government .He leads and accelerates development for AI powered options and began a pc imaginative and prescient firm in 2006. He’s a acknowledged speaker at AI conferences and has an MBA from Stanford.

🙌 Observe MARKTECHPOST: Add us as a most well-liked supply on Google.

Elevate your perspective with NextTech Information, the place innovation meets perception.
Uncover the newest breakthroughs, get unique updates, and join with a worldwide community of future-focused thinkers.
Unlock tomorrow’s developments immediately: learn extra, subscribe to our e-newsletter, and change into a part of the NextTech group at NextTech-news.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
NextTech
  • Website

Related Posts

Alibaba simply launched Qwen 3.5 Small fashions: a household of 0.8B to 9B parameters constructed for on-device purposes

March 3, 2026

Meet NullClaw: The 678 KB Zig AI Agent Framework Working on 1 MB RAM and Booting in Two Milliseconds

March 2, 2026

FireRedTeam Releases FireRed-OCR-2B Using GRPO to Resolve Structural Hallucinations in Tables and LaTeX for Software program Builders

March 2, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

The whole lot Lenovo introduced at MWC 2026, together with foldables and modular laptops

By NextTechMarch 3, 2026

I am an enormous gamer, so I am at all times on the lookout for…

Cartridge After Cartridge, Pokémon’s Tiny Sport Boy Jukebox Revives Kanto Tunes

March 3, 2026

Alibaba simply launched Qwen 3.5 Small fashions: a household of 0.8B to 9B parameters constructed for on-device purposes

March 3, 2026
Top Trending

The whole lot Lenovo introduced at MWC 2026, together with foldables and modular laptops

By NextTechMarch 3, 2026

I am an enormous gamer, so I am at all times on…

Cartridge After Cartridge, Pokémon’s Tiny Sport Boy Jukebox Revives Kanto Tunes

By NextTechMarch 3, 2026

Pokémon turns 30 this 12 months, and the Pokémon Firm needs to…

Alibaba simply launched Qwen 3.5 Small fashions: a household of 0.8B to 9B parameters constructed for on-device purposes

By NextTechMarch 3, 2026

Alibaba’s Qwen group has launched the Qwen3.5 Small Mannequin Collection, a set…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

NEXTTECH-LOGO
Facebook X (Twitter) Instagram YouTube

AI & Machine Learning

Robotics & Automation

Space & Deep Tech

Web3 & Digital Economies

Climate & Sustainability Tech

Biotech & Future Health

Mobility & Smart Cities

Global Tech Pulse

Cybersecurity & Digital Rights

Future of Work & Education

Creator Economy & Culture

Trend Radar & Startup Watch

News By Region

Africa

Asia

Europe

Middle East

North America

Oceania

South America

2025 © NextTech-News. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Advertise With Us
  • Write For Us
  • Submit Article & Press Release

Type above and press Enter to search. Press Esc to cancel.

Subscribe For Latest Updates

Sign up to best of Tech news, informed analysis and opinions on what matters to you.

Invalid email address
 We respect your inbox and never send spam. You can unsubscribe from our newsletter at any time.     
Thanks for subscribing!