AMD unveiled its comprehensive end-to-end integrated AI platform vision and introduced its open, scalable rack-scale AI infrastructure built on industry standards at its annual Advancing AI event.
The Santa Clara, California-based chip maker announced its new AMD Instinct MI350 Series accelerators, which are four times faster on AI compute and 35 times faster on inferencing than prior chips.
AMD and its partners showcased AMD Instinct-based products and the continued growth of the AMD ROCm ecosystem. It also showed its powerful, new, open rack-scale designs and a roadmap that brings leadership rack-scale AI performance beyond 2027.
“We can now say we’re at the inference inflection point, and it will be the driver,” said Lisa Su, CEO of AMD, in a keynote at the Advancing AI event.
In closing, in a jab at Nvidia, she said, “The future of AI is not going to be built by any one company or within a closed system. It will be shaped by open collaboration across the industry with everyone bringing their best ideas.”
AMD unveiled the Instinct MI350 Series GPUs, setting a new benchmark for performance, efficiency and scalability in generative AI and high-performance computing. The MI350 Series, consisting of both Instinct MI350X and MI355X GPUs and platforms, delivers a four times generation-on-generation AI compute increase and a 35 times generational leap in inferencing, paving the way for transformative AI solutions across industries.
“We’re tremendously excited about the work you’re doing at AMD,” said Sam Altman, CEO of OpenAI, on stage with Lisa Su.
He said he couldn’t believe it when he heard about the specs for MI350 from AMD, and he was grateful that AMD took his company’s feedback.

AMD demonstrated end-to-end, open-standards rack-scale AI infrastructure, which is already rolling out with AMD Instinct MI350 Series accelerators, 5th Gen AMD Epyc processors and AMD Pensando Pollara network interface cards (NICs) in hyperscaler deployments such as Oracle Cloud Infrastructure (OCI) and is set for broad availability in 2H 2025. AMD also previewed its next-generation AI rack, called Helios.
It will be built on the next-generation AMD Instinct MI400 Series GPUs, the Zen 6-based AMD Epyc Venice CPUs and AMD Pensando Vulcano NICs.
“I think they’re targeting a different type of customer than Nvidia,” said Ben Bajarin, analyst at Creative Strategies, in a message to GamesBeat. “Specifically I think they see the neocloud opportunity and a whole host of tier two and tier three clouds and the on-premise enterprise deployments.”
Bajarin added, “We’re bullish on the shift to full rack deployment systems and that’s where Helios fits in, which will align with Rubin timing. But as the market shifts to inference, which we’re just at the start of, AMD is well positioned to compete to capture share. I also think there are lots of customers out there who will value AMD’s TCO where right now Nvidia may be overkill for their workloads. So that is an area to watch, which again gets back to who the right customer is for AMD, and it might be a very different customer profile than the customer for Nvidia.”
The latest version of the AMD open-source AI software stack, ROCm 7, is engineered to meet the growing demands of generative AI and high-performance computing workloads while dramatically improving developer experience across the board. (ROCm, or Radeon Open Compute, is an open-source software platform that allows for GPU-accelerated computing on AMD GPUs, particularly for high-performance computing and AI workloads.) ROCm 7 features improved support for industry-standard frameworks, expanded hardware compatibility, and new development tools, drivers, APIs and libraries to accelerate AI development and deployment.
In her keynote, Su said, “Openness should be more than just a buzzword.”
The Instinct MI350 Series exceeded AMD’s five-year goal to improve the energy efficiency of AI training and high-performance computing nodes by 30 times, ultimately delivering a 38 times improvement. AMD also unveiled a new 2030 goal to deliver a 20 times increase in rack-scale energy efficiency from a 2024 base year, enabling a typical AI model that today requires more than 275 racks to be trained in fewer than one fully utilized rack by 2030, using 95% less electricity.
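The 95% figure is consistent with the 20 times goal if electricity use for a fixed workload is assumed to fall in inverse proportion to energy efficiency. The short Python sketch below shows that arithmetic; the function name and the inverse-proportion assumption are illustrative and do not come from AMD, and the sketch does not attempt to reproduce the 275-rack consolidation figure.

```python
def percent_energy_saved(efficiency_gain: float) -> float:
    """Percent reduction in energy for the same workload.

    Assumption (ours, not AMD's): energy used scales as 1 / efficiency_gain,
    so the saving is (gain - 1) / gain.
    """
    if efficiency_gain <= 0:
        raise ValueError("efficiency_gain must be positive")
    return 100.0 * (efficiency_gain - 1.0) / efficiency_gain


if __name__ == "__main__":
    # 2030 goal quoted in the article: 20 times rack-scale energy efficiency.
    print(f"{percent_energy_saved(20):.1f}%")  # prints 95.0%
```

Under the same assumption, a 20 times gain leaves one-twentieth, or 5%, of the original electricity use, which matches the 95% reduction AMD cites.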
AMD also announced the broad availability of the AMD Developer Cloud for the global developer and open-source communities. Purpose-built for rapid, high-performance AI development, it gives users access to a fully managed cloud environment with the tools and flexibility to get started with AI projects and grow without limits. With ROCm 7 and the AMD Developer Cloud, AMD is lowering barriers and expanding access to next-gen compute. Strategic collaborations with leaders like Hugging Face, OpenAI and Grok are proving the power of co-developed, open solutions. The announcement got some cheers from folks in the audience, as the company said it would give attendees developer credits.
Broad Partner Ecosystem Showcases AI Progress Powered by AMD

AMD customers discussed how they are using AMD AI solutions to train today’s leading AI models, power inference at scale and accelerate AI exploration and development.
Meta detailed how it has leveraged multiple generations of AMD Instinct and Epyc solutions across its data center infrastructure, with Instinct MI300X broadly deployed for Llama 3 and Llama 4 inference. Meta continues to collaborate closely with AMD on AI roadmaps, including plans to leverage MI350 and MI400 Series GPUs and platforms.
Oracle Cloud Infrastructure is among the first industry leaders to adopt the AMD open rack-scale AI infrastructure with AMD Instinct MI355X GPUs. OCI leverages AMD CPUs and GPUs to deliver balanced, scalable performance for AI clusters, and announced it will offer zettascale AI clusters accelerated by the latest AMD Instinct processors, with up to 131,072 MI355X GPUs, to enable customers to build, train, and inference AI at scale.

Microsoft announced Instinct MI300X is now powering both proprietary and open-source models in production on Azure.
HUMAIN discussed its landmark agreement with AMD to build open, scalable, resilient and cost-efficient AI infrastructure leveraging the full spectrum of computing platforms only AMD can provide.
Cohere shared that its high-performance, scalable Command models are deployed on Instinct MI300X, powering enterprise-grade LLM inference with high throughput, efficiency and data privacy.
In the keynote, Red Hat described how its expanded collaboration with AMD enables production-ready AI environments, with AMD Instinct GPUs on Red Hat OpenShift AI delivering powerful, efficient AI processing across hybrid cloud environments.
“They can get the most out of the hardware they’re using,” said the Red Hat exec on stage.
Astera Labs highlighted how the open UALink ecosystem accelerates innovation and delivers greater value to customers, and shared plans to offer a comprehensive portfolio of UALink products to support next-generation AI infrastructure.
Marvell joined AMD to share the UALink switch roadmap, the first truly open interconnect, bringing the ultimate flexibility for AI infrastructure.

