AMD Next Horizon Event Live Coverage AMD EPYC 2 Details

AMD Next Horizon Cover

Today we are in San Francisco, California at the AMD Next Horizon event. We are going to update our coverage live. We expect to hear more about AMD EPYC “Rome” generation CPUs and the next generation AMD Radeon Instinct GPUs. We had a hint that both will be 7nm products that we expect to launch in the coming months.

Opening Comments From Dr. Lisa Su

Today’s focus is going to be on the data center and how AMD will shape the data center.

AMD is pushing all-new design approaches to CPU and GPU architectures, not competing head-on with legacy Intel monolithic designs. Decisions made years ago were bets on where the company saw the market going. Today’s design choices are being made to impact the company’s roadmap for years in the future.

AMD EPYC In The Data Center Gen 1

AMD EPYC has been in the market for about 15 months. The company has said that AMD EPYC is doing well in the market. We hope to hear about the next generation today. AMD confirmed we are getting a Rome preview today.

Radeon Instinct has been in the market since 2017, but the company knows that it needs to push software to compete. AMD believes that its key value going forward will be in open source software.

Announcing AMD EPYC on AWS

Today AWS and AMD are announcing that AMD EPYC instances are immediately available in AWS. This news is huge.

AMD EPYC On AWS Announcement

R5a, M5a, and T3a instances are going to be available today on Amazon AWS EC2. The R5a and M5a instances will cost 10% less than their Intel counterparts, R5 and M5. R5 instances are for memory-intensive workloads. M5 is for mainstream applications such as databases and enterprise applications. T3 instances are for blogs and other lower-end applications.

AWS And AMD EPYC Instances

Again, this is a big deal. Lower cost instances from AWS based on AMD EPYC and a simple change from R5 to R5a.
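
To put the 10% figure in perspective, here is a minimal sketch of what the switch could mean on a monthly bill. Only the 10% discount comes from the announcement. The hourly price is a made-up placeholder, not an actual AWS rate, and the function names are ours.

```python
# Rough savings estimate for moving from an Intel-based instance (e.g. R5)
# to its AMD EPYC-based counterpart (e.g. R5a).
# Assumption: the "a" variant costs 10% less, per the announcement.
# The hourly price used in the demo is a placeholder, NOT a real AWS price.

AMD_DISCOUNT = 0.10    # 10% lower cost, from the announcement
HOURS_PER_MONTH = 730  # common approximation: 8760 hours / 12 months


def amd_hourly_price(intel_hourly_price: float) -> float:
    """Price of the AMD-based variant given the Intel-based price."""
    return intel_hourly_price * (1.0 - AMD_DISCOUNT)


def monthly_savings(intel_hourly_price: float, instance_count: int = 1) -> float:
    """Savings per month when running instance_count instances around the clock."""
    delta = intel_hourly_price - amd_hourly_price(intel_hourly_price)
    return delta * HOURS_PER_MONTH * instance_count


if __name__ == "__main__":
    placeholder_price = 1.00  # hypothetical $/hour, for illustration only
    saved = monthly_savings(placeholder_price, instance_count=10)
    print(f"10 instances at ${placeholder_price:.2f}/hr save about ${saved:.2f}/month")
```

The savings scale linearly with fleet size and hours run, so the discount matters most for always-on instances.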

Mark Papermaster on the new 7nm AMD Technology

Mark is now talking about Zen 2. More importantly, the company is showing that it can deliver on its roadmap. Zen 2 on 7nm is sampling, and Zen 3 is on track on 7nm+ for 2020.

AMD Delivering Roadmap To 7nm

There is a big discussion on why the roadmap is important and how AMD is committed to delivering on it.

AMD believes that it made a big bet with 7nm: that it could produce chips on the node, which it saw as a long and sustaining node in the industry.

AMD Mark Papermaster On 7nm

The company saw the jump to 7nm as difficult, but felt that if it could execute, it could leapfrog on performance and power consumption. AMD says that the TSMC 7nm product is doing very well.

AMD Mark Papermaster 7nm Benefits

The company was aiming to achieve parity with Intel using 7nm versus Intel 10nm when it made the original bet. Instead, it is going to be first to market on the new node.

AMD 7nm Versus Intel Process

AMD is not just aiming for IPC improvements or lower power. AMD is aiming for 2x performance per compute node.

AMD Zen 2 Architecture 2x Throughput

Here are the new front end advances helping to deliver twice the throughput.

AMD Zen 2 New Front End

Zen 2 doubles floating point performance through wider (256-bit) execution, doubles load/store bandwidth, increases dispatch and retire bandwidth, and maintains high throughput for all processing modes.

AMD Zen 2 Floating Point Advances

New security features are coming with Zen 2, including hardware-enhanced Spectre mitigations.

AMD Zen 2 Security Enhancements

We are now getting into interconnects. You can read about the current AMD EPYC interconnect versus Intel in: AMD EPYC Infinity Fabric v. Intel Broadwell-EP QPI Architecture Explained.

AMD is using a 14nm I/O die and then optimizing on CPU performance by using 7nm CPU chiplets. This is a “revolutionary” approach. As a note here, Intel, Xilinx and others have multi-chip package designs using different process nodes.

AMD EPYC Zen 2 14nm IO Die

AMD is delivering Zen 2. Zen 3 is on track and Zen 4 is nearing completion. AMD wants to show it is delivering on its roadmap.

AMD Zen 2 To Zen 4 Roadmap

AMD Radeon Technologies Group David Wang Presentation

AMD is making a push into the data center with AMD Radeon. This is where we expect AMD will start a renewed push against NVIDIA. AMD 7nm Vega is on track to ship “later this year.”

AMD David Wang Next Horizon Event

AMD is talking about double precision being important in the data center. This is an area where NVIDIA has been strong.

AMD Radeon Instinct MI60 is the world’s first 7nm GPU for the data center.

AMD Radeon Instinct MI60 For The Data Center

There is a new, enhanced AMD Vega architecture. It supports operations from FP64 down to 4-bit integer for flexible compute.

AMD Radeon Vega Architecture For 7nm

32GB of memory and 1TB/s of memory bandwidth with ECC.

AMD Radeon Instinct MI60 32GB And 1TBps Bandwidth

AMD EPYC Rome to MI60 PCIe Gen4!

AMD EPYC 2 Rome To Radeon MI60 PCIe Gen4

On the software side, AMD ROCm 2.0 is being announced and upstreamed in Linux. AMD also says it is embracing containers with ROCm 2.0 (finally!!!!)

AMD ROCm 2.0 Launched

AMD is claiming 8.8x the double precision performance for the MI60 over the previous-generation MI25, and 2.8x on ResNet-50.

AMD Radeon Instinct MI60 Over MI25 Resnet 50

AMD is claiming similar or better single and double precision performance compared to the NVIDIA Tesla V100 PCIe.

AMD Radeon Instinct MI60 Single And Double Precision Performance To V100

It is great to see AMD getting competitive with NVIDIA here. At the same time, the NVIDIA Tesla V100 has been out for a long time. With a GPU launching in late 2018, running close to a 2017-era GPU that has enormous adoption is good, but we need to see pricing information. If it is at price parity with a Tesla V100, AMD will still have work to do. Also, using the PCIe version means that the company is not comparing against the DGX-1 SXM2 modules or the DGX-2 modules, which have additional thermal headroom. AMD is comparing against the “low end” NVIDIA Tesla V100. You can read more about the NVIDIA DGX-1 class SXM2 modules in our How to Install NVIDIA Tesla SXM2 GPUs in DeepLearning12 article.

Here is the roadmap for the AMD Radeon Instinct GPU.

AMD Radeon Instinct Roadmap 2018

This section is done. We expect AMD EPYC 2 after the break!

AMD EPYC Rome with Forrest Norrod

We are going to get more details on the AMD EPYC Rome 7nm Zen 2 product today. Rome is sampling with a roadmap to Zen 3 thereafter. We already have a preview.

AMD EPYC Rome

As we get ready for the next part of the session to start we know a few details about Rome:

Forrest Norrod is going into a recap of how AMD EPYC gen 1 has gone since its inception.

AMD EPYC Forrest Norrod

Here is an interesting slide. Today, the top seven server customers account for about 40% of servers.

AMD EPYC Forrest Norrod Market Overview

Forrest is talking about how the company is doing with AMD EPYC in the market today. This is important because the company needs to show the adoption of AMD EPYC. Cloud service providers are offering differentiated instances and services. Forrest is talking about how Microsoft and AWS have different types of uses for AMD EPYC. Packet, Hivelocity, and Baidu are using single socket platforms for hosting and services. Tencent is using AMD EPYC both for cloud and internal operations.

A VMware video is rolling, saying that the company will allow for AMD EPYC’s high core count, memory, and single socket benefits in its existing licensing structure.

AMD EPYC 7551p V Dual Intel Xeon Gold 5118

AMD says that most buyers will just buy the cost-equivalent next generation, for example replacing dual Intel Xeon E5-2660 V3 servers with dual Intel Xeon Gold 5118 servers.

Forrest says that you can use a single AMD EPYC 7551P instead. Between administrative, hardware, licensing, and space costs, AMD is claiming one can get a 45% lower per-VM TCO.

AMD EPYC 45 Percent Lower Per VM TCO VMware
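
For readers who want to run this kind of comparison on their own numbers, here is a minimal sketch of a per-VM TCO calculation using the four cost buckets AMD named. Every dollar figure and VM count below is a placeholder we invented to exercise the function. They are not AMD's inputs and do not reproduce the 45% claim. The halved licensing line assumes per-socket licensing, which is our assumption for the example.

```python
from dataclasses import dataclass


@dataclass
class ServerCosts:
    """Cost buckets named in the presentation; all values are user-supplied."""
    hardware: float
    licensing: float
    administration: float
    space: float
    vm_count: int

    def per_vm_tco(self) -> float:
        total = self.hardware + self.licensing + self.administration + self.space
        return total / self.vm_count


def tco_reduction(baseline: ServerCosts, candidate: ServerCosts) -> float:
    """Fractional per-VM TCO reduction of candidate versus baseline."""
    return 1.0 - candidate.per_vm_tco() / baseline.per_vm_tco()


if __name__ == "__main__":
    # Placeholder inputs only, NOT figures from AMD, Intel, or VMware.
    dual_socket = ServerCosts(hardware=10_000, licensing=8_000,
                              administration=4_000, space=2_000, vm_count=40)
    # Assumption: per-socket licensing, so one socket halves the licensing line.
    single_socket = ServerCosts(hardware=8_000, licensing=4_000,
                                administration=4_000, space=2_000, vm_count=40)
    print(f"Per-VM TCO reduction: {tco_reduction(dual_socket, single_socket):.0%}")
```

The result is only as good as the inputs, so it is worth checking which buckets actually change when moving from two sockets to one.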

In high-performance computing, something we will see more of at SC18 next week, the company is talking about the Cray CS500 supercomputer platform. Next, they are talking about the Shasta supercomputer with the Slingshot interconnect.

AMD EPYC And Cray Shasta

AMD EPYC 2 Rome details are coming next. AMD “Naples” is just the beginning.

Dr. Lisa Su is Back on Stage for AMD EPYC 2 “Rome”

Here we go on the AMD EPYC 2 “Rome” generation.

AMD EPYC Rome Preview 2

Up to 64 Zen 2 cores per socket, double the core count of the Naples generation. Faster IPC. Higher I/O and memory bandwidth.

AMD EPYC 2 Rome 64 Core Specs

AMD EPYC will be the first PCIe 4.0 x86 CPU.

AMD EPYC 2 Rome PCIe Gen4
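
As a quick sketch of why PCIe 4.0 matters, the per-direction link bandwidth can be estimated from the published per-lane signaling rates (8 GT/s for PCIe 3.0, 16 GT/s for PCIe 4.0) and the 128b/130b line encoding both generations use. This ignores packet and protocol overhead, so real-world throughput will be lower. The function name and the x16 default are ours.

```python
# Estimated per-direction PCIe link bandwidth from signaling rate and encoding.
# Protocol (TLP/DLLP) overhead is ignored, so these are upper bounds.

GT_PER_S = {"3.0": 8.0, "4.0": 16.0}  # giga-transfers per second, per lane
ENCODING_EFFICIENCY = 128 / 130        # 128b/130b encoding (PCIe 3.0 and 4.0)


def link_bandwidth_gbytes(gen: str, lanes: int = 16) -> float:
    """Approximate per-direction bandwidth in GB/s for a link of the given width."""
    gbits_per_s = GT_PER_S[gen] * ENCODING_EFFICIENCY * lanes
    return gbits_per_s / 8  # bits to bytes


if __name__ == "__main__":
    for gen in ("3.0", "4.0"):
        print(f"PCIe {gen} x16: ~{link_bandwidth_gbytes(gen):.2f} GB/s per direction")
```

Since the encoding is unchanged, doubling the signaling rate doubles the bandwidth of a link of the same width.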

Existing Naples platforms will run Rome (with PCIe 3.0) and Milan after that.

AMD EPYC 2 Road To Rome Through Naples

Rome will deliver 2x the per socket performance and 4x the floating point performance.

AMD EPYC 2x Performance Per Socket And 4x For Floating Point

There are eight 7nm core dies plus a 14nm I/O die. Each of the eight core dies has 8 cores.

Lisa Su Showing Off Rome
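
The package math is simple, but here is a small sketch for checking it. The die and core counts come from the presentation, and the 32-core Naples maximum is the current top-bin EPYC. The two-threads-per-core figure is our assumption that Rome keeps the same SMT arrangement as first-generation EPYC.

```python
# Core-count arithmetic for the Rome package shown on stage.
CORE_DIES = 8          # 7nm core dies per package (from the presentation)
CORES_PER_DIE = 8      # cores on each core die (from the presentation)
NAPLES_MAX_CORES = 32  # top-bin first-generation EPYC
THREADS_PER_CORE = 2   # ASSUMPTION: same 2-way SMT as first-generation EPYC


def total_cores(dies: int = CORE_DIES, cores_per_die: int = CORES_PER_DIE) -> int:
    """Total cores in the package across all core dies."""
    return dies * cores_per_die


if __name__ == "__main__":
    cores = total_cores()
    print(f"Rome cores per socket: {cores}")
    print(f"Ratio to Naples: {cores / NAPLES_MAX_CORES:.0f}x")
    print(f"Threads per socket (if SMT is unchanged): {cores * THREADS_PER_CORE}")
```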

Rome v. Intel, head to head with best-in-class Skylake: one prototype AMD Rome CPU v. dual Intel Xeon 8180M CPUs.

C Ray Intel Xeon 8180M Dual V. Prototype AMD EPYC 64 Core Rome Single Socket

If you read our AMD EPYC or Intel Xeon benchmarks, we often use c-ray, and have been doing so for years. C-ray is an extremely favorable benchmark for AMD’s Zen architectures over Intel Xeon architectures. Please keep that in mind. Still, this is an impressive result!

Wrapping Up

We will be heading to breakout sessions soon and will summarize the live notes above later.

5 COMMENTS

  1. @Misha

    With 7nm, 0.5x the power and 2x the density (core count), so 1x the power per package.

    2x the performance per watt (from 7nm), but 1x the IPC performance?

  2. The NorthBridge functionality has been migrated to the CPU over the years, now AMD is taking all that out of the CPU again. Interesting.
