AMD 400G Adaptive Exotic SmartNIC Architecture at HC34

1
AMD 400G Adaptive SmartNIC PSX Subsystem
AMD 400G Adaptive SmartNIC PSX Subsystem

AMD has a new FPGA and hardened function NIC that it is presenting at Hot Chips 34. The AMD 400G Adaptive Exotic SmartNIC Architecture shown at HC34 has programmable logic, a hardnened NIC, and a 12-core Arm system. Let us get to it, but first, let us discuss what AMD is actually presenting. This is not the next Pensando part.

Note: We are doing this piece live at HC34 during the presentation so please excuse typos.

AMD 400G Adaptive Exotic SmartNIC Architectural Inflection Point

AMD decided to call this a “SmartNIC”. Since terms were used inconsistently, we created the STH NIC Continuum framework that has been used by not only STH (we are probably the primary industry coverage point for this class of device) and we have let several industry analysts use the framework.

In our framework, this is considered an “Exotic” NIC like the Intel IPU that we looked at in This Changes Networking Intel IPU Hands-on with Big Spring Canyon. Indeed, AMD even uses the use case we showed with Intel below as one of the use cases for its Adaptive Exotic Smart NIC.

The reason behind devices like this is that it allows one card to run in multiple customer data centers. Xilinx sells a lot to China, and that is a geography often with different cipher needs than in the US and Europe. As a result, having a FPGA allows a device to be flexible enough to address not just different use cases but also different customers.

AMD 400G Adaptive SmartNIC SoC Motivations
AMD 400G Adaptive SmartNIC SoC Motivations

AMD recognizes this and there are a number of different customers for this type of device.

AMD 400G Adaptive SmartNIC Space
AMD 400G Adaptive SmartNIC Space

Instead of playing tug-of-war between architectural needs. AMD-Xilinx is building a product for all of these.

AMD 400G Adaptive SmartNIC Architectural Inflection Point
AMD 400G Adaptive SmartNIC Architectural Inflection Point

Here is AMD’s 400G NIC objectives. CXL 2.0 and PCIe Gen5 clearly show this is a next-gen product.

AMD 400G Adaptive SmartNIC Objectives
AMD 400G Adaptive SmartNIC Objectives

With the solution, AMD is targeting higher-end use cases as those tend to be where we see FPGA exotic NICs deployed.

AMD 400G Adaptive SmartNIC Solution
AMD 400G Adaptive SmartNIC Solution

AMD has a number of devices onboard the NIC.

AMD 400G Adaptive SmartNIC Blocks
AMD 400G Adaptive SmartNIC Blocks

The host subsystem has PCIe Gen5 connectivity but it can also act as a CXL 2.0 endpoint.

AMD 400G Adaptive SmartNIC CPM5N Subsystem
AMD 400G Adaptive SmartNIC CPM5N Subsystem

Onboard there is a hardened (e.g. not on FPGA logic) network subsystem.

AMD 400G Adaptive SmartNIC HNICX Subsystem
AMD 400G Adaptive SmartNIC HNICX Subsystem

The NIC has its own memory subsystem as we would expect that uses DDR5 or LPDDR.

AMD 400G Adaptive SmartNIC Memory Subsystem
AMD 400G Adaptive SmartNIC Memory Subsystem

This is the processing subsystem PSX. LPD is the 4-core (Arm R52) Low Power Domain. FPD is the 8 core (Arm A78E) Full Power Domain. There are also some hardware accelerators here for things like TLS.

AMD 400G Adaptive SmartNIC PSX Subsystem
AMD 400G Adaptive SmartNIC PSX Subsystem

The next two slides talk about coupling between different components.

AMD 400G Adaptive SmartNIC CPM5N PSX Coupling
AMD 400G Adaptive SmartNIC CPM5N PSX Coupling

AMD says tight coupling is important for higher-speed ~400Gbps environments.

AMD 400G Adaptive SmartNIC Programmable Logic PSX Coupling
AMD 400G Adaptive SmartNIC Programmable Logic PSX Coupling

OVS is a big application for these devices. Most solutions in the space cover this.

AMD 400G Adaptive SmartNIC Full OVS Offload
AMD 400G Adaptive SmartNIC Full OVS Offload

VirtIO is really interesting because it allows a cloud (public or on-prem) provider to virtualize resources and present them to tenants/ customers in a standardized way, and then manage infrastructure separately on the back-end.

AMD 400G Adaptive SmartNIC VirtIO Net
AMD 400G Adaptive SmartNIC VirtIO Net

Here is a very interesting use case where there the NIC is being used for custom flows on the networking side. This is a pretty common use case.

AMD 400G Adaptive SmartNIC PCIe Endpoint 400G
AMD 400G Adaptive SmartNIC PCIe Endpoint 400G

The NIC can virtualize NVMe.

AMD 400G Adaptive SmartNIC PCIe Endpoint 200G
AMD 400G Adaptive SmartNIC PCIe Endpoint 200G

This is the solution that we showed with the Intel IPU solution in the article/ video above.

With the FPGA, this can be a PCIe root, similar to an ASIC DPU. One can then put NVMe off of the device and avoid using a traditional host.

AMD 400G Adaptive SmartNIC PCIe Root Complex 400G
AMD 400G Adaptive SmartNIC PCIe Root Complex 400G

We actually showed this concept with the NVIDIA BlueField-2 DPUs a few months ago in ZFS without a Server Using the NVIDIA BlueField-2 DPU.

Here are the performance specs from AMD.

AMD 400G Adaptive SmartNIC Summary
AMD 400G Adaptive SmartNIC Summary

This is a fairly cool solution, but it would have been nice to see more about it in this talk.

Final Words

It is very confusing to call this a SmartNIC at this point. It is certainly something more capable than most DPUs or at least more flexible. Still, we probably need to re-brand Exotic for folks. AMD needs to address this because it now has Pensando which competes with this solution in some markets.

The card makes a lot of sense for AMD as Xilinx has a business making these types of FPGA NICs. Having a new generation is great. We were told by AMD prior to this announcement that this is not a product announcement, so this is a future product apparently. Hopefully, we get to see one live in the future, especially as AMD goes PCIe Gen5.

1 COMMENT

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.