At NVIDIA GTC 2026, KIOXIA made one of the more interesting storage announcements of the show. The company revealed two new SSD products targeting the rapidly growing AI inference market, and the positioning here is meaningfully different from standard data center NVMe fare. These are drives designed explicitly for GPU-initiated access, a use case that will only become more important as AI model complexity outpaces local HBM capacities. NVIDIA outlined the key reason at the show: there will be many more agents making many more requests of storage than in years past, and the KV cache is becoming an important, and meaningfully different, class of storage.
NVIDIA Storage-Next and Why It Matters
To understand what KIOXIA is doing here, it helps to understand NVIDIA’s Storage-Next initiative. The program calls on SSD vendors to engineer drives that GPUs can access directly, effectively extending the GPU’s usable memory hierarchy beyond the limits of on-package High Bandwidth Memory. As AI workloads shift from being purely compute-intensive to increasingly data-intensive (think trillion-parameter models and multi-million-token context windows), the bottleneck shifts toward memory capacity rather than raw FLOPS. DRAM simply cannot keep up with those demands at any reasonable cost.

Storage-Next is NVIDIA’s architectural answer. It pulls high-performance flash into the GPU’s memory space so that data-hungry workloads like KV caches for large-scale inference have somewhere to live. That, in turn, requires a step-function increase in IOPS (NVIDIA is targeting 100 million) along with better handling of small transfer sizes to keep GPUs fed. This is an architecture designed to keep PCIe buses utilized so that the GPUs are not idle waiting for data. For this, KIOXIA has the GP Series.
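For a rough sense of scale, here is the arithmetic behind that 100 million IOPS target. The block sizes below are our illustrative assumptions, not figures from NVIDIA or KIOXIA:

```python
# Back-of-envelope: bandwidth implied by an IOPS target at a given block size.
# 100M IOPS is NVIDIA's stated Storage-Next target; block sizes are assumptions.
def bandwidth_gbps(iops: float, block_bytes: int) -> float:
    """Return the bandwidth in GB/s needed to sustain `iops` at `block_bytes` per I/O."""
    return iops * block_bytes / 1e9

target_iops = 100e6
for block in (512, 4096):
    print(f"{block} B blocks: {bandwidth_gbps(target_iops, block):.1f} GB/s")
# 512 B blocks: 51.2 GB/s
# 4096 B blocks: 409.6 GB/s
```

At 4K blocks, 100 million IOPS would demand roughly 410 GB/s, far beyond any near-term drive interface, which is why fine-grained transfers, not bigger ones, are the design point.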
KIOXIA GP Series: Super High IOPS SSD
The GP Series is the headline product. It uses KIOXIA’s XL-FLASH storage class memory rather than conventional TLC NAND, which has a few implications. XL-FLASH is an SLC-based storage class memory that KIOXIA has had in its portfolio for some time. It trades raw density for latency and IOPS, which is exactly the tradeoff needed when the GPU is doing fine-grained, low-latency reads rather than the large sequential transfers that data center SSDs typically optimize for. KIOXIA is specifically highlighting 512B access granularity with the GP Series, far finer than the typical 4K minimum of conventional NVMe SSDs. That matters a great deal when serving attention head activations or KV cache lookups from GPU-directed requests rather than host CPU I/O, since 4K accesses would leave the PCIe bus underutilized.
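To see why 512B lines up so neatly with KV cache access patterns, consider the per-head data sizes involved. The model parameters here (head dimension, dtype) are illustrative assumptions typical of recent LLMs, not anything from KIOXIA's announcement:

```python
# One attention head's K+V vectors for one token, at common LLM dimensions.
# head_dim and dtype are illustrative assumptions, not KIOXIA specs.
head_dim = 128          # per-head dimension, common in recent LLMs
bytes_per_elem = 2      # FP16/BF16
kv_pair = 2             # one key vector plus one value vector

bytes_per_head_token = kv_pair * head_dim * bytes_per_elem
print(bytes_per_head_token)  # 512 bytes

# Fetching that through a 4 KiB-granularity SSD reads 8x the needed data:
waste = 1 - bytes_per_head_token / 4096
print(f"{waste:.1%} of each 4 KiB read wasted")  # 87.5%
```

Under these assumptions, a 4K-granularity drive wastes seven-eighths of the bus bandwidth on every single-head fetch, while a 512B-granularity drive wastes none of it.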
KIOXIA CM9 Series: PCIe 5.0 E3.S for KV Cache
The CM9 is the more near-term product and takes a different angle. Where the GP Series is a novel architecture play, the CM9 is a high-capacity, high-endurance PCIe 5.0 E3.S SSD aimed at KV cache workloads in large-scale AI inference clusters. We have previously covered the announcement of the KIOXIA CM9 PCIe Gen5 NVMe SSDs. KIOXIA’s 3 DWPD rating on a 25.6 TB drive is worth doing the math on: 76.8 TB of writes per day.
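The endurance math is simple but worth spelling out. The capacity and DWPD figures are from the announcement; the five-year horizon is our assumption of a typical enterprise warranty period:

```python
# CM9 endurance arithmetic from the rated specs in the announcement.
capacity_tb = 25.6   # CM9 capacity, TB
dwpd = 3             # drive writes per day rating

daily_writes_tb = capacity_tb * dwpd
print(daily_writes_tb)  # 76.8 TB of writes per day

# Over an assumed 5-year warranty period (our assumption, not a KIOXIA spec):
warranty_years = 5
lifetime_pb = daily_writes_tb * 365 * warranty_years / 1000
print(f"~{lifetime_pb:.0f} PB written over {warranty_years} years")  # ~140 PB
```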

For inference infrastructure that is writing and invalidating KV cache entries at high throughput, this is the kind of endurance specification that starts to make sense. To give some sense of why this matters, imagine an agent or sub-agent that is spun up for a task, needs its KV cache data, and then lives for only a minute before it is destroyed. KIOXIA is positioning the CM9 alongside NVIDIA’s Context Memory Storage (CMX) architecture, which defines how inference systems should tier memory between HBM, DRAM, and high-performance storage.
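To put a rough number on that churn, here is a standard KV cache sizing calculation for one agent's context. All model parameters are illustrative assumptions (roughly a 70B-class LLM with grouped-query attention), not tied to any specific deployment:

```python
# Rough KV cache footprint for one agent's fully-populated context window.
# All parameters below are illustrative assumptions, not vendor figures.
layers, kv_heads, head_dim = 80, 8, 128
bytes_per_elem = 2           # FP16/BF16
context_tokens = 128 * 1024  # 128K-token context window

# 2x for keys and values, per layer, per KV head, per token.
kv_bytes = 2 * layers * kv_heads * head_dim * bytes_per_elem * context_tokens
print(f"{kv_bytes / 1e9:.1f} GB per fully-populated context")  # 42.9 GB
```

Under these assumptions, every short-lived agent that materializes and discards a full context writes tens of gigabytes; do that every minute across a node and the writes add up to terabytes per hour, which is exactly where a 3 DWPD rating earns its keep.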
Final Words
The GP Series is really the more provocative of the two announcements. XL-FLASH has been around for some time, but it has typically served as a hot data tier to accelerate CPU access to storage. Now, with GPUs and agents driving the I/O, there is room for a new class of storage that behaves differently because it is designed specifically for GPU applications. Positioning it as a GPU-accessible memory extension under the NVIDIA Storage-Next banner gives it a much more specific and defensible use case. Whether that translates into design wins depends largely on how aggressively NVIDIA pushes Storage-Next adoption in its next generation of GPU platforms and systems, but the timing alongside GTC is not accidental, and NVIDIA is making a major push on storage with its STX racks.
2027 might be when storage gets really exciting again.