Intel Atom C3958 Benchmarks
For this exercise, we are using our legacy Linux-Bench scripts which help us see cross-platform “least common denominator” results. We do have a full set of expanded benchmarks from our next-gen test suite (Linux-Bench2) which you may see in other STH reviews that include this chip. The target market of the Intel Atom C3958 is on embedded applications making the original tests more useful. Generally, embedded applications such as storage controllers and networking appliances will not see heavy workloads where AVX2 / AVX-512 will be useful.
From what we saw in the Intel Atom C2000 series, there are only two OSes that matter for these embedded parts: Linux and FreeBSD. OSes like Windows have a negligible market share on these platforms and we would not recommend using an Atom C3000 series as a desktop. There are many offerings in the market more appropriate for that use case.
Python Linux 4.4.2 Kernel Compile Benchmark
This is one of the most requested benchmarks for STH over the past few years. The task was simple, we have a standard configuration file, the Linux 4.4.2 kernel from kernel.org, and make the standard auto-generated configuration utilizing every thread in the system. We are expressing results in terms of compiles per hour to make the results easier to read.
Here we see a solid performance, not quite up to what the Intel Atom C3955 compute-focused part can offer. Keen eyes will place performance around that of a Xeon D 8 core part. The microarchitecture difference is going to highlight some bigger performance differences than we would be otherwise accustomed to in our other tests.
c-ray 1.1 Performance
We have been using c-ray for our performance testing for years now. It is a ray tracing benchmark that is extremely popular to show differences in processors under multi-threaded workloads.
Here you can see solid performance due to having more cores and L1 cache. The Intel Xeon E3 line is not really a competitor as it lacks the features of the Atom and has significantly higher power consumption.
7-zip Compression Performance
7-zip is a widely used compression/ decompression program that works cross-platform. We started using the program during our early days with Windows testing. It is now part of Linux-Bench.
There is a fairly large chasm between the 16 core Atom C3000 series part and the 16 core Xeon D part. This compression is not using QAT offload which we will have more on soon. We also sorted the chart based on compression speed which puts the Intel Atom C3958 between the six and eight core Xeon D low power parts. Decompression sort would have put it between the eight and twelve core Xeon D parts. That is solid performance either way.
Sysbench CPU test
Sysbench is another one of those widely used Linux benchmarks. We specifically are using the CPU test, not the OLTP test that we use for some storage testing.
We had to remove the 2-core Atom CPUs such as the C2358 and D525 from this list as those generations made this chart borderline unreadable. This test tends to favor many cores and have strong scaling based on core counts which is why the C3958 performs so well here.
OpenSSL is widely used to secure communications between servers. This is an important protocol in many server stacks. We first look at our sign tests:
We also have the verify results sorted in the same order to make comparison easier.
Here we see the Intel Atom C3958 competitive with the Xeon Silver 4108. The Intel Xeon 4108 is a similar price part for higher power, more expandable Xeon Scalable servers. The other key point to look at here is the generational improvement. The Intel Atom C2758 was the top-end Rangeley generation Intel Atom C2000 series SKU with QuickAssist. Even without leveraging QAT, the top-bin performance has increased 4x on this test. OpenSSL is a key metric for these parts as they are commonly used in network and storage appliances.
UnixBench Dhrystone 2 and Whetstone Benchmarks
One of our longest running tests is the venerable UnixBench 5.1.3 Dhrystone 2 and Whetstone results. They are certainly aging, however, we constantly get requests for them, and many angry notes when we leave them out. UnixBench is widely used so we are including it in this data set. Here are the Dhrystone 2 results:
Here are the whetstone results.
Having a lot of cores makes up for some of the microarchitecture trade-offs made to keep power consumption low. Still, we see some solid performance out of this part.
Gone are the days of the “wimpy” Atom. The Atom C3958 sports a low clock speed (2.0 GHz) and does not have turbo boost, L3 cache, nor higher-end features such as AVX2/ AVX-512 support. Yet with 1MB L2 cache per core, massive IPC improvements, and 16 cores, the Intel Atom C3958 is competitive with the Xeon D and Xeon Bronze/ Silver lines in terms of performance. Although the Xeon lines are better for virtualization and general purpose compute, for most networking and storage appliances this is a very fast chip.
From a competitive side, there is a lot of talk about AMD EPYC in the market. AMD does not yet have a competitive offering in this segment since even the EPYC 7251 is a 120W TDP CPU before adding any other component to the system, or about 2x what we are seeing an entire configured Gigabyte MA10-ST0 test system pull at the outlet. Being fair, AMD Is not targeting this market with EPYC. Likewise, ARM has made lots of noise but the Intel Atom C3958 provides a solid mix of core performance and acceleration for crypto and compression. The Intel Atom C3000 series is certainly enough to hold current ARM offerings at bay for the near-term future.
Looking at the top-end QAT SKU from this generation versus the previous generation (Atom C2758) one can see that the lineup has significantly expanded its market coverage at the top end. Clock speeds are down ~17% but that is the only area where we are seeing specs decline. Core count has doubled from 8 to 16 cores. Cache size and RAM capacity have quadrupled to 16MB and 256GB respectively. Networking is effectively 10x the speed of the previous generation. PCe and SATA have moved up a generation and greatly expanded in numbers. TDP is up 55% to match the massive performance and platform upgrades. At the same time pricing is now much higher up around 116%. Of course, Intel has parts like the Atom C3758 which address a similar market segment to the previous top of the line part, but it shows how Intel is allowing the Intel Atom C3000 line to creep up higher in the performance stack.
Overall, this is an enormous generational upgrade in performance, but we expect the Intel Atom C3958 to be a lower volume part given its hefty price tag. At $449 for the CPU it is competing with the Intel Xeon Silver 4108 and Xeon D lines.
If you want to learn more, we have complete coverage at our Denverton Day Official STH Intel Atom C3000 Launch Coverage Central