Intel Xeon E5-2695 V4 benchmarks using a HPE ProLiant DL380 Gen9

1
HPE Intel Xeon E5-2695 V4 dual system
HPE Intel Xeon E5-2695 V4 dual system

Today we have benchmarks of dual Intel Xeon E5-2695 V4 processors. Each Intel Xeon E5-2695 V4 has 18 core, 36 thread processor with 45MB L3 cache. With 2.1GHz base clock speeds and turbo boost speeds up to 3.3GHz, these chips are a significant upgrade to the 14C/ 28T E5-2695 V3 which sported only 35MB L3 cache. We have published a number of benchmarks across different dual socket Intel Xeon E5-2600 V4 configurations already which we have linked below.

Test Configuration

Our test platform was a standard EATX motherboard upgraded for Xeon E5 V4 support via a simple BIOS upgrade. This is one of the NVMe servers we use in the Fremont colocation that we brought offline and upgraded to the V4 part.

This was an exciting comparison point from us as it is one of the first HPE machines we have published benchmark results for.

HPE Intel Xeon E5-2695 V4 dual system
HPE Intel Xeon E5-2695 V4 dual system

The big question is how will the HPE ProLiant DL380 Gen9 fare with two of Intel’s newest processors?

Intel Xeon E5-2695 V4 Benchmarks

For our testing we are using Linux-Bench scripts which help us see cross platform “least common denominator” results. We are using gcc due to its ubiquity as a default compiler. One can see details of each benchmark here. We are likely going to update the Linux-Bench in the near future with a few new tests as well as an even simpler to use/ faster revision, but for now, we are using our old Ubuntu 14.04.3 LTS version.

Python Linux 4.4.2 Kernel Compile Benchmark

This is one of the most requested benchmarks for STH over the past few years. We (finally) have a Linux kernel compile benchmark script that is consistent. Expect to see this functionality migrate into Linux-Bench soon (we are just awaiting the parser work on it.) The task was simple, we have a standard configuration file, the Linux 4.4.2 kernel from kernel.org, and make with every thread in the system. We are expressing results in terms of complies per hour to make the results easier to read.

Intel Xeon E5-2600 V4 Initial Linux Kernel Compile Benchmarks
Intel Xeon E5-2600 V4 Initial Linux Kernel Compile Benchmarks

As you can see, the dual Xeon E5-2695 V4 falls right in-line with what we would expect from this chip given its specs.

c-ray 1.1 Performance

We have been using c-ray for our performance testing for years now. It is a ray tracing benchmark that is extremely popular to show differences in processors under multi-threaded workloads.

Dual Intel Xeon E5-2695 V4 c-ray benchmarks
Dual Intel Xeon E5-2695 V4 c-ray benchmarks

Our dual Xeon E5-2695 V4 configuration is clearly showing just how much of an improvement we are seeing even over Sandy Bridge generation parts. This is due to the greatly increased core counts with Broadwell-EP.

7-zip Compression

7-zip is a widely used compression/ decompression program that works cross platform. We started using the program during our early days with Windows testing. It is now part of Linux-Bench.

Dual Intel Xeon E5-2695 V4 7-zip benchmarks
Dual Intel Xeon E5-2695 V4 7-zip benchmarks

Compression is a major operation we see in today’s workloads so we load the CPU with compression and decompression tasks. As one can see, performance is again excellent.

NAMD Performance

NAMD is a molecular modeling benchmark developed by the Theoretical and Computational Biophysics Group in the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign. More information on the benchmark can be found here.

Dual Intel Xeon E5-2695 V4 NAMD benchmarks
Dual Intel Xeon E5-2695 V4 NAMD benchmarks

Somewhat surprisingly we find the Intel Xeon E5-2697 V4 perform better than several of the older configurations we have tested. Performance was very good on this complex benchmark.

Sysbench CPU test

Sysbench is another one of those widely used Linux benchmarks. We specifically are using the CPU test, not the OLTP test that we use for some storage testing.

Dual Intel Xeon E5-2695 V4 Sysbench CPU Multi-Threaded
Dual Intel Xeon E5-2695 V4 Sysbench CPU Multi-Threaded

We received some feedback that the single-threaded results we had were all within a fairly tight band while the multi-threaded results showed a very high delta. We are removing those results however you can compare results using Linux-Bench‘s interactive charting feature.

OpenSSL Performance

OpenSSL is widely used to secure communications between servers. This is an important protocol in many server stacks. We first look at our sign tests:

Dual Intel Xeon E5-2695 V4 OpenSSL Sign
Dual Intel Xeon E5-2695 V4 OpenSSL Sign

Moving to the verify results:

Dual Intel Xeon E5-2695 V4 OpenSSL Verify
Dual Intel Xeon E5-2695 V4 OpenSSL Verify

OpenSSL is a hot topic right now and you can see the impact of high core counts on the benchmark on both the sign and verify side of the benchmark.

UnixBench Dhrystone 2 and Whetstone Benchmarks

Of course, these chips are not meant for heavy compute but we pick out the UnixBench 5.1.3 Dhrystone 2 and Whetstone results to show some of the raw performance they are capable of. UnixBench is widely used so it is a good comparison point.

Here are the single threaded workloads:

Dual Intel Xeon E5-2695 V4 UnixBench dhrystone 2 single threaded
Dual Intel Xeon E5-2695 V4 UnixBench dhrystone 2 single threaded
Dual Intel Xeon E5-2695 V4 UnixBench whetstone single threaded
Dual Intel Xeon E5-2695 V4 UnixBench whetstone single threaded

As you can see, the results are relatively tight on our single threaded benchmarks likely due to the fact that compilers are good these days and these are not the most demanding tests out there.

Now the E5 V4’s sweet spot, the multi-threaded workloads:

Dual Intel Xeon E5-2695 V4 UnixBench dhrystone 2 multi threaded
Dual Intel Xeon E5-2695 V4 UnixBench dhrystone 2 multi threaded
Dual Intel Xeon E5-2695 V4 UnixBench whetstone multi threaded
Dual Intel Xeon E5-2695 V4 UnixBench whetstone multi threaded

Here we can see another strong showing. It is worth noting that the two processors have a tray price of around $2400. That figure is quite a bit more than some of these other configurations.

Conclusion

Again we see the Intel Xeon E5-2600 V4 trend continue with the E5-2695 V4. THe chips are incrementally faster than their previous generation counterparts but not by such a large margin where there is a major reason to upgrade from V3 to V4. On the other hand, the E5-2695 V4 is about 60% more expensive, per chip, than the E5-2670 V1, yet offers a 2x-3x performance improvement. We are starting to see where consolidation can greatly benefit lower power older E5 machines.

You can find more STH Xeon E5 V4 coverage here:

Subscribe to STH to get the latest benchmarks and platform reviews as they are published. We have a huge back log of content coming.

1 COMMENT

  1. Is there any chance we can get the E5 2697 V2 performance in the Linux Compile chart? It’s the only chart that it is missing. The reason I ask, is that comparing apples to apples, it’s interesting to see the performance differences between the three generations of 2697 SKU.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.