Dell Precision T7920 Workstation Deep Learning Benchmarks
Deep learning training and inferencing workloads rely heavily on GPUs. Here, we can lean on our large data set for comparisons that show just how fast the Quadro RTX 8000 is.
ResNet-50 Inferencing in TensorRT using Tensor Cores.
ImageNet is an image classification database launched in 2007 designed for use in visual object recognition research. Organized by the WordNet hierarchy, hundreds of image examples represent each node (or category of specific nouns).
INT8 precision is by far the fastest inferencing method; where possible, converting code to INT8 will yield faster runs. Installed memory has one of the largest impacts on these benchmarks. Inferencing on NVIDIA RTX graphics cards does not tax the GPUs to a great degree; however, additional memory allows for larger batch sizes, and the NVIDIA Quadro RTX 8000 could easily handle batch sizes of 500+.
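To make the idea of "converting to INT8" more concrete, here is a minimal sketch of symmetric max-abs quantization in plain Python. It is an illustration only and not how TensorRT performs its calibration; the function names and the sample weights are hypothetical.

```python
def quantize_int8(values):
    """Map floats to INT8 codes in [-127, 127] using one shared scale.

    Assumption: a simple symmetric max-abs scheme, chosen for clarity.
    """
    max_abs = max(abs(v) for v in values)
    # Guard against an all-zero input, where any scale would do.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate float values from INT8 codes."""
    return [c * scale for c in codes]


# Hypothetical FP32 weights, for illustration only.
weights = [0.82, -1.27, 0.003, 0.45, -0.61]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)

# Rounding error is at most half a quantization step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(codes, scale, max_error)
```

Each value now fits in one byte instead of four, which is part of why INT8 runs faster and fits larger batches in memory, at the cost of a small rounding error per value.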
Let’s look at FP16 and FP32 results.
For inferencing workloads using NVIDIA CUDA, the Dell Precision T7920 performs extremely well.
ResNet-50 Training using Tensor Cores.
We also wanted to train the venerable ResNet-50 using TensorFlow. During training, the neural network learns features of images (e.g. objects, animals, etc.) and determines which features are important. Periodically (every 1000 iterations), the neural network tests itself against the test set to measure its loss, which indicates how accurately the network is learning. Accuracy can be increased through repetition (that is, by running a higher number of epochs).
Here again, we can see excellent performance from the Quadro RTX 8000 in the Dell Precision T7920.
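The train-then-periodically-test pattern described above can be sketched with a toy model. This is not the ResNet-50 benchmark or TensorFlow code; it is a hypothetical two-parameter linear model trained with plain stochastic gradient descent, and all names, data, and hyperparameters are assumptions chosen for illustration.

```python
import random


def train(train_set, test_set, iterations=5000, eval_every=1000, lr=0.01):
    """Fit y = w*x + b with SGD, evaluating on the test set periodically."""
    w, b = 0.0, 0.0
    history = []  # (iteration, test loss) pairs
    for step in range(1, iterations + 1):
        x, y = random.choice(train_set)
        err = (w * x + b) - y
        # Gradient step on the squared error of one sample.
        w -= lr * err * x
        b -= lr * err
        if step % eval_every == 0:
            # Mean squared error on data the model never trained on.
            loss = sum(((w * tx + b) - ty) ** 2 for tx, ty in test_set)
            history.append((step, loss / len(test_set)))
    return w, b, history


random.seed(0)
# Toy data drawn from y = 2x + 1 (hypothetical, noise-free).
data = [(i / 10.0, 2.0 * (i / 10.0) + 1.0) for i in range(-20, 21)]
random.shuffle(data)
train_set, test_set = data[:32], data[32:]

w, b, history = train(train_set, test_set)
for step, loss in history:
    print(f"iteration {step}: test loss {loss:.3e}")
```

Running more iterations (or more passes over the data) drives the test loss down further, which mirrors the point about repetition improving accuracy.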
Training using OpenSeq2Seq (GNMT)
While ResNet-50 is a Convolutional Neural Network (CNN) that is typically used for image classification, Recurrent Neural Networks (RNN) such as Google Neural Machine Translation (GNMT) are used for applications such as real-time language translation.
We should note that other consumer GPUs we have tried on this benchmark could not complete training due to insufficient installed memory.
The NVIDIA Quadro RTX 8000 has 48GB of installed memory, double that of the Titan RTX. The Quadro RTX 8000 is easily the equal of the Titan RTX, but it supports larger batch sizes on a single GPU. The ability to house multiple NVIDIA Quadro RTX 8000 GPUs in the Dell Precision T7920 allows desktop data science to be done at levels formerly reserved for the data center.
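As a rough illustration of why installed memory translates into batch size, the sketch below estimates the largest batch that fits on a card. The 48GB and 24GB capacities come from the comparison above; the fixed overhead and per-sample figures are hypothetical placeholders, not measurements from these benchmarks.

```python
def max_batch_size(gpu_memory_mb, fixed_overhead_mb, per_sample_mb):
    """Largest batch that fits after reserving a fixed overhead.

    Assumption: memory use is modeled as overhead + batch * per_sample,
    which is a simplification of real framework behavior.
    """
    usable = gpu_memory_mb - fixed_overhead_mb
    if usable <= 0:
        return 0
    return usable // per_sample_mb


# Hypothetical figures: weights, optimizer state, and workspace.
FIXED_OVERHEAD_MB = 4096
# Hypothetical activation memory consumed by each sample in the batch.
PER_SAMPLE_MB = 80

titan_rtx = max_batch_size(24 * 1024, FIXED_OVERHEAD_MB, PER_SAMPLE_MB)
quadro_rtx_8000 = max_batch_size(48 * 1024, FIXED_OVERHEAD_MB, PER_SAMPLE_MB)

print("24GB card:", titan_rtx)        # 256 under these assumptions
print("48GB card:", quadro_rtx_8000)  # 563 under these assumptions
```

Because the fixed overhead is paid once regardless of capacity, doubling the memory more than doubles the batch size in this simple model.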
Next, we are going to look at AIDA64 CPU Benchmarks.