The data is in, and the 4th Gen Intel Xeon Scalable processors (code-named Sapphire Rapids) and related technologies have demonstrated impressive results. However, the true revolution in this latest generation goes beyond raw performance numbers. Thanks to newer, more powerful platforms, AI's growing list of use cases is poised to accomplish more than ever.
Intel's 4th Gen and AI - Performance Improvements
First, regarding performance, the 4th Gen Intel Xeon Scalable processors offer improvements in three key areas:
AI Inference: Compared with the prior generation, Intel's 4th Gen delivers up to 10x higher PyTorch real-time inference performance with built-in Intel Advanced Matrix Extensions (Intel AMX), an AI accelerator, using BF16.
AI Training: The latest generation delivers up to 10x higher PyTorch training performance with built-in Intel Advanced Matrix Extensions (Intel AMX) (BF16).
Deep Learning: With Intel Advanced Matrix Extensions (Intel AMX), Intel's 4th Gen accelerates AI workloads by 3x to 5x for deep learning inference on SSD-ResNet34 and delivers up to 2x higher training performance on ResNet50 v1.5 compared with the previous generation.
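To make the BF16 numbers above concrete, here is a minimal, hypothetical PyTorch sketch of CPU inference under bfloat16 autocast. On a processor with Intel AMX and a PyTorch build backed by oneDNN, eligible matrix multiplications in this path can be dispatched to AMX tile instructions; the model and tensor shapes below are placeholders, not from the source.

```python
# Hedged sketch: bfloat16 inference on CPU. With AMX-capable hardware and a
# oneDNN-backed PyTorch build, the Linear layers' matmuls may run on AMX tiles.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 10)).eval()
x = torch.randn(8, 64)

# Autocast downcasts eligible ops (e.g., nn.Linear) to bfloat16 on CPU.
with torch.no_grad(), torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    logits = model(x)

print(logits.shape)  # torch.Size([8, 10])
print(logits.dtype)  # torch.bfloat16
```

The same autocast pattern applies to training loops; whether AMX is actually used is decided by the oneDNN backend at runtime based on the CPU's capabilities.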
4th Gen Xeon Scalable Processors Benefits by AI Use Case
For anyone building or upgrading an AI environment, the above performance advances result in improved real-world performance in a variety of areas, including:
Natural Language Processing
Whether for chatbots, predictive text, language translation, smart assistants, or more, the performance of nearly any NLP-based AI system is CPU-sensitive. For example, test results show up to a 5.7x improvement in end-to-end real-time inference leveraging Intel AMX on Document Level Sentiment Analysis (DLSA) with Hugging Face (IMDb). Additionally, 4th Gen processors showed up to 6.2x higher real-time natural language processing inference performance (BERT) with Intel AMX vs. the prior generation.
Recommender Systems
Speed is of the essence when you need to deliver personalized, contextually relevant content to users in real time. The 4th Gen Intel Xeon Scalable processors delivered up to 6.3x higher batch recommendation inference performance (DLRM) with Intel AMX over the prior generation, and up to 4x higher recommendation system training performance, again leveraging Intel AMX.
Confidential Computing - Multi-Party Machine Learning
To secure sensitive data and preserve privacy in multi-party environments, 4th Gen Intel Xeon Scalable processors feature Intel Software Guard Extensions (Intel SGX). This extensively researched, hardware-based security technology creates an access-restricted enclave with a minimal trust boundary.
AI Data Preprocessing
The performance of any AI system depends heavily on the quality of the data it ingests. To enhance the training of AI models, 4th Gen Intel Xeon Scalable processors offer Intel Advanced Vector Extensions 512 (Intel AVX-512). Test results show up to 18% higher performance on HiBench Kmeans using the Intel oneAPI Data Analytics Library (oneDAL) compared with the prior generation at the same core count (32). Additionally, up to 53% higher performance was achieved on HiBench Kmeans using oneDAL compared with OpenBLAS.
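For readers unfamiliar with the HiBench Kmeans workload cited above, the sketch below is a minimal NumPy implementation of k-means (Lloyd's algorithm). The distance computations and centroid updates it performs are the dense, vectorized arithmetic that libraries such as oneDAL accelerate with AVX-512; the data and function here are illustrative placeholders, not the benchmark itself.

```python
# Minimal k-means sketch (Lloyd's algorithm) in NumPy. The hot loops -- pairwise
# squared distances and per-cluster means -- are the kernels AVX-512 speeds up.
import numpy as np

def kmeans(points, k, iters=20, seed=0):
    rng = np.random.default_rng(seed)
    # Initialize centroids from randomly chosen data points.
    centroids = points[rng.choice(len(points), k, replace=False)]
    for _ in range(iters):
        # Squared distance from every point to every centroid: (n, k) matrix.
        d = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        # Move each centroid to the mean of its assigned points.
        for j in range(k):
            if np.any(labels == j):
                centroids[j] = points[labels == j].mean(axis=0)
    return centroids, labels

# Two well-separated blobs should recover two distinct centers.
pts = np.vstack([np.zeros((50, 2)), np.full((50, 2), 10.0)])
centers, labels = kmeans(pts, k=2)
```

Production libraries replace the Python-level loop with blocked, SIMD-friendly kernels, which is where the AVX-512 gains cited above come from.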
Classical Machine Learning
Whether for finance, life science, or academic research, classical machine learning continues to play a vital role. Leveraging Intel AVX-512, 4th Gen Intel Xeon Scalable processors produced 37% higher average machine learning training and inference performance vs. the prior generation. Intel also offers a variety of analytics toolkits, APIs, and frameworks to help ML and AI developers create even better applications.
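As a concrete example of the kind of classical-ML workload these gains apply to, here is a hedged NumPy sketch of ridge regression solved in closed form. Its cost is dominated by dense matrix products and a linear solve, the same BLAS-style kernels AVX-512 vectorizes; the data, weights, and regularization value are invented for illustration.

```python
# Hedged sketch: ridge regression via the normal equations,
# w = (X^T X + lam * I)^{-1} X^T y. Dense linear algebra like this is what
# AVX-512 (and accelerated libraries) speed up on Xeon CPUs.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))            # 200 samples, 5 features (synthetic)
true_w = np.array([1.0, -2.0, 0.5, 0.0, 3.0])
y = X @ true_w + 0.01 * rng.normal(size=200)  # targets with small noise

lam = 1e-3                                # illustrative regularization strength
w = np.linalg.solve(X.T @ X + lam * np.eye(5), X.T @ y)
```

With low noise and ample samples, the recovered weights land very close to `true_w`; the same pattern (fit, then predict with `X_new @ w`) covers inference as well as training.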
UNICOM Engineering's Platforms
To power AI from classical ML deployments to the latest applications, UNICOM Engineering offers multiple platforms that can be configured to your needs:
E-1800 R7 Series
In a 1U rack configuration, the E-1800 R7 series of platforms leverages up to two 4th Gen Intel Xeon Scalable processors (16 to 40 cores each) and offers 8 to 12 drive bays, four expansion slots, and multiple memory configurations, including options for Intel Optane Persistent Memory.
E-2900 R7 Series
The E-2900 R7 series offers the same CPU and memory options as the E-1800 but in a 2U solution. It also accommodates up to nine expansion cards and 12 or 24 hot-swap drives.
H-4448
To address the 4U market, UNICOM Engineering has introduced the H-4448, a platform that offers the same processor and memory configurations as the E series, with the added benefit of seven expansion card slots and room for up to 48 hot-swap drives.
Your AI's Success Depends on Hardware - UNICOM Engineering is Here to Help
When building or upgrading your AI deployment, remember the role of hardware in creating and powering the latest use cases.
As an Intel Technology Provider and Dell Technologies Titanium OEM partner, UNICOM Engineering stands ready to design, build, and deploy the right hardware solution for your next AI, Deep Learning, or HPC initiative. Our deep technical expertise can drive your transitions to next-gen platforms and provide the flexibility and agility required to bring your solutions to market.
Leading technology providers trust UNICOM Engineering as their application deployment and systems integration partner. And our global footprint allows your solutions to be built and supported worldwide by a single company. Schedule a consultation today to learn how UNICOM Engineering can assist your business.