X

The browser version you are using is not recommended for this site.
Please consider upgrading to the latest version of your browser by clicking one of the following links.

Document Library

Reference architectures, white papers, and solutions briefs to help build and enhance your network infrastructure, at any level of deployment.

Related Content

Intel® Deep Learning Boost (Intel® DL Boost) - Improve Inference Performance of Hugging Face BERT Base Model in Google Cloud Platform (GCP) Technology Guide

Last Updated: Apr 19, 2023

BERT is the best model to detect malicious and phishing websites/emails attacks as it provides very high accuracy. However, it takes longer inference time when compared to the traditional methods. This guide shows how to take advantage of Intel® AVX-512 Vector Neural Network Instructions (Intel® AVX-512 VNNI), oneDNN, and IPEX tool to boost AI Inference performance using Hugging Face BERT base model as an example. The evaluations were conducted on Google Cloud Platform* service (GCP) using three different hardware configurations.

Download PDF

Your browser does not support PDFs. Download the PDF.

Categories
Categories Open Source Solutions Intel Technologies and Platforms Accelerators 2nd gen Intel Xeon Scalable processor 1st gen Intel Xeon Scalable processor 3rd gen Intel Xeon Scalable processor Network Location Hybrid Cloud Data Center On Premises Core Network Network Technologies Network Edge SD-WAN/uCPE Network Security AI and Automation Optimizations IPEX Verticals/Industries Industrial Telecommunications Enterprise Cloud Service Providers VNFS and CNFs Wide Area Network (WAN) Optimization Workloads and Use cases Artificial Intelligence (AI) Security SD-WAN and uCPE Secure Access Service Edge (SASE)