4th Generation Intel® Xeon® Scalable Processors Overview Video
Engagement / Document Library / Intel® Deep Learning Boost (Intel® DL Boost) - Improve Inference Performance of Hugging Face BERT Base Model in Google Cloud Platform (GCP) Technology Guide
Intel® Deep Learning Boost (Intel® DL Boost) - Improve Inference Performance of Hugging Face BERT Base Model in Google Cloud Platform (GCP) Technology Guide
Intel® Deep Learning Boost (Intel® DL Boost) - Improve Inference Performance of Hugging Face BERT Base Model in Google Cloud Platform (GCP) Technology Guide
https://networkbuilders.intel.com/solutionslibrary/intel-deep-learning-boost-intel-dl-boost-improve-inference-performance-of-hugging-face-bert-base-model-in-google-cloud-platform-gcp-technology-guide
Last Updated: Apr 19, 2023
BERT is the best model to detect malicious and phishing websites/emails attacks as it provides very high accuracy. However, it takes longer inference time when compared to the traditional methods. This guide shows how to take advantage of Intel® AVX-512 Vector Neural Network Instructions (Intel® AVX-512 VNNI), oneDNN, and IPEX tool to boost AI Inference performance using Hugging Face BERT base model as an example. The evaluations were conducted on Google Cloud Platform* service (GCP) using three different hardware configurations.