Talk: Hyperparameter Optimization for MLPerf Training with SigOpt
4:30 - 5pm
Large scale deep learning training workload runtime optimization is computationally expensive, requires contributions from a multidisciplinary team and relies on complex hyperparameter optimization (HPO) techniques. Habana Labs developed a computationally cost efficient methodology to reduce the MLPerf training workload’s runtime, namely the reduction of the number of training epochs required to reach target accuracy.
The Habana Labs team utilized HPO to optimize runtime and tested two different methods in the process: home-grown Grid Search and the SigOpt optimization library that is built with an ensemble of Bayesian and other global optimization algorithms. When comparing these two optimization methods, SigOpt provided a clear advantage over home-grown Grid Search in three ways. First, SigOpt required fewer observations to reach the same threshold accuracy. Second, it reached the same threshold accuracy with an even lower number of training epochs when compared to home-grown Grid Search. And, finally, it also provided insight into the potential relations among the hyperparameters in a Dashboard. In this talk, the Habana Labs team will share their experience working through these challenging workloads and the insights they gained in the process.
Add to Calendar
11/16/2021 4:30 pm11/16/2021 5:00 pmAmerica/Los_AngelesTalk: Hyperparameter Optimization for MLPerf Training with SigOptSigOpt AI & HPC Summit 2021 - Virtual & Free
Large Scale Machine Learning Engineer, Habana Labs
As a large scale machine learning engineer at Habana, Basem leads efforts to accelerate AI workloads on Habana Gaudi and other Habana AI accelerator hardware, including collaboration with the team on MLPerf submissions. Previously, Basem was a senior performance architect at Samsung. Before Samsung, Basem was a senior systems engineer at Freescale. Basem holds a Ph.D. in Physics from the University of Houston.
Evelyn is a a senior machine learning engineer at Habana Labs and previously occupied the same role for Intel. She focuses on data analytics, machine learning, application development, optimization and deployment for at scale machine learning and deep learning projects. Previously, Evelyn held advanced software engineering roles at Heristar, ExxonMobil, Dow and ENGlobal, among other companies. Evelyn graduated with a Ph.D. in Electrical Engineering from the University of Sheffield in the United Kingdom and received her bachelor’s degree in Electrical Engineering from the University of Science and Technology in China.