Cut the carbon footprint of AI training by up to 75% without new hardware or infrastructure

A variety of common deep learning models benefit from Zeus’ ability to tune GPU power limits and the training batch size. When both parameters were tuned, the software achieved up to 75% energy reduction.

Image credit: SymbioticLab, University of Michigan

Deep learning models that power giants like TikTok and Amazon, as well as tools like ChatGPT, could save energy without new hardware or infrastructure.

A new way to optimize the training of deep learning models, a rapidly evolving tool for powering artificial intelligence, could slash AI’s energy demands.

Developed at the University of Michigan, the open-source optimization framework studies deep learning models during training, pinpointing the best tradeoff between energy consumption and the speed of the training.

“At extreme scales, training the GPT-3 model just once consumes 1,287 MWh, which is enough to supply an average U.S. household for 120 years,” said Mosharaf Chowdhury, an associate professor of electrical engineering and computer science.
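
As a rough check of the arithmetic behind that comparison, the two figures in the quote imply an annual household consumption of about 10.7 MWh. The short calculation below uses only the numbers quoted above; the variable names are illustrative.

```python
# Figures taken from the quote above.
gpt3_training_mwh = 1_287      # energy to train GPT-3 once, in MWh
household_years = 120          # years of supply for an average U.S. household

# Implied annual consumption of one household.
annual_mwh = gpt3_training_mwh / household_years
annual_kwh = annual_mwh * 1_000

print(f"{annual_mwh:.3f} MWh per year ({annual_kwh:,.0f} kWh)")
```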

With Zeus, the new energy optimization framework developed by Chowdhury and his team, figures like this could be reduced by up to 75% without any new hardware—and with only minor impacts on the time it takes to train a model. It was presented at the 2023 USENIX Symposium on Networked Systems Design and Implementation (NSDI), in Boston.

Mainstream uses for hefty deep learning models have exploded over the past three years, ranging from image-generation models and expressive chatbots to the recommender systems powering TikTok and Amazon. With cloud computing already out-emitting commercial aviation, the increased climate burden from artificial intelligence is a significant concern.

“Existing work primarily focuses on optimizing deep learning training for faster completion, often without considering the impact on energy efficiency,” said Jae-Won Chung, a doctoral student in computer science and engineering and co-first author of the study. “We discovered that the energy we’re pouring into GPUs is giving diminishing returns, which allows us to reduce energy consumption significantly, with relatively little slowdown.”

Deep learning is a family of techniques that use multilayered artificial neural networks, also known as deep neural networks (DNNs), to tackle a range of common machine learning tasks. The models themselves are extremely complex, learning from some of the most massive data sets ever used in machine learning. Because of this, they benefit greatly from the multitasking capabilities of graphics processing units (GPUs), which burn through 70% of the power that goes into training one of these models.

Zeus uses two software knobs to reduce energy consumption. One is the GPU power limit, which lowers a GPU’s power use while slowing down the model’s training until the setting is adjusted again. The other is the deep learning model’s batch size parameter, which controls how many samples from the training data the model works through before updating the way the model represents the relationships it finds in the data. Larger batch sizes reduce training time, but at the cost of increased energy consumption.

Zeus is able to tune each of these settings in real time, seeking the optimal tradeoff point at which energy usage is minimized with as little impact on training time as possible. In examples, the team was able to visually demonstrate this tradeoff point by showing every possible combination of these two parameters. While that level of thoroughness won’t happen in practice with a particular training job, Zeus will take advantage of the repetitive nature of machine learning to come very close.
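
The exhaustive view of the tradeoff described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy `simulate` model, the grid of power limits and batch sizes, and the weighted cost with its `eta` parameter are hypothetical and are not taken from Zeus itself. The model is only shaped to match the behavior the article describes: lower power limits save energy but slow training, and larger batches finish sooner but use more energy.

```python
from itertools import product

POWER_LIMITS_W = [100, 150, 200, 250, 300]  # hypothetical GPU power limits
BATCH_SIZES = [32, 64, 128, 256, 512]       # hypothetical batch sizes
MAX_POWER_W = max(POWER_LIMITS_W)
BASE_TIME_S = 3600.0  # made-up job time at full power and full utilization


def simulate(power_limit_w, batch_size):
    """Toy model (not measured data): return (time_s, energy_j) for one job."""
    # Throughput rises with the power limit, but with diminishing returns.
    power_factor = (power_limit_w / MAX_POWER_W) ** 0.5
    # Larger batches keep the GPU busier, so training finishes sooner...
    utilization = batch_size / (batch_size + 64)
    time_s = BASE_TIME_S / (power_factor * utilization)
    # ...but a busier GPU is assumed to draw more power on average.
    avg_power_w = power_limit_w * utilization ** 2
    energy_j = avg_power_w * time_s
    return time_s, energy_j


def cost(power_limit_w, batch_size, eta):
    """Blend energy and time; eta=1 weighs only energy, eta=0 only time."""
    time_s, energy_j = simulate(power_limit_w, batch_size)
    # Time is scaled by the maximum power so both terms are in joules.
    return eta * energy_j + (1 - eta) * MAX_POWER_W * time_s


def best_config(eta):
    """Exhaustively sweep every (power limit, batch size) pair."""
    return min(product(POWER_LIMITS_W, BATCH_SIZES),
               key=lambda cfg: cost(cfg[0], cfg[1], eta))


for eta in (0.0, 0.5, 1.0):
    print(eta, best_config(eta))
```

With `eta` set to 0 the sweep favors the fastest configuration, and with `eta` set to 1 it favors the most frugal one; values in between pick a compromise. A full sweep like this is only practical in a demonstration, which is why the article notes it would not happen for a real training job.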

“Fortunately, companies train the same DNN over and over again on newer data, as often as every hour. We can learn about how the DNN behaves by observing across those recurrences,” said Jie You, a recent doctoral graduate in computer science and engineering and co-lead author of the study.
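
One way to picture learning across recurrences is a simple explore-and-exploit loop: each time the job recurs, try one batch size, record the observed cost, and gradually settle on the cheapest option. The sketch below uses an epsilon-greedy strategy as a stand-in; it is not presented as the method Zeus actually uses, and the cost table, noise level, and function name are all hypothetical.

```python
import random


def run_recurrences(true_cost, n_jobs=500, epsilon=0.1, noise=0.3, seed=0):
    """Epsilon-greedy search over batch sizes across recurring training jobs.

    true_cost maps batch size -> hypothetical mean cost of one job.
    Returns the batch size with the lowest estimated cost.
    """
    rng = random.Random(seed)
    arms = list(true_cost)
    total = {b: 0.0 for b in arms}
    count = {b: 0 for b in arms}

    def mean(b):
        return total[b] / count[b]

    for job in range(n_jobs):
        if job < len(arms):
            choice = arms[job]            # try every batch size once
        elif rng.random() < epsilon:
            choice = rng.choice(arms)     # occasionally explore
        else:
            choice = min(arms, key=mean)  # otherwise reuse the best so far
        # Observe a noisy cost for this recurrence of the job.
        observed = true_cost[choice] + rng.gauss(0, noise)
        total[choice] += observed
        count[choice] += 1

    return min(arms, key=mean)


# Made-up costs (arbitrary units) for five candidate batch sizes.
HYPOTHETICAL_COST = {32: 10.0, 64: 8.0, 128: 6.0, 256: 7.0, 512: 9.0}
best = run_recurrences(HYPOTHETICAL_COST)
print("Estimated best batch size:", best)
```

Because the same job recurs many times, the occasional exploratory run costs little compared with the savings from settling on a good configuration.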

Zeus is the first framework designed to plug into existing workflows for a variety of machine learning tasks and GPUs, reducing energy consumption without requiring any changes to a system’s hardware or datacenter infrastructure.

In addition, the team has developed complementary software that they layer on top of Zeus to reduce the carbon footprint further. This software, called Chase, privileges speed when low-carbon energy is available, and chooses efficiency at the expense of speed during peak times, which are more likely to require ramping up carbon-intensive energy generation such as coal. Chase took second place at last year’s CarbonHack hackathon and is to be presented May 4 at the International Conference on Learning Representations Workshop.

“It is not always possible to readily migrate DNN training jobs to other locations due to large dataset sizes or data regulations,” said Zhenning Yang, a master’s student in computer science and engineering. “Deferring training jobs to greener time frames may not be an option either, since DNNs must be trained with the most up-to-date data and quickly deployed to production to achieve the highest accuracy.

“Our aim is to design and implement solutions that do not conflict with these realistic constraints, while still reducing the carbon footprint of DNN training.”
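
The behavior described for Chase, favoring speed when low-carbon energy is available and efficiency otherwise, can be sketched as a simple threshold rule. This is a hypothetical illustration only: the function name, the threshold of 300 gCO2/kWh, the two power limits, and the hourly intensity values are all assumptions, not details of Chase.

```python
def pick_power_limit(carbon_intensity_g_per_kwh,
                     threshold_g_per_kwh=300.0,
                     fast_limit_w=300,
                     efficient_limit_w=150):
    """Choose a GPU power limit from the grid's current carbon intensity.

    All default values are made up for illustration.
    """
    if carbon_intensity_g_per_kwh < threshold_g_per_kwh:
        return fast_limit_w       # clean energy available: favor speed
    return efficient_limit_w      # carbon-intensive period: favor efficiency


# Hypothetical hourly carbon intensity readings, in gCO2 per kWh.
hourly_intensity = [180.0, 220.0, 410.0, 520.0, 290.0]
schedule = [pick_power_limit(ci) for ci in hourly_intensity]
print(schedule)
```

Note that a rule like this keeps the job running in place and on time, consistent with the constraints described in the quote above: it adjusts only how much power the job draws, rather than moving or deferring it.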

Original Article: Optimization could cut the carbon footprint of AI training by up to 75%
