North Carolina State University researchers have developed a technique that reduces training time for deep learning networks by more than 60 percent without sacrificing accuracy, accelerating the development of new artificial intelligence (AI) applications.
“Deep learning networks are at the heart of AI applications used in everything from self-driving cars to computer vision technologies,” says Xipeng Shen, a professor of computer science at NC State and co-author of a paper on the work.
“One of the biggest challenges facing the development of new AI tools is the amount of time and computing power it takes to train deep learning networks to identify and respond to the data patterns that are relevant to their applications. We’ve come up with a way to expedite that process, which we call Adaptive Deep Reuse. We have demonstrated that it can reduce training times by up to 69 percent without accuracy loss.”
Training a deep learning network involves breaking a data sample into chunks of consecutive data points. Think of a network designed to determine whether there is a pedestrian in a given image. The process starts by dividing a digital image into blocks of pixels that are adjacent to each other. Each chunk of data is run through a set of computational filters. The results are then run through a second set of filters. This continues iteratively until all of the data have been run through all of the filters, allowing the network to reach a conclusion about the data sample.
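As a rough illustration of that first step, the sketch below splits a small grayscale image into blocks of adjacent pixels and runs each block through a set of filters. It is a simplification under stated assumptions, not the researchers' code: the function names `split_into_blocks` and `apply_filters` are hypothetical, the blocks do not overlap (real convolutional layers usually slide overlapping windows), and each "filter" is reduced to a plain dot product.

```python
def split_into_blocks(image, block):
    """Cut a 2D image (list of rows) into non-overlapping block x block chunks.

    Each chunk is flattened into a list of adjacent pixel values.
    Non-overlapping tiling is an assumption made for brevity.
    """
    rows, cols = len(image), len(image[0])
    blocks = []
    for r in range(0, rows - block + 1, block):
        for c in range(0, cols - block + 1, block):
            blocks.append(
                [image[r + dr][c + dc] for dr in range(block) for dc in range(block)]
            )
    return blocks


def apply_filters(blocks, filters):
    """Run every chunk through every filter; here a filter is just a dot product."""
    return [
        [sum(w * x for w, x in zip(f, b)) for f in filters]
        for b in blocks
    ]


# A 4x4 toy image with pixel values 0..15, split into four 2x2 chunks.
image = [[4 * r + c for c in range(4)] for r in range(4)]
blocks = split_into_blocks(image, 2)
# One hypothetical filter that simply sums the pixels in a chunk.
responses = apply_filters(blocks, [[1, 1, 1, 1]])
```

In a real network the filter outputs would feed the next set of filters, and so on through the layers; the sketch stops after one layer.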
One complete pass of this process over every data sample in a data set is called an epoch. In order to fine-tune a deep learning network, the network will likely run through the same data set for hundreds of epochs. And many data sets consist of between tens of thousands and millions of data samples. Lots of iterations of lots of filters being applied to lots of data means that training a deep learning network takes a lot of computing power.
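To put a number on that, here is a back-of-the-envelope sketch. The figures are illustrative assumptions rather than values from the paper: 50,000 samples (the size of the Cifar10 training set) and 200 epochs.

```python
def sample_passes(num_samples, num_epochs):
    """Number of times a single data sample is pushed through the network."""
    return num_samples * num_epochs


# Assumed, illustrative figures: 50,000 samples trained for 200 epochs.
total = sample_passes(50_000, 200)
```

Each of those 10 million passes involves every chunk of the sample going through every filter in every layer, which is why the cost adds up so quickly.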
The breakthrough moment for Shen’s research team came when it realized that many of the data chunks in a data set are similar to each other. For example, a patch of blue sky in one image may be similar to a patch of blue sky elsewhere in the same image or to a patch of sky in another image in the same data set.
By recognizing these similar data chunks, a deep learning network could apply filters to one chunk of data and apply the results to all of the similar chunks of data in the same set, saving a lot of computing power.
“We were not only able to demonstrate that these similarities exist, but that we can find these similarities for intermediate results at every step of the process,” says Lin Ning, a Ph.D. student at NC State and lead author of the paper. “And we were able to maximize this efficiency by applying a method called locality sensitive hashing.”
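The idea of grouping similar chunks with locality sensitive hashing and computing a filter only once per group can be sketched as follows. This is a minimal illustration, not the method from the paper: it assumes one common LSH variant (random hyperplanes, where each bit records which side of a hyperplane a chunk falls on), it reuses the result of the first chunk in each bucket, and all function names are hypothetical.

```python
import random
from collections import defaultdict


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw random hyperplanes; each one contributes one bit to a chunk's hash."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def signature(chunk, hyperplanes):
    """Hash a chunk: bit i is 1 if the chunk lies on the positive side of plane i."""
    return tuple(
        1 if sum(h_i * x_i for h_i, x_i in zip(h, chunk)) >= 0 else 0
        for h in hyperplanes
    )


def apply_filter_with_reuse(chunks, weights, hyperplanes):
    """Apply one filter (a dot product) once per hash bucket and share the result."""
    buckets = defaultdict(list)
    for idx, chunk in enumerate(chunks):
        buckets[signature(chunk, hyperplanes)].append(idx)

    outputs = [0.0] * len(chunks)
    dot_products = 0
    for indices in buckets.values():
        # Assumption: the first chunk in the bucket stands in for the whole group.
        representative = chunks[indices[0]]
        result = sum(w * x for w, x in zip(weights, representative))
        dot_products += 1
        for idx in indices:
            outputs[idx] = result
    return outputs, dot_products


planes = make_hyperplanes(num_bits=8, dim=3)
# Two identical "blue sky" chunks and one chunk pointing the opposite way.
chunks = [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0], [-1.0, -2.0, -3.0]]
outputs, dot_products = apply_filter_with_reuse(chunks, [1.0, 0.0, 0.0], planes)
```

Here three chunks cost only two dot products, because the two identical chunks land in the same bucket. Using more hash bits makes buckets stricter (fewer chunks are treated as similar); using fewer bits makes them looser and saves more computation at the cost of accuracy.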
But this raises two additional questions. How large should each chunk of data be? And what threshold do data chunks need to meet in order to be deemed “similar”?
The researchers found that the most efficient approach was to begin by looking at relatively large chunks of data using a relatively low threshold for determining similarity. In subsequent epochs, the data chunks get smaller and the similarity threshold more stringent, improving the deep learning network’s accuracy. The researchers designed an adaptive algorithm that automatically implements these incremental changes during the training process.
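The coarse-to-fine progression described above might look something like the sketch below. It is only a stand-in: the article does not spell out how the adaptive algorithm decides when to tighten the settings, so this version assumes a fixed linear schedule, and the function name, the default chunk sizes, and the threshold values are all hypothetical.

```python
def reuse_schedule(epoch, total_epochs,
                   start_chunk=16, end_chunk=2,
                   start_threshold=0.5, end_threshold=0.95):
    """Return (chunk_size, similarity_threshold) for a given epoch.

    Early epochs use large chunks and a loose threshold (more reuse, less
    precision); later epochs use small chunks and a strict threshold.
    The linear interpolation is an assumption made for illustration.
    """
    progress = epoch / max(total_epochs - 1, 1)
    chunk = round(start_chunk + (end_chunk - start_chunk) * progress)
    threshold = start_threshold + (end_threshold - start_threshold) * progress
    return chunk, threshold


# Settings for each epoch of a hypothetical 10-epoch run.
schedule = [reuse_schedule(e, 10) for e in range(10)]
```

A truly adaptive version would presumably react to how training is going rather than to the epoch count alone, but the direction of change is the same: chunks shrink and the similarity test gets stricter over time.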
To evaluate their new technique, the researchers tested it using three deep learning networks and data sets that are widely used as testbeds by deep learning researchers: CifarNet using Cifar10; AlexNet using ImageNet; and VGG-19 using ImageNet.
Adaptive Deep Reuse cut training time for AlexNet by 69 percent; for VGG-19 by 68 percent; and for CifarNet by 63 percent – all without accuracy loss.
“This demonstrates that the technique drastically reduces training times,” says Hui Guan, a Ph.D. student at NC State and co-author of the paper. “It also indicates that the larger the network, the more Adaptive Deep Reuse is able to reduce training times – since AlexNet and VGG-19 are both substantially larger than CifarNet.”
“We think Adaptive Deep Reuse is a valuable tool, and look forward to working with industry and research partners to demonstrate how it can be used to advance AI,” Shen says.