Deep learning has created a resurgence of interest in neural networks and their application to everything from Internet search to self-driving cars. Results published in the scientific and technical literature show better-than-human accuracy on real-world tasks that include speech and facial recognition.
Fueled by modern massively parallel computing technology, it is now possible to train very complex multi-layer neural network architectures on large data sets to an acceptable degree of accuracy. This is referred to as deep-learning, as these multi-layer networks interpose many neuronal processing layers between the input data and the output results calculated by the neural network — hence the use of the word deep in the deep-learningcatchphrase. The resulting trained networks can be extremely valuable, as they have the ability to perform complex, real-world pattern recognition tasks very quickly on a variety of low-power devices including sensors, mobile phones, and FPGAs, as well as quickly and economically in the data center.
Generic applicability, high accuracy (sometimes better than human), and ability to be deployed nearly everywhere explains why scientists, technologists, entrepreneurs and companies are all scrambling to take advantage of deep-learning technology.
Machine learning went through a similar bandwagon stage in the 1980s where superlatives were lauded on the technology and futurists discussed how machine learning was going to change the world. The genesis of the 1980s machine- earning revolution was a seminal paper by Hopfield and Tank, “Neural Computation of Decisions in Optimization Problems,” which showed that good solutions to a wide class of combinatorial optimization problems could be found using networks of biology-inspired neurons. In particular, Hopfield and Tank demonstrated they could find good solutions to intractably large versions of the NP-Complete traveling salesman problem.
The advent of backpropagation by Rummelhart, Hinton and Williams allowed the adjustment of weights in a ‘neurone-like’ network so the network could be trained to solve a computational problem from example data. In particular, the ability of neural networks to adjust their weights to learn all the logic functions required to build a general purpose computer — including the non-linear XOR truth table — showed that artificial neural networks (ANNs) are computationally universal devices that can, in theory, be trained to solve any computable problem. I like to joke that machine learning made me one of the hardest working lazy men you would ever meet, as I was willing to work very hard to make the computer teach itself to solve a complex problem.
Nettalk, a beautiful example by Terry Sejnowski and Charles Rosenberg, showed that it was possible to teach a neural network to perform tasks at a human-level of complexity — specifically to read English text aloud. Even grade-school children immediately grasp the implications of machine learning through the NetTalk example, as people can literally hear the ANN learn to read aloud. Further, it is easy to show that the ANN had ‘learned’ a general solution to the problem of reading aloud, as it could correctly pronounce words that it had never seen before. I use NetTalk as a stellar example of how scientists can create simple and intuitively obvious examples to communicate their research to anyone.
The bandwagon faded for ANNs during the mid-1990s as overblown claims and a lull in the development of parallel computing exceeded both the patience of funding agencies and limited the size and complexity of the problems that could be addressed. Neural networks faded from the scientific limelight, while research continued to both expand and mature the technology. Still, examples such as Star Trek’s Commander Data preserved the popular perception of the potential of neural network technology.
The development of low-cost massively parallel devices like GPUs sparked a resurgence in the popularity of neural network research. Instead of spending $30M to purchase a 60 GF/s (billion flop/s) Connection Machine, modern researchers can now purchase a TF/s (trillion flop/s) capable GPU for around a hundred dollars. The parallel mapping pioneered by Farber on the Connection Machine at Los Alamos allows the computationally expensive training step to very efficiently map any SIMD architecture, be it a GPU or the vector architecture of an Intel Xeon or Intel Xeon Phi processor. Near-linear scalability in a distributed computing environment means that most computational clusters can achieve very efficient, near-peak performance during the training phase. For example, the 1980s mapping used on a Connection Machine was able to achieve over 13 PF/s (1015 flop/s) average sustained performance on the OakRidge National Laboratory Titan supercomputer. The ability to run efficiently on large numbers of either vector or GPU devices means that researchers can work with complex neural networks and large data sets to solve problems — sometimes as well or better than humans.
“The ability to run efficiently on large numbers of either vector or GPU devices means that researchers can work with complex neural networks and large data sets to solve problems — sometimes as well or better than humans.”
Convolutional neural networks (CNNs) are a form of ‘deep’ neural network architecture popularized by Yann LeCun and others in 1998. CNNs are behind many of the deep-learning successes that have been reported recently in image and speech recognition. Again inspired by biology, these neural networks find features in the data that permit correct classification or recognition of the training images without the help of a human. The lack of dependence on prior knowledge and human effort is considered a major advantage of CNNs over other approaches.
The Latest on: Deep Learning
via Google News
The Latest on: Deep Learning
- Feature Engineering and Deep Learning for Stereo Matching Under Adverse Driving Conditionson April 30, 2021 at 8:01 am
Stereo matching is a challenging research topic in driving assistance systems. Existing stereo matching methods work well under normal day-light conditions. However, they fail to operate under adverse ...
- Dynamic Yield’s Deep Learning Product Recommendations Generate Exponential Revenue Returnson April 30, 2021 at 6:37 am
Customers implementing the self-training algorithm have witnessed double-digit uplifts in purchases and incremental revenue, compared to other personalized recommendation strategies New York, April 28 ...
- Deep learning model to predict fracture mechanisms of grapheneon April 30, 2021 at 6:29 am
Understanding fracture is critical to the design of resilient nanomaterials. Molecular dynamics offers a way to study fracture at an atomistic level, but is computationally expensive with limitations ...
- COVID-CT-MD, COVID-19 computed tomography scan dataset applicable in machine learning and deep learningon April 29, 2021 at 12:03 pm
Low Dose Computed Tomography of the Chest • viral infectious disease Technology Type(s) digital curation • image processing technique Factor Type(s) sex • gender • age group • weight • clinical charac ...
- Deep-Learning Business Acquisition Makes Nano Dimension a Strong Buyon April 29, 2021 at 4:00 am
Some folks might not like Nano Dimensions’ aggressive pursuit of acquisitions. And, it is true that acquiring businesses is an expensive strategy. I’ll leave it to you to decide whether the company’s ...
- Deep Learning for Natural Language Processing - Lectures 2021on April 25, 2021 at 10:48 am
Deep Learning for Natural Language Processing - Lectures 2021 - dl4nlp-tuda2021/deep-learning-for-nlp-lectures ...
- Deep-learning models code more like humanson April 22, 2021 at 10:03 pm
Control Engineering - Whatever business a company may be in, software plays an increasingly vital role, from managing inventory to interfacing with customers. Software ...
- Deep Instinct Raises $100M for its Cybersecurity Threat Prevention Platform Built on Deep Learningon April 22, 2021 at 6:07 am
Deep Instinct, cofounded by Guy Caspi, is an end-to-end cybersecurity detection platform that leverages deep learning to process threats faster than other solutions.
- Best of arXiv.org for AI, Machine Learning, and Deep Learning – March 2021on April 21, 2021 at 6:00 am
In this recurring monthly feature, we will filter all the recent research papers appearing in the arXiv.org preprint server for subjects relating to AI, machine learning and deep learning – from ...
- Deep Learning Market Research Report by Trends, by Type, by Application – Global Forecast to 2025 – Cumulative Impact of COVID-19on April 19, 2021 at 3:57 am
The Deep Learning Market is expected to exceed more than US$ 18 Billion by 2024 at a CAGR of 42% in the given forecast period. The report covers detailed competitive outlook including the market share ...
via Bing News