Image-processing system learns largely on its own, much like a human baby
Neuroscience and artificial intelligence experts from Rice University and Baylor College of Medicine have taken inspiration from the human brain in creating a new “deep learning” method that enables computers to learn about the visual world largely on their own, much as human babies do.
In tests, the group’s “deep rendering mixture model” largely taught itself how to distinguish handwritten digits using a standard dataset of 10,000 digits written by federal employees and high school students. In results presented this month at the Neural Information Processing Systems (NIPS) conference in Barcelona, Spain, the researchers described how they trained their algorithm by giving it just 10 correct examples of each handwritten digit between zero and nine and then presenting it with several thousand more examples that it used to further teach itself. In tests, the algorithm was more accurate at correctly distinguishing handwritten digits than almost all previous algorithms that were trained with thousands of correct examples of each digit.
“In deep-learning parlance, our system uses a method known as semisupervised learning,” said lead researcher Ankit Patel, an assistant professor with joint appointments in neuroscience at Baylor and electrical and computer engineering at Rice. “The most successful efforts in this area have used a different technique called supervised learning, where the machine is trained with thousands of examples: This is a one. This is a two.
“Humans don’t learn that way,” Patel said. “When babies learn to see during their first year, they get very little input about what things are. Parents may label a few things: ‘Bottle. Chair. Momma.’ But the baby can’t even understand spoken words at that point. It’s learning mostly unsupervised via some interaction with the world.”
Patel said he and graduate student Tan Nguyen, a co-author on the new study, set out to design a semisupervised learning system for visual data that didn’t require much “hand-holding” in the form of training examples. For instance, neural networks that use supervised learning would typically be given hundreds or even thousands of training examples of handwritten digits before they would be tested on the database of 10,000 handwritten digits in the Mixed National Institute of Standards and Technology (MNIST) database.
The semisupervised Rice-Baylor algorithm is a “convolutional neural network,” a piece of software made up of layers of artificial neurons whose design was inspired by biological neurons. These artificial neurons, or processing units, are organized in layers, and the first layer scans an image and does simple tasks like searching for edges and color changes. The second layer examines the output from the first layer and searches for more complex patterns. Mathematically, this nested method of looking for patterns within patterns within patterns is referred to as a nonlinear process.
“It’s essentially a very simple visual cortex,” Patel said of the convolutional neural net. “You give it an image, and each layer processes the image a little bit more and understands it in a deeper way, and by the last layer, you’ve got a really deep and abstract understanding of the image. Every self-driving car right now has convolutional neural nets in it because they are currently the best for vision.”
Like human brains, neural networks start out as blank slates and become fully formed as they interact with the world. For example, each processing unit in a convolutional net starts the same and becomes specialized over time as they are exposed to visual stimuli.
“Edges are very important,” Nguyen said. “Many of the lower layer neurons tend to become edge detectors. They’re looking for patterns that are both very common and very important for visual interpretation, and each one trains itself to look for a specific pattern, like a 45-degree edge or a 30-degree red-to-blue transition.
“When they detect their particular pattern, they become excited and pass that on to the next layer up, which looks for patterns in their patterns, and so on,” he said. “The number of times you do a nonlinear transformation is essentially the depth of the network, and depth governs power. The deeper a network is, the more stuff it’s able to disentangle. At the deeper layers, units are looking for very abstract things like eyeballs or vertical grating patterns or a school bus.”
Nguyen began working with Patel in January as the latter began his tenure-track academic career at Rice and Baylor. Patel had already spent more than a decade studying and applying machine learning in jobs ranging from high-volume commodities training to strategic missile defense, and he’d just wrapped up a four-year postdoctoral stint in the lab of Rice’s Richard Baraniuk, another co-author on the new study. In late 2015, Baraniuk, Patel and Nguyen published the first theoretical framework that could both derive the exact structure of convolutional neural networks and provide principled solutions to alleviate some of their limitations.
Baraniuk said a solid theoretical understanding is vital for designing convolutional nets that go beyond today’s state-of-the-art.
“Understanding video images is a great example,” Baraniuk said. “If I am looking at a video, frame by frame by frame, and I want to understand all the objects and how they’re moving and so on, that is a huge challenge. Imagine how long it would take to label every object in every frame of a video. No one has time for that. And in order for a machine to understand what it’s seeing in a video, it has to understand what objects are, the concept of three-dimensional space and a whole bunch of other really complicated stuff. We humans learn those things on our own and take them for granted, but they are totally missing in today’s artificial neural networks.”
Patel said the theory of artificial neural networks, which was refined in the NIPS paper, could ultimately help neuroscientists better understand the workings of the human brain.
“There seem to be some similarities about how the visual cortex represents the world and how convolutional nets represent the world, but they also differ greatly,” Patel said. “What the brain is doing may be related, but it’s still very different. And the key thing we know about the brain is that it mostly learns unsupervised.
“What I and my neuroscientist colleagues are trying to figure out is, What is the semisupervised learning algorithm that’s being implemented by the neural circuits in the visual cortex? and How is that related to our theory of deep learning?” he said. “Can we use our theory to help elucidate what the brain is doing? Because the way the brain is doing it is far superior to any neural network that we’ve designed.”
Receive an email update when we add a new SEMISUPERVISED LEARNING article.
The Latest on: Semisupervised learning
via Google News
The Latest on: Semisupervised learning
- Truly Autonomous AI Not Practical for Most Businesseson February 18, 2021 at 8:50 pm
Truly Autonomous AI Not Practical for Most Businesses By Scott Horn, CMO, 7 - On the heels of 2016’s announcements of Salesforce Einstein, Facebook Messenger chatbots and the new Partnership on ...
- Four Ways AI And Machine Learning Will Drive Future Innovation And Changeon February 18, 2021 at 5:40 am
AI and ML technologies are more than buzzwords or simple predictions: They offer businesses limitless possibilities to evolve use cases to improve productivity, expand their customer base, boost ROI ...
- Artificial Intelligence Supporting New Developments in Materials Scienceon February 18, 2021 at 3:09 am
Artificial Intelligence in discovering composite materials is a developing area in materials science. Machine learning algorithms can be used in virtual computations to create combinations for new ...
- Scalarr raises $7.5 million to expand mobile ad fraud productson February 16, 2021 at 10:48 am
Scalarr, the startup that fights ad fraud using machine learning has just raised $7.5 million in a Series A funding round, led by the European Bank of Reconstruction and Development, with ...
- Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastomaon February 16, 2021 at 3:53 am
Radiogenomics uses machine-learning (ML) to directly connect the morphologic and physiological appearance of tumors on clinical imaging with underlying genomic features. Despite extensive growth in ...
- Big Data Processing with Apache Spark - Part 4: Spark Machine Learningon February 15, 2021 at 3:59 pm
Supervised learning Unsupervised learning Semi-supervised Learning Reinforcement learning Let’s briefly look at each of these machine learning models and how they compare with each other.
- San Diego Supercomputer Center Helps Advance Computational Chemistryon February 9, 2021 at 8:04 am
Even though computational chemistry represents a challenging arena for machine learning, a team of researchers from the Massachusetts ...
- The Garrison Platoon Of Books: How To Read 43 Machine Learning Books in a Yearon February 1, 2021 at 1:31 am
Reading machine learning books should be more like riding a bike or driving a car. The fundamental building blocks for reading many books come from concentration and focus.
- The three ways Alexa is going to get smarter in the next decadeon January 28, 2021 at 6:20 am
Making machines artificially intelligent is a time-consuming practice of collecting and manually labelling data for AIs to learn from. Today, researchers are enabling machines to learn new ...
via Bing News