Researchers at the U.S. Army Research Laboratory and the University of Texas at Austin have developed new techniques for robots or computer programs to learn how to perform tasks by interacting with a human instructor.
The findings of the study will be presented and published at the Association for the Advancement of Artificial Intelligence Conference in New Orleans, Louisiana, Feb. 2-7.
ARL and UT researchers considered a specific case where a human provides real-time feedback in the form of critique. First introduced by collaborator Dr. Peter Stone, a professor at the University of Texas at Austin, along with his former doctoral student, Brad Knox, as TAMER, or Training an Agent Manually via Evaluative Reinforcement, the ARL/UT team developed a new algorithm called Deep TAMER.
It is an extension of TAMER that uses deep learning – a class of machine learning algorithms that are loosely inspired by the brain to provide a robot the ability to learn how to perform tasks by viewing video streams in a short amount of time with a human trainer.
According to Army researcher Dr. Garrett Warnell, the team considered situations where a human teaches an agent how to behave by observing it and providing critique, for example, “good job” or “bad job” -similar to the way a person might train a dog to do a trick. Warnell said the researchers extended earlier work in this field to enable this type of training for robots or computer programs that currently see the world through images, which is an important first step in designing learning agents that can operate in the real world.
Many current techniques in artificial intelligence require robots to interact with their environment for extended periods of time to learn how to optimally perform a task. During this process, the agent might perform actions that may not only be wrong, like a robot running into a wall for example, but catastrophic like a robot running off the side of a cliff. Warnell said help from humans will speed things up for the agents, and help them avoid potential pitfalls.
As a first step, the researchers demonstrated Deep TAMER’s success by using it with 15 minutes of human-provided feedback to train an agent to perform better than humans on the Atari game of bowling – a task that has proven difficult for even state-of-the-art methods in artificial intelligence. Deep-TAMER-trained agents exhibited superhuman performance, besting both their amateur trainers and, on average, an expert human Atari player.
Within the next one to two years, researchers are interested in exploring the applicability of their newest technique in a wider variety of environments: for example, video games other than Atari Bowling and additional simulation environments to better represent the types of agents and environments found when fielding robots in the real world.
Their work will be published in the AAAI 2018 conference proceedings.
“The Army of the future will consist of Soldiers and autonomous teammates working side-by-side,” Warnell said. “While both humans and autonomous agents can be trained in advance, the team will inevitably be asked to perform tasks, for example, search and rescue or surveillance, in new environments they have not seen before. In these situations, humans are remarkably good at generalizing their training, but current artificially-intelligent agents are not.”
Deep TAMER is the first step in a line of research its researchers envision will enable more successful human-autonomy teams in the Army. Ultimately, they want autonomous agents that can quickly and safely learn from their human teammates in a wide variety of styles such as demonstration, natural language instruction and critique.
Learn more: Army researchers develop new algorithms to train robots
The Latest on: Deep Learning
[google_news title=”” keyword=”Deep Learning” num_posts=”10″ blurb_length=”0″ show_thumb=”left”]
via Google News
The Latest on: Deep Learning
- New Deep Instinct AI assistant bridges the gap in malware analysison May 1, 2024 at 6:01 am
DIANNA integrates with Deep Instinct’s deep learning-powered prevention-first capabilities to provide in-depth insights into known and unknown attack behavior through static analysis. DIANNA doesn’t ...
- Deep Instinct Launches Advanced, GenAI-Based Security Analysis Assistant for Unknown Threatson April 30, 2024 at 5:00 pm
AI-based deep learning (DL) framework, is unveiling Deep Instinct’s Artificial Neural Network Assistant (DIANNA), a prevention-first, generative AI (GenAI) cyber companion designed to offer ...
- Deep Learning and Neural Networks Drive a Potential $7.9 Trillion AI Economyon April 29, 2024 at 12:30 pm
As artificial intelligence (AI) continues to permeate the corporate landscape, its potential economic impact is ...
- Intel quietly launched mysterious new AI CPU that promises to bring deep learning inference and computing to the edge — but you won't be able to plug them in a motherboard ...on April 27, 2024 at 5:33 am
I ntel has launched a new AI processor series for the edge, promising industrial-class deep learning inference. The new ‘Amston Lake’ Atom x7000RE chips offer up to double the cores and twice the ...
- New multi-task deep learning framework integrates large-scale single-cell proteomics and transcriptomics dataon April 26, 2024 at 7:35 am
The exponential progress in single-cell multi-omics technologies has led to the accumulation of large and diverse multi-omics datasets. However, the integration of single-cell proteomics and ...
- Europe taps deep learning to make industrial robots safer colleagueson April 26, 2024 at 1:07 am
European researchers have launched the RoboSAPIENS project to make adaptive industrial robots more efficient and safer to work with humans.
- AI-powered 'deep medicine' could transform health care in the NHS and reconnect staff with their patientson April 25, 2024 at 10:20 am
Today's NHS faces severe time constraints, with the risk of short consultations and concerns about the risk of misdiagnosis or delayed care. These challenges are compounded by limited resources and ...
- Researchers develop deep learning alternative to monitoring laser powder bed fusionon April 24, 2024 at 9:12 am
Many things can go wrong when additively manufacturing (AM) metal and without in-situ process monitoring, defects can only be detected and characterized after a product is built. Most commonly, ...
- Deep learning predicts heart arrhrythmia 30 minutes in advanceon April 23, 2024 at 4:15 am
Atrial fibrillation is the most common cardiac arrhythmia worldwide with around 59 million people concerned in 2019. This irregular heartbeat is associated with increased risks of heart failure, ...
via Bing News