
Teaching robots to learn new tricks faster with dog training methods
With a training technique commonly used to teach dogs to sit and stay, Johns Hopkins University computer scientists showed a robot how to teach itself several new tricks, including stacking blocks. With the method, the robot, named Spot, was able to learn in days what typically takes a month.
By using positive reinforcement, an approach familiar to anyone who’s used treats to change a dog’s behavior, the team dramatically improved the robot’s skills and did it quickly enough to make training robots for real-world work a more feasible enterprise. The findings are newly published in a paper called “Good Robot!”
“The question here was how do we get the robot to learn a skill?” said lead author Andrew Hundt, a PhD student working in Johns Hopkins’ Computational Interaction and Robotics Laboratory. “I’ve had dogs so I know rewards work and that was the inspiration for how I designed the learning algorithm.”
Unlike humans and animals that are born with highly intuitive brains, computers are blank slates and must learn everything from scratch. But true learning is often accomplished with trial and error, and roboticists are still figuring out how robots can learn efficiently from their mistakes.
The team accomplished that here by devising a reward system that works for a robot the way treats work for a dog. Where a dog might get a cookie for a job well done, the robot earned numeric points.
Hundt recalled how he once taught his terrier mix puppy named Leah the command “leave it,” so she could ignore squirrels on walks. He used two types of treats, ordinary trainer treats and something even better, like cheese. When Leah was excited and sniffing around the treats, she got nothing. But when she calmed down and looked away, she got the good stuff. “That’s when I gave her the cheese and said, ‘Leave it! Good Leah!’”
Similarly, to stack blocks, Spot the robot needed to learn how to focus on constructive actions. As the robot explored the blocks, it quickly learned that correct behaviors for stacking earned high points, but incorrect ones earned nothing. Reach out but don’t grasp a block? No points. Knock over a stack? Definitely no points. Spot earned the most by placing the last block on top of a four-block stack.
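The point system described above can be pictured with a short sketch. This is only an illustration under stated assumptions, not the scoring used in the “Good Robot!” paper: the `StepOutcome` fields, the `reward` function, the one-point credit for a grasp, and the rule that a placement scores the new stack height are all hypothetical. The article itself says only that failed or destructive actions earn nothing and that finishing the stack earns the most.

```python
from dataclasses import dataclass


@dataclass
class StepOutcome:
    """Hypothetical summary of one robot action and its effect on the stack."""
    grasped: bool              # did the gripper actually pick up a block?
    placed: bool               # did the robot attempt to place the block?
    stack_height_before: int   # blocks in the stack before the action
    stack_height_after: int    # blocks in the stack after the action


def reward(outcome: StepOutcome) -> float:
    """Return illustrative points for one action (assumed values, not the paper's)."""
    # Knocking the stack over undoes progress: no points.
    if outcome.stack_height_after < outcome.stack_height_before:
        return 0.0
    # Reaching out without grasping a block: no points.
    if not outcome.grasped:
        return 0.0
    # A placement that makes the stack taller scores the new height,
    # so completing a taller stack earns the most (assumed scaling).
    if outcome.placed and outcome.stack_height_after > outcome.stack_height_before:
        return float(outcome.stack_height_after)
    # A successful grasp alone earns a small credit (assumption).
    return 1.0


if __name__ == "__main__":
    missed_grasp = StepOutcome(False, False, 2, 2)
    toppled = StepOutcome(True, True, 3, 1)
    final_block = StepOutcome(True, True, 3, 4)
    for name, step in [("missed grasp", missed_grasp),
                       ("toppled stack", toppled),
                       ("final block", final_block)]:
        print(name, reward(step))
```

Because zero-point outcomes give the learner nothing to chase, the highest-scoring sequence of actions in this sketch is the one that builds the stack without ever toppling it.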
The training tactic not only worked; it took just days to teach the robot what used to take weeks. The team was able to reduce the practice time by first training a simulated robot, which is a lot like a video game, then running tests with Spot.
“The robot wants the higher score,” Hundt said. “It quickly learns the right behavior to get the best reward. In fact, it used to take a month of practice for the robot to achieve 100% accuracy. We were able to do it in two days.”
Positive reinforcement did more than help the robot teach itself to stack blocks. With the point system, the robot learned several other tasks just as quickly, even how to play a simulated navigation game. The ability to learn from mistakes in all types of situations is critical for designing a robot that could adapt to new environments.
“At the start the robot has no idea what it’s doing but it will get better and better with each practice. It never gives up and keeps trying to stack and is able to finish the task 100% of the time,” Hundt said.
The team imagines these findings could help train household robots to do laundry and wash dishes – tasks that could be popular on the open market and help seniors live independently. The approach could also help design improved self-driving cars.
“Our goal is to eventually develop robots that can do complex tasks in the real world — like product assembly, caring for the elderly and surgery,” said co-author Gregory D. Hager, the Mandell Bellmore Professor of Computer Science. “We don’t currently know how to program tasks like that — the world is too complex. But work like this shows us that there is promise to the idea that robots can learn how to accomplish such real-world tasks in a safe and efficient way.”
The co-authors included Johns Hopkins graduate students Andrew Hundt, Benjamin Killeen, Nicholas Greene, Heeyeon Kwon, and Hongtao Wu, and former graduate student Chris Paxton.