Princeton researchers have found that human-language descriptions of tools can accelerate the learning of a simulated robotic arm lifting and using a variety of tools. Animation by the researchers; GIF by Neil Adelantar
Exploring a new way to teach robots, Princeton researchers have found that human-language descriptions of tools can accelerate the learning of a simulated robotic arm lifting and using a variety of tools.
The results build on evidence that providing richer information during artificial intelligence (AI) training can make autonomous robots more adaptive to new situations, improving their safety and effectiveness.
Adding descriptions of a tool’s form and function to the robot’s training improved its ability to manipulate newly encountered tools that were not in the original training set. A team of mechanical engineers and computer scientists presented the new method, Accelerated Learning of Tool Manipulation with LAnguage, or ATLA, at the Conference on Robot Learning on Dec. 14.
Robotic arms have great potential to help with repetitive or challenging tasks, but training robots to manipulate tools effectively is difficult: Tools have a wide variety of shapes, and a robot’s dexterity and vision are no match for a human’s.
“Extra information in the form of language can help a robot learn to use the tools more quickly,” said study coauthor Anirudha Majumdar, an assistant professor of mechanical and aerospace engineering at Princeton who leads the Intelligent Robot Motion Lab.
The team obtained tool descriptions by querying GPT-3, a large language model released by OpenAI in 2020 that uses a form of AI called deep learning to generate text in response to a prompt. After experimenting with various prompts, they settled on using “Describe the [feature] of [tool] in a detailed and scientific response,” where the feature was the shape or purpose of the tool.
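The prompt template quoted above can be filled in mechanically for each tool and feature. The following is a minimal sketch of that step only, not the researchers' code: the names `PROMPT_TEMPLATE`, `FEATURES` and `build_prompts` are hypothetical, the choice of "an" or "a" before each tool name is an assumption, and the call to the language model itself is left out.

```python
# Template as quoted in the article; the two bracketed slots become
# format fields. Whether the original prompt ended with a period is unknown.
PROMPT_TEMPLATE = "Describe the {feature} of {tool} in a detailed and scientific response"

# The article names two features: the shape and the purpose of the tool.
FEATURES = ("shape", "purpose")


def build_prompts(tools):
    """Return {tool: {feature: prompt}} for every tool and feature (hypothetical helper)."""
    return {
        tool: {
            feature: PROMPT_TEMPLATE.format(feature=feature, tool=tool)
            for feature in FEATURES
        }
        for tool in tools
    }


# Tool names come from examples in the article; the leading articles are our assumption.
prompts = build_prompts(["an axe", "a squeegee", "a crowbar"])
print(prompts["a crowbar"]["shape"])
```

Each resulting string would then be sent to the language model, and the returned text paired with the corresponding tool as its description.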
“Because these language models have been trained on the internet, in some sense you can think of this as a different way of retrieving that information,” said Karthik Narasimhan, an assistant professor of computer science and coauthor of the study. Querying the model is more efficient and comprehensive than crowdsourcing or scraping specific websites for tool descriptions, he said. Narasimhan is a lead faculty member in Princeton’s natural language processing (NLP) group, and contributed to the original GPT language model as a visiting research scientist at OpenAI.
This work is the first collaboration between Narasimhan’s and Majumdar’s research groups. Majumdar focuses on developing AI-based policies to help robots — including flying and walking robots — generalize their functions to new settings, and he was curious about the potential of recent “massive progress in natural language processing” to benefit robot learning, he said.
For their simulated robot learning experiments, the team selected a training set of 27 tools, ranging from an axe to a squeegee. They gave the robotic arm four different tasks: push the tool, lift the tool, use it to sweep a cylinder along a table, or hammer a peg into a hole. The researchers developed a suite of policies using machine learning training approaches with and without language information, and then compared the policies’ performance on a separate test set of nine tools with paired descriptions.
This approach is known as meta-learning, since the robot improves its ability to learn with each successive task. It’s not only learning to use each tool, but also “trying to learn to understand the descriptions of each of these hundred different tools, so when it sees the 101st tool it’s faster in learning to use the new tool,” said Narasimhan. “We’re doing two things: We’re teaching the robot how to use the tools, but we’re also teaching it English.”
The researchers measured the robot’s success in pushing, lifting, sweeping and hammering with the nine test tools, comparing policies trained with language information against policies trained without it. In most cases, the language information offered significant advantages for the robot’s ability to use new tools.
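A comparison of this kind comes down to tallying success rates per task for each type of policy. The sketch below shows one way such a tally could be computed. It is an illustration under assumptions, not the study's evaluation code: the record format `(task, used_language, succeeded)` and the function names are hypothetical, and no numbers from the study are included.

```python
from collections import defaultdict


def success_rates(trials):
    """Aggregate (task, used_language, succeeded) records into success rates.

    Returns {task: {True: rate_with_language, False: rate_without_language}}.
    A rate is None when no trials were recorded for that condition.
    """
    # counts[task][used_language] holds [successes, total_trials]
    counts = defaultdict(lambda: {True: [0, 0], False: [0, 0]})
    for task, used_language, succeeded in trials:
        bucket = counts[task][bool(used_language)]
        bucket[0] += int(bool(succeeded))
        bucket[1] += 1
    return {
        task: {
            flag: (wins / total if total else None)
            for flag, (wins, total) in by_flag.items()
        }
        for task, by_flag in counts.items()
    }


def language_advantage(rates):
    """Return {task: rate_with_language - rate_without}, skipping incomplete tasks."""
    return {
        task: by_flag[True] - by_flag[False]
        for task, by_flag in rates.items()
        if by_flag[True] is not None and by_flag[False] is not None
    }
```

A positive value from `language_advantage` for a task would indicate that the language-informed policy succeeded more often on that task in the recorded trials.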
One task that showed notable differences between the policies was using a crowbar to sweep a cylinder, or bottle, along a table, said Allen Z. Ren, a Ph.D. student in Majumdar’s group and lead author of the research paper.
“With the language training, it learns to grasp at the long end of the crowbar and use the curved surface to better constrain the movement of the bottle,” said Ren. “Without the language, it grasped the crowbar close to the curved surface and it was harder to control.”
The research was supported in part by the Toyota Research Institute (TRI), and is part of a larger TRI-funded project in Majumdar’s research group aimed at improving robots’ ability to function in novel situations that differ from their training environments.
“The broad goal is to get robotic systems — specifically, ones that are trained using machine learning — to generalize to new environments,” said Majumdar. Other TRI-supported work by his group has addressed failure prediction for vision-based robot control, and used an “adversarial environment generation” approach to help robot policies function better in conditions outside their initial training.
Original Article: Words prove their worth as teaching tools for robots