A psychologist weighs in on the general intelligence of artificial intelligence

via Association for Psychological Science

Researchers test cognitive abilities of the language model GPT-3

Researchers at the Max Planck Institute for Biological Cybernetics in Tübingen have examined the general intelligence of the language model GPT-3, a powerful AI tool. Using psychological tests, they studied competencies such as causal reasoning and deliberation, and compared the results with the abilities of humans. Their findings paint a heterogeneous picture: while GPT-3 can keep up with humans in some areas, it falls behind in others, probably due to a lack of interaction with the real world.

Neural networks can learn to respond to input given in natural language and can themselves generate a wide variety of texts. Currently, the probably most powerful of those networks is GPT-3, a language model presented to the public in 2020 by the AI research company OpenAI. GPT-3 can be prompted to formulate various texts, having been trained for this task by being fed large amounts of data from the internet. Not only can it write articles and stories that are (almost) indistinguishable from human-made texts, but surprisingly, it also masters other challenges such as math problems or programming tasks.

The Linda problem: to err is not only human

These impressive abilities raise the question whether GPT-3 possesses human-like cognitive abilities. To find out, scientists at the Max Planck Institute for Biological Cybernetics have now subjected GPT-3 to a series of psychological tests that examine different aspects of general intelligence. Marcel Binz and Eric Schulz scrutinized GPT-3’s skills in decision making, information search, causal reasoning, and the ability to question its own initial intuition. Comparing the test results of GPT-3 with answers of human subjects, they evaluated both if the answers were correct and how similar GPT-3’s mistakes were to human errors.

“One classic test problem of cognitive psychology that we gave to GPT-3 is the so-called Linda problem,” explains Binz, lead author of the study. Here, the test subjects are introduced to a fictional young woman named Linda as a person who is deeply concerned with social justice and opposes nuclear power. Based on the given information, the subjects are asked to decide between two statements: is Linda a bank teller, or is she a bank teller and at the same time active in the feminist movement?

Most people intuitively pick the second alternative, even though the added condition – that Linda is active in the feminist movement – makes it less likely from a probabilistic point of view. And GPT-3 does just what humans do: the language model does not decide based on logic, but instead reproduces the fallacy humans fall into.

Active interaction as part of the human condition

“This phenomenon could be explained by that fact that GPT-3 may already be familiar with this precise task; it may happen to know what people typically reply to this question,” says Binz. GPT-3, like any neural network, had to undergo some training before being put to work: receiving huge amounts of text from various data sets, it has learned how humans usually use language and how they respond to language prompts.

Hence, the researchers wanted to rule out that GPT-3 mechanically reproduces a memorized solution to a concrete problem. To make sure that it really exhibits human-like intelligence, they designed new tasks with similar challenges. Their findings paint a disparate picture: in decision-making, GPT-3 performs nearly on par with humans. In searching specific information or causal reasoning, however, the artificial intelligence clearly falls behind. The reason for this may be that GPT-3 only passively gets information from texts, whereas “actively interacting with the world will be crucial for matching the full complexity of human cognition,” as the publication states. The authors surmise that this might change in the future: since users already communicate with models like GPT-3 in many applications, future networks could learn from these interactions and thus converge more and more towards what we would call human-like intelligence.

Stanford-hosted study examines how AI might affect urban life in 2030

Original Article: Artificial Intelligence from a psychologist’s point of view

Go deeper with Bing News on:

GPT-3

An unknown gpt2-chatbot is blowing up on X: Is this OpenAI's GPT-5 upgrade?
ChatGPT is easily the best-known generative AI chatbot in the world, but it offers different experiences depending on whether or not you pay for a … The post An unknown gpt2-chatbot is blowing up on X: Is this OpenAI’s GPT-5 upgrade?
Open AI CEO Sam Altman Once Called GPT-2 'Very Bad' But Now Confesses He Has A 'Soft Spot' For The Version — Here's ChatGPT's Evolution Story
OpenAI CEO Sam Altman has expressed a fondness for the GPT-2 model despite his previous criticisms of earlier models, as he anticipates significant improvements with the upcoming GPT-5.

Go deeper with Bing News on:

GPT-3 cognitive abilities

Llama 3 reasoning and coding performance tested
Llama 3 has been tested encoding and reasoning by the WorldofAI team providing an insight into what you can expect from the open source AI ...
OpenAI’s GPT-4 Can Autonomously Exploit 87% of One-Day Vulnerabilities, Study Finds
Other open-source models, including GPT-3.5 and vulnerability scanners ... conducted to address the gap in knowledge regarding the ability of LLMs to successfully exploit one-day vulnerabilities ...
What does GPT stand for? Understanding GPT 3.5, GPT 4, GPT-4 Turbo, and more
Accelerate your tech game Paid Content How the New Space Race Will Drive Innovation How the metaverse will change the future of work and society Managing the Multicloud The Future of the Internet ...
Microsoft launches Phi-3 Mini, an AI model that is smaller but still rivals GPT-3.5
or Phi-3 Medium, which will have been trained on 14 billion parameters. To put these parameter numbers into perspective, some reports have suggested that OpenAI's GPT-4 Turbo was trained on more ...
Microsoft's new lightweight AI model is as capable as GPT-3.5 despite being small enough to run on a smartphone
Microsoft states that Phi-3 Mini is as capable as GPT-3.5 but that it is a smaller form factor. A new lightweight AI model is here from Microsoft, and it promises to deliver a similar level of ...

What's Your Reaction?

Don't Like it!

I Like it!

Researchers test cognitive abilities of the language model GPT-3

The Linda problem: to err is not only human

Active interaction as part of the human condition

The Latest Updates from Bing News

Go deeper with Bing News on:

GPT-3

Go deeper with Bing News on:

GPT-3 cognitive abilities

Leave a Reply