A new type of computer chip that boosts the performance and slashes the energy demands of systems used for artificial intelligence

By shifting a fundamental property of computation, Princeton researchers have built a new type of computer chip that boosts the performance and slashes the energy demands of systems used for artificial intelligence. The chip, which works with standard programming languages, could be particularly useful on phones, watches or other devices that rely on high-performance computing and have limited battery life.
Photo by Frank Wojciechowski

The chip, based on a technique called in-memory computing, is designed to clear a primary computational bottleneck that forces computer processors to expend time and energy fetching data from memory. In-memory computing performs computation directly in the memory where the data is stored, allowing for greater speed and efficiency.

The announcement of the new chip, along with a system to program it, follows closely on an earlier report that the researchers, in collaboration with Analog Devices Inc., had fabricated circuitry for in-memory computing. Lab tests of that circuitry showed that it would perform tens to hundreds of times faster than comparable chips. However, the initial chip did not include all the components of the most recent version, so its capability was limited.

In the new announcement, researchers in the lab of Naveen Verma, an associate professor of electrical engineering, report that they have integrated the in-memory circuitry into a programmable processor architecture. The chip now works with common computer languages such as C.

“The previous chip was a strong and powerful engine,” said Hongyang Jia, a graduate student in Verma’s group and one of the chip designers. “This chip is the whole car.”

Although it could operate with a broad range of systems, the Princeton chip is intended to support systems designed for deep-learning inference — algorithms that allow computers to make decisions and perform complex tasks by learning from data sets. Deep learning systems direct such things as self-driving cars, facial recognition systems and medical diagnostic software.

Verma said that for many applications, the chip’s energy savings would be as critical as the performance boost. That is because many AI applications are expected to operate on battery-powered devices such as mobile phones or wearable medical sensors. The Apple iPhone X, for example, already has an AI chip as part of its circuitry. But both the energy savings and the performance boost are useful only if the broad base of applications that need them can access them. That is where programmability comes in.

“The classic computer architecture separates the central processor, which crunches the data, from the memory, which stores the data,” Verma said. “A lot of the computer’s energy is used in moving data back and forth.”
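The data-movement cost Verma describes can be made concrete with a toy energy model. The sketch below is only an illustration: the per-operation energy figures and the function names are hypothetical placeholders chosen for the example, not measurements of the Princeton chip or of any real processor.

```python
# Toy energy model. All per-operation costs are hypothetical placeholders,
# not measurements of the Princeton chip or any real processor.
ENERGY_PER_MAC_PJ = 1.0      # assumed cost of one multiply-accumulate, in picojoules
ENERGY_PER_FETCH_PJ = 100.0  # assumed cost of fetching one operand from distant memory


def conventional_energy_pj(num_macs: int) -> float:
    """Separate processor and memory: every operation first fetches its operand."""
    return num_macs * (ENERGY_PER_FETCH_PJ + ENERGY_PER_MAC_PJ)


def in_memory_energy_pj(num_macs: int) -> float:
    """In-memory computing: the operand is already where the computation happens."""
    return num_macs * ENERGY_PER_MAC_PJ


macs = 1_000_000
ratio = conventional_energy_pj(macs) / in_memory_energy_pj(macs)
print(f"conventional / in-memory energy ratio: {ratio:.0f}x")
```

The ratio depends entirely on the assumed costs. The point of the model is only that when fetching dominates, removing the fetch removes most of the energy.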

In part, the new chip is a response to the slowing of Moore’s Law. In 1965, Gordon Moore, who later co-founded Intel, observed that the number of transistors on integrated circuits doubled about every year, and the industry also noted that those transistors became faster and more energy efficient in the process. For decades, these observations, which became known as Moore’s Law, underpinned a transformation in which computers became ever more powerful. But in recent years, transistors have stopped improving at that pace as they run into fundamental physical limits.

Verma, who specializes in circuit and system design, thought about ways around this squeeze on the architectural level rather than the transistor level. The computation needed by AI would be much more efficient if it could be done at the same location as the computer’s memory because it would eliminate the time and energy used to fetch data stored far away. That would make the computer faster without upgrading the transistors. But creating such a system posed a challenge. Memory circuits are designed as densely as possible in order to pack in large amounts of data. Computation, on the other hand, requires that space be devoted to additional transistors.

One option was to substitute electrical components called capacitors for the transistors. Transistors are essentially switches that use voltage changes to stand for the 1s and 0s that make up binary computer signals. They can do all sorts of calculations using arrays of 1 and 0 digits, which is why the systems are called digital. Capacitors store and release electrical charge, and the amount of charge can vary continuously, so they can represent any number, not just 1s and 0s. Verma realized that with capacitors he could perform calculations in a much denser space than he could with transistors.
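One way to see how stored charge can stand for a number is charge sharing: when equal capacitors charged to different voltages are connected together, charge is conserved and the shared voltage settles at the average of the originals. The sketch below illustrates that textbook principle only. The function name, the capacitance value and the 1-volt encoding are assumptions made for the example, and the article does not say that the Princeton chip works exactly this way.

```python
def charge_share(voltages, capacitance_farads=1e-15):
    """Voltage after equal capacitors, each charged to one of `voltages`, are connected.

    Charge is conserved: Q_total = sum(C * V_i). The combined capacitance
    is n * C, so the shared voltage is the mean of the input voltages.
    The capacitance value is a hypothetical placeholder; it cancels out.
    """
    total_charge = sum(capacitance_farads * v for v in voltages)
    total_capacitance = capacitance_farads * len(voltages)
    return total_charge / total_capacitance


# Eight capacitors, each holding a 1-bit result (assumed 1.0 V for "1", 0.0 V for "0").
bits = [1, 0, 1, 1, 0, 1, 0, 0]
shared = charge_share([float(b) for b in bits])

# One analog voltage now encodes how many of the eight bits were 1.
count_of_ones = round(shared * len(bits))
print(shared, count_of_ones)
```

A digital circuit would need an adder built from many transistors to count those bits. Here the physics of the capacitors does the summation in a single step.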

Capacitors can also be fabricated very precisely on a chip, much more so than transistors. The new design pairs capacitors with conventional cells of static random access memory (SRAM) on a chip. The combination of capacitors and SRAM is used to perform computations on the data in the analog (not digital) domain, yet in ways that are reliable and that leave room for programmability. As a result, the memory circuits can now perform calculations as directed by the chip’s central processing unit.
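A rough software analogy for a memory array that computes on its own contents is sketched below. It is not the researchers' design. The bitwise AND in each cell, the optional limited-resolution readout (`adc_levels`) and the function name `in_memory_matvec` are all assumptions introduced for illustration; the article says only that capacitors and SRAM compute in the analog domain under the direction of the processor.

```python
def in_memory_matvec(stored_bits, input_bits, adc_levels=None):
    """Hypothetical sketch of a memory array computing per-column sums.

    stored_bits: rows x cols list of 0/1 values held in the memory cells.
    input_bits:  one 0/1 value applied along each row.
    adc_levels:  if given, each analog column value is rounded to this many
                 evenly spaced levels, mimicking a limited-resolution readout.
    """
    rows = len(stored_bits)
    cols = len(stored_bits[0])
    outputs = []
    for c in range(cols):
        # Each cell contributes (input AND stored bit). Treating the column
        # as an analog quantity, its value is the fraction of cells that are 1.
        analog = sum(input_bits[r] & stored_bits[r][c] for r in range(rows)) / rows
        if adc_levels is not None:
            step = 1.0 / (adc_levels - 1)
            analog = round(analog / step) * step
        outputs.append(analog * rows)  # rescale the fraction back to a count
    return outputs


# The "processor" directs the array: it supplies the inputs and reads the results.
weights = [[1, 0],
           [1, 1],
           [0, 1],
           [1, 1]]
x = [1, 1, 0, 1]
exact = in_memory_matvec(weights, x)
coarse = in_memory_matvec(weights, x, adc_levels=2)
print(exact, coarse)
```

The stored bits never leave the array; only the inputs go in and the column results come out. The coarse readout shows the trade-off an analog scheme has to manage: fewer readout levels mean less precision.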

“In-memory computing has been showing a lot of promise in recent years, in really addressing the energy and speed of computing systems,” said Verma. “But the big question has been whether that promise would scale and be usable by system designers towards all of the AI applications we really care about. That makes programmability necessary.”

Learn more: Merging memory and computation, programmable chip speeds AI, slashes power use
