A completely new kind of algorithm has been developed that exponentially speeds up computation

Smarter, faster algorithm cuts number of steps to solve problems

What if a large class of algorithms used today — from the algorithms that help us avoid traffic to the algorithms that identify new drug molecules — worked exponentially faster?

Computer scientists at the Harvard John A. Paulson School of Engineering and Applied Sciences (SEAS) have developed a completely new kind of algorithm, one that exponentially speeds up computation by dramatically reducing the number of parallel steps required to reach a solution.

The researchers will present their novel approach at two upcoming conferences: the ACM Symposium on Theory of Computing (STOC), June 25-29 and International Conference on Machine Learning (ICML), July 10 -15.

A lot of so-called optimization problems, problems that find the best solution from all possible solutions, such as mapping the fastest route from point A to point B, rely on sequential algorithms that haven’t changed since they were first described in the 1970s. These algorithms solve a problem by following a sequential step-by-step process. The number of steps is proportional to the size of the data. But this has led to a computational bottleneck, resulting in lines of questions and areas of research that are just too computationally expensive to explore.

“These optimization problems have a diminishing returns property,” said Yaron Singer, Assistant Professor of Computer Science at SEAS and senior author of the research. “As an algorithm progresses, its relative gain from each step becomes smaller and smaller.”

Singer and his colleague asked: what if, instead of taking hundreds or thousands of small steps to reach a solution, an algorithm could take just a few leaps?

“This algorithm and general approach allows us to dramatically speed up computation for an enormously large class of problems across many different fields, including computer vision, information retrieval, network analysis, computational biology, auction design, and many others,” said Singer. “We can now perform computations in just a few seconds that would have previously taken weeks or months.”

“This new algorithmic work, and the corresponding analysis, opens the doors to new large-scale parallelization strategies that have much larger speedups than what has ever been possible before,” said Jeff Bilmes, Professor in the Department of Electrical Engineering at the University of Washington, who was not involved in the research. “These abilities will, for example, enable real-world summarization processes to be developed at unprecedented scale.”

Traditionally, algorithms for optimization problems narrow down the search space for the best solution one step at a time. In contrast, this new algorithm samples a variety of directions in parallel. Based on that sample, the algorithm discards low-value directions from its search space and chooses the most valuable directions to progress towards a solution.

Take this toy example:

You’re in the mood to watch a movie similar to The Avengers. A traditional recommendation algorithm would sequentially add a single movie in every step which has similar attributes to those of The Avengers. In contrast, the new algorithm samples a group of movies at random, discarding those that are too dissimilar to The Avengers. What’s left is a batch of movies that are diverse (after all, you don’t want ten Batman movies) but similar to The Avengers. The algorithm continues to add batches in every step until it has enough movies to recommend.

This process of adaptive sampling is key to the algorithm’s ability to make the right decision at each step.

“Traditional algorithms for this class of problem greedily add data to the solution while considering the entire dataset at every step,” said Eric Balkanski, graduate student at SEAS and co-author of the research. “The strength of our algorithm is that in addition to adding data, it also selectively prunes data that will be ignored in future steps.”

In experiments, Singer and Balkanski demonstrated that their algorithm could sift through a data set which contained 1 million ratings from 6,000 users on 4,000 movies and recommend a personalized and diverse collection of movies for an individual user 20 times faster than the state-of-the-art.

The researchers also tested the algorithm on a taxi dispatch problem, where there are a certain number of taxis and the goal is to pick the best locations to cover the maximum number of potential customers. Using a data set of two million taxi trips from the New York City taxi and limousine commission, the adaptive-sampling algorithm found solutions 6 times faster.

“This gap would increase even more significantly on larger scale applications, such as clustering biological data, sponsored search auctions, or social media analytics,” said Balkanski.

Of course, the algorithm’s potential extends far beyond movie recommendations and taxi dispatch optimizations. It could be applied to:

designing clinical trials for drugs to treat Alzheimer’s, multiple sclerosis, obesity, diabetes, hepatitis C, HIV and more
evolutionary biology to find good representative subsets of different collections of genes from large datasets of genes from different species
designing sensor arrays for medical imaging
identifying drug-drug interaction detection from online health forums

This process of active learning is key to the algorithm’s ability to make the right decision at each step and solves the problem of diminishing returns.

“This research is a real breakthrough for large-scale discrete optimization,” said Andreas Krause, professor of Computer Science at ETH Zurich, who was not involved in the research. “One of the biggest challenges in machine learning is finding good, representative subsets of data from large collections of images or videos to train machine learning models. This research could identify those subsets quickly and have substantial practical impact on these large-scale data summarization problems.”

Machine-learning system spontaneously reproduces aspects of human neurology

Singer-Balkanski model and variants of the algorithm developed in the paper could also be used to more quickly assess the accuracy of a machine learning model, said Vahab Mirrokni, a principal scientist at Google Research, who was not involved in the research.

“In some cases, we have a black-box access to the model accuracy function which is time-consuming to compute,” said Mirrokni. “At the same time, computing model accuracy for many feature settings can be done in parallel. This adaptive optimization framework is a great model for these important settings and the insights from the algorithmic techniques developed in this framework can have deep impact in this important area of machine learning research.”

Learn more: ‘Breakthrough’ algorithm exponentially faster than any previous one

The Latest on: Machine learning

[google_news title=”” keyword=”machine learning” num_posts=”10″ blurb_length=”0″ show_thumb=”left”]

via Google News

The Latest on: Machine learning

Machine learning comes to Chrome's address bar on Windows, Mac, and ChromeOS
on April 29, 2024 at 8:42 pm
As noted in the Chromium Blog, starting with Chrome version M124, Google is integrating machine learning models into Chrome's address bar to provide users with more accurate and relevant web page ...
Google adds Machine Learning to power up the Chrome URL bar
on April 29, 2024 at 2:12 pm
The Chrome URL bar, also known as the Omnibox, is an absolute centerpiece of most people's web browsing experience. Used quite literally billions - billions - of times a day, Chrome's URL bar helps ...
AI Jackpot: 7 Machine Learning Stocks to Double Down On
on April 29, 2024 at 1:30 pm
Machine learning is closely related to artificial intelligence. It is a branch thereof which enables computers to emulate human learning. The scientific field arose in the 1950s but has largely ...
Machine learning classifies 191 of the world's most damaging viruses
on April 29, 2024 at 12:15 pm
Researchers from the University of Waterloo have successfully classified 191 previously unidentified astroviruses using a new machine learning-enabled classification process.
Chrome’s address bar adds machine learning to deliver better suggestions
on April 29, 2024 at 12:03 pm
Google announced that the latest version of Chrome (M124) will bring a big improvement to the address bar, also known as the omnibox.
3 Ways AI and Machine Learning are Changing Smartphones
on April 29, 2024 at 10:51 am
AI and machine learning are transforming the world as we know it. Practically every industry has been impacted by AI and machine learning one way or another, with smartphones being no exception. These ...
How Etsy Is Upskilling its Workforce With GenAI and Machine Learning
on April 29, 2024 at 6:22 am
Esty CEO Josh Silverman shared the ways in which his team is using GenAI and machine learning, experimenting with customer service upgrades, fielding customer feedback and training employees on AI.
Chicago Art School Deploys Machine Learning in Admissions
on April 29, 2024 at 12:07 am
After winning a $50,000 grant, the university is deploying the technology to gauge which students are most likely to accept its offers.
Automated machine learning robot unlocks new potential for genetics research
on April 26, 2024 at 9:10 am
University of Minnesota Twin Cities researchers have constructed a robot that uses machine learning to fully automate a complicated microinjection process used in genetic research.

via Bing News

What's Your Reaction?

Don't Like it!

I Like it!

Smarter, faster algorithm cuts number of steps to solve problems

The Latest on: Machine learning

The Latest on: Machine learning

What's Your Reaction?

Leave a Reply