
Genevera Allen (Photo by Tommy LaVergne/Rice University)
Rice U. expert: Key is creating ML systems that question their own predictions
Rice University statistician Genevera Allen says scientists must keep questioning the accuracy and reproducibility of scientific discoveries made by machine-learning techniques until researchers develop new computational systems that can critique themselves.
Allen, associate professor of statistics, computer science and electrical and computer engineering at Rice and of pediatrics-neurology at Baylor College of Medicine, will address the topic in both a press briefing and a general session today at the 2019 Annual Meeting of the American Association for the Advancement of Science (AAAS).
“The question is, ‘Can we really trust the discoveries that are currently being made using machine-learning techniques applied to large data sets?’” Allen said. “The answer in many situations is probably, ‘Not without checking,’ but work is underway on next-generation machine-learning systems that will assess the uncertainty and reproducibility of their predictions.”
Machine learning (ML) is a branch of statistics and computer science concerned with building computational systems that learn from data rather than following explicit instructions. Allen said much of the attention in the ML field has focused on developing predictive models, which make predictions about future data based on patterns learned from data they have already studied.
“A lot of these techniques are designed to always make a prediction,” she said. “They never come back with ‘I don’t know,’ or ‘I didn’t discover anything,’ because they aren’t made to.”
She said uncorroborated data-driven discoveries from recently published ML studies of cancer data are a good example.
“In precision medicine, it’s important to find groups of patients that have genomically similar profiles so you can develop drug therapies that are targeted to the specific genome for their disease,” Allen said. “People have applied machine learning to genomic data from clinical cohorts to find groups, or clusters, of patients with similar genomic profiles.
“But there are cases where discoveries aren’t reproducible; the clusters discovered in one study are completely different than the clusters found in another,” she said. “Why? Because most machine-learning techniques today always say, ‘I found a group.’ Sometimes, it would be far more useful if they said, ‘I think some of these are really grouped together, but I’m uncertain about these others.’”
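To make the clustering point concrete, here is a minimal sketch in Python. It is an illustration only, not Allen's method or any technique named in the article. It assumes synthetic two-dimensional data in place of genomic profiles, a plain k-means routine, and a simple split-half check scored with the Rand index. All function names (`kmeans`, `stability`, `make_data`) are hypothetical. The sketch shows that k-means returns three groups whether or not groups exist, and that refitting on two random halves of the data is one rough way to see whether the groups reproduce.

```python
import random


def nearest(p, centers):
    """Index of the center closest to point p (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda j: (p[0] - centers[j][0]) ** 2 + (p[1] - centers[j][1]) ** 2)


def kmeans(points, k, rng, iters=30):
    """Lloyd's k-means with farthest-first initialization.

    It returns k centers for ANY input; it has no way to say
    "there are no groups here".
    """
    centers = [rng.choice(points)]
    while len(centers) < k:
        # Next center: the point farthest from all centers chosen so far.
        centers.append(max(points, key=lambda p: min(
            (p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centers)))
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        # Move each center to the mean of its group; keep it if the group is empty.
        centers = [
            (sum(q[0] for q in g) / len(g), sum(q[1] for q in g) / len(g)) if g else c
            for g, c in zip(groups, centers)
        ]
    return centers


def rand_index(a, b):
    """Fraction of point pairs that two labelings treat the same way
    (both put the pair together, or both keep it apart)."""
    agree = total = 0
    for i in range(len(a)):
        for j in range(i + 1, len(a)):
            agree += (a[i] == a[j]) == (b[i] == b[j])
            total += 1
    return agree / total


def stability(points, k, trials=10, seed=0):
    """Average agreement between clusterings fit on two random halves
    of the data, with both clusterings then applied to every point."""
    rng = random.Random(seed)
    scores = []
    for _ in range(trials):
        shuffled = points[:]
        rng.shuffle(shuffled)
        half = len(shuffled) // 2
        c1 = kmeans(shuffled[:half], k, rng)
        c2 = kmeans(shuffled[half:], k, rng)
        labels1 = [nearest(p, c1) for p in points]
        labels2 = [nearest(p, c2) for p in points]
        scores.append(rand_index(labels1, labels2))
    return sum(scores) / len(scores)


def make_data(seed=1, n_per_group=50):
    """Synthetic data (an assumption of this sketch): three well-separated
    blobs, plus uniform noise that contains no groups at all."""
    rng = random.Random(seed)
    blob_centers = [(0.0, 0.0), (10.0, 0.0), (5.0, 9.0)]
    blobs = [(rng.gauss(cx, 0.5), rng.gauss(cy, 0.5))
             for cx, cy in blob_centers for _ in range(n_per_group)]
    noise = [(rng.uniform(0, 10), rng.uniform(0, 10))
             for _ in range(3 * n_per_group)]
    return blobs, noise


if __name__ == "__main__":
    blobs, noise = make_data()
    print("stability on three real groups:", round(stability(blobs, 3), 2))
    print("stability on structureless data:", round(stability(noise, 3), 2))
```

In this toy setting the well-separated groups should reproduce almost perfectly across halves, while the structureless data should score lower even though the algorithm still reports three clusters each time. A low score of this kind is a signal to check a reported grouping before trusting it. It does not replace the uncertainty-aware systems Allen describes.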
Learn more: Can we trust scientific discoveries made using machine learning?