Machine learning system aims to determine if an information outlet is accurate or biased

An MIT system needs only about 150 articles to detect the factuality of a news source — meaning it could be used to help stamp out new fake-news outlets before their stories spread too widely. The most reliable way to detect fake news and biased reporting was to look at the common linguistic features across the source’s stories, including sentiment, complexity, and structure.
Image courtesy of MIT CSAIL

Detecting fake news at its source

Lately the fact-checking world has been in a bit of a crisis. Sites like Politifact and Snopes have traditionally focused on specific claims, which is admirable but tedious; by the time they’ve gotten through verifying or debunking a fact, there’s a good chance it’s already traveled across the globe and back again.

Social media companies have also had mixed results limiting the spread of propaganda and misinformation. Facebook plans to have 20,000 human moderators by the end of the year, and is putting significant resources into developing its own fake-news-detecting algorithms.

Researchers from MIT’s Computer Science and Artificial Intelligence Lab (CSAIL) and the Qatar Computing Research Institute (QCRI) believe that the best approach is to focus not only on individual claims, but on the news sources themselves. Using this tack, they’ve demonstrated a new system that uses machine learning to determine if a source is accurate or politically biased.

“If a website has published fake news before, there’s a good chance they’ll do it again,” says postdoc Ramy Baly, the lead author on a new paper about the system. “By automatically scraping data about these sites, the hope is that our system can help figure out which ones are likely to do it in the first place.”

Baly says the system needs only about 150 articles to reliably detect if a news source can be trusted — meaning that an approach like theirs could be used to help stamp out new fake-news outlets before the stories spread too widely.

The system is a collaboration between computer scientists at MIT CSAIL and QCRI, which is part of the Hamad Bin Khalifa University in Qatar. Researchers first took data from Media Bias/Fact Check (MBFC), a website with human fact-checkers who analyze the accuracy and biases of more than 2,000 news sites; from MSNBC and Fox News; and from low-traffic content farms.

They then fed those data to a machine learning algorithm, and programmed it to classify news sites the same way as MBFC. When given a new news outlet, the system was then 65 percent accurate at detecting whether it has a high, low or medium level of factuality, and roughly 70 percent accurate at detecting if it is left-leaning, right-leaning, or moderate.

The team determined that the most reliable ways to detect both fake news and biased reporting were to look at the common linguistic features across the source’s stories, including sentiment, complexity, and structure.

For example, fake-news outlets were found to be more likely to use language that is hyperbolic, subjective, and emotional. In terms of bias, left-leaning outlets were more likely to have language that related to concepts of harm/care and fairness/reciprocity, compared to other qualities such as loyalty, authority, and sanctity. (These qualities represent a popular theory — that there are five major moral foundations — in social psychology.)

Co-author Preslav Nakov, a senior scientist at QCRI, says that the system also found correlations with an outlet’s Wikipedia page, which it assessed for general — longer is more credible — as well as target words such as “extreme” or “conspiracy theory.” It even found correlations with the text structure of a source’s URLs: Those that had lots of special characters and complicated subdirectories, for example, were associated with less reliable sources.

“Since it is much easier to obtain ground truth on sources [than on articles], this method is able to provide direct and accurate predictions regarding the type of content distributed by these sources,” says Sibel Adali, a professor of computer science at Rensselaer Polytechnic Institute who was not involved in the project.

Nakov is quick to caution that the system is still a work in progress, and that, even with improvements in accuracy, it would work best in conjunction with traditional fact-checkers.

“If outlets report differently on a particular topic, a site like Politifact could instantly look at our fake news scores for those outlets to determine how much validity to give to different perspectives,” says Nakov.

Baly and Nakov co-wrote the new paper with MIT Senior Research Scientist James Glass alongside graduate students Dimitar Alexandrov and Georgi Karadzhov of Sofia University. The team will present the work later this month at the 2018 Empirical Methods in Natural Language Processing (EMNLP) conference in Brussels, Belgium.

The researchers also created a new open-source dataset of more than 1,000 news sources, annotated with factuality and bias scores, that is the world’s largest database of its kind. As next steps, the team will be exploring whether the English-trained system can be adapted to other languages, as well as to go beyond the traditional left/right bias to explore region-specific biases (like the Muslim world’s division between religious and secular).

“This direction of research can shed light on what untrustworthy websites look like and the kind of content they tend to share, which would be very useful for both web designers and the wider public,” says Andreas Vlachos, a senior lecturer at the University of Cambridge who was not involved in the project.

Nakov says that QCRI also has plans to roll out an app that helps users step out of their political bubbles, responding to specific news items by offering users a collection of articles that span the political spectrum.

Bots can have an invisible influence on social media bringing up an ethical gray area

“It’s interesting to think about new ways to present the news to people,” says Nakov. “Tools like this could help people give a bit more thought to issues and explore other perspectives that they might not have otherwise considered.”

Learn more: Detecting fake news at its source

The Latest on: Detecting fake news

[google_news title=”” keyword=”detecting fake news” num_posts=”10″ blurb_length=”0″ show_thumb=”left”]

via Google News

The Latest on: Detecting fake news

School principal was framed using AI-generated racist rant, police say. A co-worker is now charged.
on April 26, 2024 at 6:55 am
Pikesville High School's principal Eric Eiswert was implicated in an audio recording making rounds on social media containing racist and antisemitic comments, but police say the voice was AI-generated ...
Zoom and enhance: Adobe AI sharpens videos by up to 8 times the original resolution with minimal artifacts
on April 26, 2024 at 6:27 am
The software giant Adobe has introduced a new AI feature, known as VideoGigaGAN, that promises to upscale videos by eight times the original resolution while minimizing common visual artifacts. This ...
I’m a private investigator — these are the 5 craziest things I’ve seen cheaters do to hide
on April 25, 2024 at 10:29 am
“A man was suspected of cheating on his wife of 50 years; they had three children and seven grandchildren,” Hart told the Mirror. “After utilizing surveillance, we identified that he was living at ...
Banner Urgent Care helps patients detect Valley fever early
on April 24, 2024 at 6:36 pm
Health experts warn that Valley fever cases are likely to rise this spring and to get ahead, new technology is detecting Valley fever in its early stages for patients at Banner Urgent Care. FOX 10's ...
Computer game in school made students better at detecting fake news
on April 24, 2024 at 2:19 pm
A computer game helped upper secondary school students become better at distinguishing between reliable and misleading news.
Computer game helps students get better at detecting fake news
on April 24, 2024 at 12:53 pm
A computer game helped upper secondary school students become better at distinguishing between reliable and misleading news. This is shown by a study conducted by researchers at Uppsala University and ...
Insider Q&A: Trust and safety exec talks about AI and content moderation
on April 22, 2024 at 9:02 pm
Alex Popken was a trust and safety executive at Twitter, leaving in 2023 after a decade at the social media company focusing on content moderation.
With elections and more coming in 2024, here's how to spot deepfake videos
on April 22, 2024 at 6:22 am
While you have the video paused, you can also try taking a screenshot and doing an image search online to see if the original video or references to a fake video are found. If you’re not familiar with ...
The telltale signs of AI-generated images, video and audio, according to experts
on April 21, 2024 at 6:08 am
Experts say it’s still possible to detect AI-generated imagery, but the tools for outsmarting our eyes are getting better and better. Here’s what to look for.

via Bing News

What's Your Reaction?

Don't Like it!

I Like it!

The Latest on: Detecting fake news

The Latest on: Detecting fake news

What's Your Reaction?

Leave a Reply