As news breaks, experts identify the test cases that matter most: the prompts where issues are likeliest to surface. Our judges are then calibrated to 95%+ agreement with expert consensus before any model is scored.
Do AI systems present all sides of the story?
Political and social debates rarely have a single correct answer, yet AI systems are increasingly asked to discuss them. We evaluate whether models present multiple perspectives without favoring one side, using ideologically loaded language, or embedding assumptions into how they frame questions.
How often each model's response takes on a political lean across all evaluated prompts, and whether that changes depending on how the question is framed.
Are AI systems using reliable sources?
The credibility of an AI's answer is only as good as the sources it draws from. We evaluate whether models rely on high-quality sources such as primary documents, peer-reviewed research, and reputable journalism. We also flag reliance on government-controlled media.
Distribution of citations across source quality tiers. Primary and research sources represent the highest-quality evidence; informal and self-published web sources the lowest.
Are AI systems covering the news accurately?
Factual errors in news contexts can mislead voters, spread misinformation, and undermine trust. We evaluate how accurately models represent verifiable claims, whether they hallucinate sources or statistics, and how well they distinguish established facts from contested assertions.
The share of verifiable factual claims in each model's responses that were confirmed true, contested, or false/hallucinated.
Active stories AI systems are covering right now
A live snapshot of the news cycle our judges are evaluating. Activity reflects volume of conversation on X for each story; difficulty summarizes story-level performance across Accuracy, Neutrality, and Source Quality.