Fareed Zakaria
Fareed Zakaria
CNN Host & Author
Van Jones
Van Jones
CNN Political Commentator
Scott Jennings
Scott Jennings
CNN Senior Political Commentator
Niall Ferguson
Niall Ferguson
Historian, Hoover Institution
Rob Reich
Rob Reich
Political Scientist, Stanford
Antony Blinken
Antony Blinken
Former U.S. Secretary of State
Kevin McCarthy
Kevin McCarthy
Former Speaker of the House
Fareed Zakaria
Fareed Zakaria
CNN Host & Author
Van Jones
Van Jones
CNN Political Commentator
Scott Jennings
Scott Jennings
CNN Senior Political Commentator
Niall Ferguson
Niall Ferguson
Historian, Hoover Institution
Rob Reich
Rob Reich
Political Scientist, Stanford
Antony Blinken
Antony Blinken
Former U.S. Secretary of State
Kevin McCarthy
Kevin McCarthy
Former Speaker of the House
Kevin McCarthy
Kevin McCarthy
Former Speaker of the House
Antony Blinken
Antony Blinken
Former U.S. Secretary of State
Rob Reich
Rob Reich
Political Scientist, Stanford
Niall Ferguson
Niall Ferguson
Historian, Hoover Institution
Scott Jennings
Scott Jennings
CNN Senior Political Commentator
Van Jones
Van Jones
CNN Political Commentator
Fareed Zakaria
Fareed Zakaria
CNN Host & Author
Kevin McCarthy
Kevin McCarthy
Former Speaker of the House
Antony Blinken
Antony Blinken
Former U.S. Secretary of State
Rob Reich
Rob Reich
Political Scientist, Stanford
Niall Ferguson
Niall Ferguson
Historian, Hoover Institution
Scott Jennings
Scott Jennings
CNN Senior Political Commentator
Van Jones
Van Jones
CNN Political Commentator
Fareed Zakaria
Fareed Zakaria
CNN Host & Author
Rob Reich
Rob Reich
Political Scientist, Stanford
Fareed Zakaria
Fareed Zakaria
CNN Host & Author
Kevin McCarthy
Kevin McCarthy
Former Speaker of the House
Van Jones
Van Jones
CNN Political Commentator
Antony Blinken
Antony Blinken
Former U.S. Secretary of State
Scott Jennings
Scott Jennings
CNN Senior Political Commentator
Niall Ferguson
Niall Ferguson
Historian, Hoover Institution
Rob Reich
Rob Reich
Political Scientist, Stanford
Fareed Zakaria
Fareed Zakaria
CNN Host & Author
Kevin McCarthy
Kevin McCarthy
Former Speaker of the House
Van Jones
Van Jones
CNN Political Commentator
Antony Blinken
Antony Blinken
Former U.S. Secretary of State
Scott Jennings
Scott Jennings
CNN Senior Political Commentator
Niall Ferguson
Niall Ferguson
Historian, Hoover Institution
Aggregate score Aggregate score is the average across three components: Neutrality, Accuracy, and Source Quality.
Last updated May 13, 2026
Methodology
Forum AI convenes leading experts to evaluate AI on the news that matters.

Experts identify the test cases that matter most as news breaks — the prompts where issues are likeliest to surface. Our judges are then calibrated to 95%+ agreement with expert consensus before any model is scored. View whitepaper


Breaking the problem down

Notable findings
Neutrality Leaderboard

Do AI systems present all sides of the story?

Political and social debates rarely have a single correct answer — yet AI systems are increasingly asked to discuss them. We evaluate whether models present multiple perspectives without favoring one side, using language that is ideologically loaded, or embedding assumptions into how they frame questions.

Overall Neutrality Score

Ideological Lean Overview

Breaking down neutrality

How models respond by prompt framing

Across all evaluated prompts, how often does each model's response take on a political lean — and does that change depending on how the question is framed?


Notable findings
Source Quality Leaderboard

Are AI systems using reliable sources?

The credibility of an AI's answer is only as good as the sources it draws from. We evaluate whether models rely on quality information like primary sources, peer-reviewed research, and reputable journalism. We also flag government-controlled media.

Overall Source Quality Score

Inline Source Quality Score Scored on the citations the model placed inline next to specific claims. The overall Source Quality score reflects the broader set of sources the model considered.

Source Tier Breakdown by Model

Distribution of citations across source quality tiers. Primary and research sources represent the highest-quality evidence; informal and self-published web sources the lowest.


Notable findings
Accuracy Leaderboard

Are AI systems covering the news accurately?

Factual errors in news contexts can mislead voters, spread misinformation, and undermine trust. We evaluate how accurately models represent verifiable claims, whether they hallucinate sources or statistics, and how well they distinguish established facts from contested assertions.

Overall Accuracy Score

Checkable Claims Breakdown

Of the verifiable factual claims in each model's responses — how many were confirmed true, contested, or false/hallucinated.


Notable findings
News Monitoring

Active stories AI systems are covering right now

A live snapshot of the news cycle our judges are evaluating. Activity reflects volume of conversation on X for each story; difficulty summarizes story-level performance across Accuracy, Neutrality, and Source Quality.

Active stories

Notable findings
Judge Health How reliable are our evaluation judges? This tab validates pipeline judgments against a validation set of human-rated items using Cohen's kappa, accuracy, TPR, and TNR per criterion.
This page is for internal use only and is not visible to the public.
Full Benchmark
Neutrality
Factuality
Source Quality
Lean Detection Rate