Question 1

How calls are judged

Accepted Answer

For each call in our database, we assemble structured context (game state, players involved, official assignment) and unstructured context (Last Two Minute report excerpts, play-by-play descriptions, video when available). Multiple AI models then independently produce a judgment: correct, incorrect, missed, or inconclusive. The latest judgement is the one displayed.

Question 2

Confidence scoring

Accepted Answer

Each judgment includes a confidence score from 0 to 100. Higher confidence means the available evidence strongly supports the verdict. Low confidence is shown explicitly — we never hide uncertainty. Confidence is not a probability; it is an analytical estimate from the model.

Question 3

Provenance & sources

Accepted Answer

Every judgement is paired with linked sources: the L2M report row, the play-by-play event, the box score, or the original video. If a source cannot be cited, the judgement is marked accordingly and downgraded in confidence.

Question 4

L2M usage

Accepted Answer

The NBA's Last Two Minute reports are an authoritative public source for late-game review. We ingest them, link each row to its underlying call where possible, and use them as primary evidence when judging final-period plays.

Question 5

What "Bias Score" means

Accepted Answer

A bias score is an aggregate analytical estimate — not an accusation. It compares the rate at which an official's calls go for or against a given team, player, or context, relative to a baseline. A non-zero bias score reflects a statistical pattern in our judged sample, not intent or wrongdoing.

Methodology

How calls are judged

Confidence scoring

Provenance & sources

L2M usage

What "Bias Score" means

How foul differential works

What "Confidence" means

What AI does and does NOT do

Language we use — and avoid

Limitations