Gallery — real scores, no login

Three use cases. One scoring engine.

Every card below is a live score sitting in the database — click any one to see the per-metric breakdown, judge rationale, and ranked suggestions. To score your own URL, ticket, or PR, you'll need an account.

Use case · Landing pages

Score any webpage through a customer lens.

Same judge, four real homepages. The rationale cites copy from each page — these are real scores, not screenshots.

Customer-Centric Judge

Judge homepage (this site)

Rubric

65.3 / 100

Judge.tools leads with a strong, concrete value proposition and backs it with unusually honest trust mechanics (published noise floor, deterministic deltas, versioned judges, tran…

May 8 · Open breakdown →

Customer-Centric Judge

Stripe

Rubric

65.3 / 100

Stripe's homepage delivers strong trust signals through quantified statistics and named enterprise case studies, and the dual CTA pattern with Google sign-up reduces signup fricti…

May 8 · Open breakdown →

Customer-Centric Judge

Linear

Rubric

50.4 / 100

Linear's marketing page is clearly built for a technical-insider audience: it embeds shell commands, raw code diffs, internal ticket IDs, and unexplained acronyms (PRDs, MCP, Grap…

May 8 · Open breakdown →

Customer-Centric Judge

Vercel

Rubric

54.9 / 100

Vercel's homepage is technically dense and clearly written for developers who already know terms like 'Fluid Compute,' 'multi-tenant,' and 'Framework-Defined Infrastructure.' A no…

May 8 · Open breakdown →

Use case · Engineering tickets

Score any text artifact — even a Linear ticket.

The spec-completeness rubric scores a vague 'Fix slow dashboard' ticket against scope, acceptance criteria, success metric, and edge cases. Same engine works for any text artifact.

Spec Completeness Judge

ENG-1234 · "Fix slow dashboard"

Rubric

15.0 / 100

This spec scores poorly across all dimensions. The problem statement relies entirely on anecdotal, unmeasured language ("feels slow," "forever to load," "looks bad") with no basel…

May 8 · Open breakdown →
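For readers curious how a rubric like the one above could roll per-metric results into a single number, here is a minimal sketch. Everything in it is an assumption for illustration: the `MetricScore` type, the equal weights, the weighted-average formula, and the sample values are hypothetical and are not the scores or the method behind the ENG-1234 card.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MetricScore:
    """One rubric dimension with its judged score (hypothetical structure)."""
    name: str
    score: float          # 0-100 for this dimension
    weight: float = 1.0   # assumption: equal weights unless stated otherwise


def overall_score(metrics: list[MetricScore]) -> float:
    """Weighted average of per-metric scores, rounded to one decimal place."""
    total_weight = sum(m.weight for m in metrics)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    weighted_sum = sum(m.score * m.weight for m in metrics)
    return round(weighted_sum / total_weight, 1)


# Invented sample values for the four dimensions named in the rubric text.
example = [
    MetricScore("scope", 40.0),
    MetricScore("acceptance criteria", 25.0),
    MetricScore("success metric", 10.0),
    MetricScore("edge cases", 5.0),
]

print(f"{overall_score(example)} / 100")
```

A weighted average keeps the overall number on the same 0 to 100 scale as each dimension, so a low score on one dimension is easy to trace in the breakdown.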

Use case · Code PRs via MCP

Score a real GitHub PR — diff, description, and all.

A merged vercel/swr PR, fetched live via the GitHub API and scored by the Code Quality Judge. Connect the GitHub App and the same scoring runs automatically on every PR.

Code Quality Judge

vercel/swr · PR #4243

Rubric

72.8 / 100

This is a pure dependency-version bump across two `package.json` files in example directories, resolving four named CVEs. The change is minimal and targeted: only the `axios` vers…

May 8 · Open breakdown →
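As a rough illustration of what fetching a PR through the GitHub API can look like, the sketch below builds a request for a pull request's diff using GitHub's public REST endpoint and its `application/vnd.github.diff` media type. The helper names `build_pr_request` and `fetch_pr_diff` are hypothetical, and this is not a description of how the site's own integration works.

```python
import urllib.request
from typing import Optional

GITHUB_API = "https://api.github.com"


def build_pr_request(owner: str, repo: str, number: int,
                     token: Optional[str] = None) -> urllib.request.Request:
    """Build a GET request for a pull request, asking for the diff format."""
    headers = {
        # GitHub media type that returns the PR as a unified diff.
        "Accept": "application/vnd.github.diff",
        "X-GitHub-Api-Version": "2022-11-28",
        "User-Agent": "pr-diff-sketch",
    }
    if token:
        # Optional: public repos work without a token, at a lower rate limit.
        headers["Authorization"] = f"Bearer {token}"
    url = f"{GITHUB_API}/repos/{owner}/{repo}/pulls/{number}"
    return urllib.request.Request(url, headers=headers)


def fetch_pr_diff(owner: str, repo: str, number: int,
                  token: Optional[str] = None) -> str:
    """Perform the network call and return the diff text."""
    request = build_pr_request(owner, repo, number, token)
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")


# Building the request does not touch the network.
request = build_pr_request("vercel", "swr", 4243)
print(request.full_url)
```

Calling `fetch_pr_diff("vercel", "swr", 4243)` would perform the actual download; the PR description comes from the same endpoint when requested with the default JSON media type.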

Score your own — 30 seconds, no card.

Sign up to score your homepage, your latest PR, or any text artifact your team ships. The system rubrics come preloaded, and you can create a custom one from a one-line prompt.