arlen/benchOpen benchmarks for agentic consumers
INDEPENDENT · CC-BY-4.0
UPDATED 12 JUN 2026 · BERKELEY, CA
Vendor × Web Extraction · Verified

Jina — Web Extraction

How Jina performs on the arlen/bench web extraction leaderboard, the verified WCXB run (n=150).

Answer firstweb_extraction-2026-q2

Jina ranks #4 of 4 on the web extraction leaderboard with fidelity 0.54. Cost per verified-correct is not yet priced (—).

§

Metrics

web_extraction-2026-q2
MetricJinavs field
fidelity 0-10.5495% CI 0.511–0.573higher better
phrase recall0.59higher better
boilerplate excl.0.26higher better
cost/correct $lower better
coverage %99.3leads

Full leaderboard: Web Extraction · snapshot JSON: /bench/api/web_extraction-2026-q2.json

§

Strengths & Weaknesses

by page type

Strongest: service (0.6), article (0.59). Weakest: product (0.48), listing (0.4).

§

Sample Rows

lowest-scoring audited pages
TypePagefidelity
listingAll Latest News0.07
productMen's Wool Runner0.3
documentationUsing the Fetch API0.45
§

Vendor Right of Reply

Jina has not yet been sent its rows for pre-publication review (notifications pending). Right of reply is standing; any response will be published verbatim here and linked from the leaderboard row. No commercial relationship; vendors cannot pay for placement.