arlen/bench · Open benchmarks for agentic consumers
SNAPSHOT web_search-2026-q3
11 JUN 2026 · BERKELEY, CA
§ 04 · Instrument · Live

Freshness Lag

Sentinel pages are planted with known publish times and probed every 6 hours. Freshness lag is the time between a page being published and a vendor being able to retrieve it.
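To make the measurement concrete, here is a minimal sketch of how a single sentinel page's lag could be derived from its probe log. It is not the benchmark's actual code: the function name `lag_hours`, the probe-log shape (a list of `(probe time, found)` pairs), and the choice to report the first successful probe as the lag are all assumptions made for illustration.

```python
from datetime import datetime, timedelta
from typing import List, Tuple

# Assumption: pages not retrieved within 30 days are right-censored.
CENSOR_AFTER = timedelta(days=30)


def lag_hours(published: datetime,
              probes: List[Tuple[datetime, bool]]) -> Tuple[float, bool]:
    """Return (lag in hours, observed) for one sentinel page and one vendor.

    observed=False means the page was never retrieved inside the
    30-day window, so the lag is censored at 720 h.
    """
    for probed_at, found in sorted(probes):
        if probed_at - published > CENSOR_AFTER:
            break  # past the censoring window, stop looking
        if found and probed_at >= published:
            # First successful probe: an upper bound on the true lag,
            # since the page may have become retrievable between probes.
            return (probed_at - published).total_seconds() / 3600, True
    return CENSOR_AFTER.total_seconds() / 3600, False


# Hypothetical probe log: missed at +6 h, found at +12 h.
t0 = datetime(2026, 6, 3)
log = [(t0 + timedelta(hours=6), False), (t0 + timedelta(hours=12), True)]
result = lag_hours(t0, log)  # (12.0, True)
```

With probes 6 hours apart, each observed lag in this sketch is only known to the nearest probe interval.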

9.4 h · median time-to-retrievability, all vendors · −12% vs prior 7d
48 sentinel pages · Kaplan–Meier, right-censored 30d · 24-cell matrix: domain age × rendering × discoverability × region
[Chart: time-to-retrievability in hours (y-axis 0 h to 24 h), Jun 03 to Jun 09; series: Tavily, Serper, Exa, Brave; labeled values: 7.8, 10.6]
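The headline median is described as a Kaplan–Meier estimate with right-censoring at 30 days. As a rough illustration of how such a median can be computed, here is a minimal pure-Python sketch. The function name `km_median` and the sample lags are hypothetical, and the benchmark's own implementation may differ (for example in how it handles the 6-hour probe interval).

```python
from typing import List, Optional


def km_median(durations: List[float], observed: List[bool]) -> Optional[float]:
    """Kaplan-Meier median: the smallest time t where S(t) <= 0.5.

    durations: lag in hours for each sentinel page.
    observed:  True if the page was retrieved, False if right-censored.
    Returns None when the survival curve never drops to 0.5.
    """
    pairs = sorted(zip(durations, observed))
    at_risk = len(pairs)   # pages not yet retrieved or censored
    survival = 1.0         # S(t): share of pages still not retrievable
    i = 0
    while i < len(pairs):
        t = pairs[i][0]
        events = 0
        removed = 0
        # Group all pages that share the same time t.
        while i < len(pairs) and pairs[i][0] == t:
            events += 1 if pairs[i][1] else 0
            removed += 1
            i += 1
        if events:
            survival *= 1 - events / at_risk
            if survival <= 0.5:
                return t
        # Censored pages leave the risk set without lowering S(t).
        at_risk -= removed
    return None


# Made-up lags: four pages retrieved, one censored at 30 d (720 h).
lags_h = [6.0, 12.0, 12.0, 18.0, 720.0]
seen = [True, True, True, True, False]
median_h = km_median(lags_h, seen)  # 12.0
```

Censored pages still count in the risk set until their censoring time, which is why they affect the median without being treated as 720-hour retrievals.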
§ 04.1

Per Vendor

lower is better
#  Vendor     Retrievability (h)  Extraction (h)  Fresh-domain penalty (×)  Censored @30d (%)
1  Tavily     7.8                 0.4             1.9                       4.2
2  Serper     10.6                0.3             2.8                       6.3
3  Exa        11.2                0.5             3.4                       8.3
4  Brave      13.9                0.4             2.2                       10.4
5  Firecrawl  15.4                0.2             4.1                       12.5

Extraction is near-instant everywhere (0.2 to 0.5 h), so fetch-on-demand works. The spread is in retrievability, from 7.8 h to 15.4 h: index freshness is the real differentiator.