Benchlist uses Aligned Layer — a proof aggregation network settling on Ethereum L1 — as the verification backbone. When you see a Verified ⛓ badge, it means the score is backed by an on-chain proof you can re-check.
Every attested run on Benchlist shows up here. Click any batch to jump into the proof viewer — or deep-link to Aligned’s explorer and Etherscan for an independent check.
Publishers pick the proof system that fits their scoring function. The default is SP1 — a RISC-V zkVM that runs unmodified Python/Rust scoring code.
Every commitment is a one-way hash. Edit one character in a transcript and the Merkle root changes, the proof fails, Aligned rejects the batch.
The dataset hash and methodology hash are pinned in the proof. Anyone can re-run the benchmark and check whether the same inputs produce the same score.
Attestors post ETH stake. If a dispute is upheld — i.e., a community replay produces a materially different score — the attestor's stake is slashed.
publisher.json (service metadata)
│
▼
attestor runner (SP1 / Risc0 / signed Ed25519)
│ ┌─ commits ─▶ datasetHash ─┐
│ ├─ commits ─▶ methodologyHash ├─ merkle root
│ ├─ commits ─▶ transcripts ─┘
│ └─ produces ─▶ zk proof of score
▼
aligned-sdk submit (Holesky → mainnet bridge)
│
▼
Aligned operator set (BLS signatures over batch)
│
▼
ServiceManager.sol (0xeF2A…606c on Ethereum L1)
│
▼
batch_id + block ────▶ Benchlist listing credential
Ruby-simple CLI. Run the benchmark, submit the proof, wait for verification.
# 1. Run the benchmark locally (or in CI)
$ benchlist run longmemeval \
--service rem-labs \
--model claude-opus-4-7 \
--runs 3 \
--out run.json
# 2. Commit: hash transcripts, compute Merkle root
$ benchlist commit run.json
# 3. Prove: produce SP1 proof of scoring function
$ benchlist prove run.json --system sp1
# 4. Submit to Aligned Layer
$ benchlist submit run.json --network holesky
→ batch_id: 0x7b3c...2b4c
→ verifier: 0xeF2A...606c
→ waiting for on-chain verification...
→ verified at block 22184921
# 5. Publish the listing
$ benchlist publish run.json
→ https://benchlist.ai/verify/run-rem-lme-001
Aligned Layer’s AVS contracts live on Ethereum L1 mainnet — secured by restaked ETH via EigenLayer, not on Base or any rollup. Point your node at any of these to validate a batch independently.
Benchlist’s own on-chain contracts — for publisher registration, attestor staking, and dispute resolution — are in audit. Addresses publish here once deployed. Until then, payments settle via Stripe (card) and the crypto endpoint (native ETH on Base for pennies of gas, Ethereum L1 for directness, or Arbitrum).
Full wire format and reference runner implementation in the integration docs. Reference runner is MIT-licensed, forkable.