Leaderboards
Public rankings by benchmark task. Any column can be sorted; hover over a metric header for a short description.
See Benchmarks for task definitions and Evaluation for metric definitions. The tables here list scalar scores only; P–P style diagnostics appear as plots on the Evaluation page.
Illustrative rows only. Entries whose team name starts with [Demo] are seeded with placeholder scores so the layout is easy to review — they are not official benchmark submissions.
LVK — Level 0
ground network · intrinsic PE · fixed extrinsic · bbh-pe-l0
Same Level-0 science target as the LVK board: five intrinsic BBH parameters with extrinsic quantities fixed by the benchmark; canonical ladder slug from release notes.
| # | Team | Method | C2ST↓ | Expected coverage↓ | KL divergence↓ | JSD↓ | IS-ESS↑ | Date |
|---|---|---|---|---|---|---|---|---|
| 1 | [Demo] Posterior Pioneers | Neural flow matching | 0.480 | 0.060 | 0.110 | 0.035 | 9.40e+2 | Apr 2026 |
| 2 | [Demo] Nested Nebula Crew | MultiNest-style nested sampling | 0.550 | 0.110 | 0.280 | 0.092 | 4.10e+2 | Apr 2026 |
| 3 | [Demo] Slice & Dice Inference | Mean-field VI + normalizing flow | 0.620 | 0.180 | 0.450 | 0.140 | 2.20e+2 | Apr 2026 |
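The column sorting described at the top of the page can be reproduced offline. The following is a minimal sketch, assuming that ↓ in a header marks a lower-is-better metric and ↑ a higher-is-better one. The row data are the placeholder demo scores from the table above; the dictionary keys and the `rank_by` helper are hypothetical names chosen for illustration, not part of any benchmark tooling.

```python
from operator import itemgetter

# Placeholder demo rows copied from the LVK Level 0 table (not official results).
ROWS = [
    {"team": "[Demo] Posterior Pioneers", "c2st": 0.480, "expected_coverage": 0.060,
     "kl": 0.110, "jsd": 0.035, "is_ess": 9.40e2},
    {"team": "[Demo] Nested Nebula Crew", "c2st": 0.550, "expected_coverage": 0.110,
     "kl": 0.280, "jsd": 0.092, "is_ess": 4.10e2},
    {"team": "[Demo] Slice & Dice Inference", "c2st": 0.620, "expected_coverage": 0.180,
     "kl": 0.450, "jsd": 0.140, "is_ess": 2.20e2},
]

# Assumed sort direction per metric, read off the header arrows:
# True means lower is better (↓), False means higher is better (↑).
LOWER_IS_BETTER = {
    "c2st": True,
    "expected_coverage": True,
    "kl": True,
    "jsd": True,
    "is_ess": False,
}


def rank_by(rows, metric):
    """Return a new list of rows ordered best-first for the given metric."""
    return sorted(rows, key=itemgetter(metric), reverse=not LOWER_IS_BETTER[metric])


if __name__ == "__main__":
    for position, row in enumerate(rank_by(ROWS, "is_ess"), start=1):
        print(position, row["team"], row["is_ess"])
```

Keeping the direction in a separate mapping means one helper can rank by any column, and a metric with a different convention only needs its entry in `LOWER_IS_BETTER` changed.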
LVK — Level 1
ground network · full extrinsic (11D)
Same setting as Level 0 with free extrinsic parameters (11D). Higher reference-sampler cost.
| # | Team | Method | C2ST↓ | Expected coverage↓ | KL divergence↓ | JSD↓ | IS-ESS↑ | Date |
|---|---|---|---|---|---|---|---|---|
| 1 | [Demo] Ten-Dimensional Den | Parallel-tempered ensemble MCMC | 0.520 | 0.090 | 0.210 | 0.068 | 6.20e+2 | Apr 2026 |
| 2 | [Demo] Trans-D Walkers | RJMCMC + surrogate likelihood | 0.580 | 0.140 | 0.330 | 0.110 | 3.80e+2 | Apr 2026 |
LISA — Level 0
preview · roadmap
Space-based LISA track, Level 0 definition in preparation; the public leaderboard will populate after the first evaluation window.
| # | Team | Method | C2ST↓ | Expected coverage↓ | KL divergence↓ | JSD↓ | IS-ESS↑ | Date |
|---|---|---|---|---|---|---|---|---|
| 1 | [Demo] LISA Ladder League | Sky-marginalized normalizing flow | 0.490 | 0.070 | 0.150 | 0.048 | 7.10e+2 | Apr 2026 |
| 2 | [Demo] Ringdown Randomistas | Hierarchical VB on MBHBs | 0.570 | 0.120 | 0.310 | 0.100 | 3.40e+2 | Apr 2026 |