Leaderboards

Public rankings by benchmark task. Sort any column — hover a metric header for a short description.

See Benchmarks for task definitions and Evaluation for metric definitions (tables list the scalar scores used here; P–P style diagnostics are on the Evaluation page as plots).

Illustrative rows only. Entries whose team name starts with [Demo] are seeded with placeholder scores so the layout is easy to review — they are not official benchmark submissions.

LVK — Level 0

ground networkintrinsic PE · fixed extrinsicbbh-pe-l0

Same Level-0 science target as the LVK board: five intrinsic BBH parameters with extrinsic quantities fixed by the benchmark; canonical ladder slug from release notes.

#TeamMethodC2STExpected coverageKL divergenceJSDIS-ESSDate
1[Demo] Posterior PioneersNeural flow matching0.4800.0600.1100.0359.40e+2Apr 2026
2[Demo] Nested Nebula CrewMultiNest-style nested sampling0.5500.1100.2800.0924.10e+2Apr 2026
3[Demo] Slice & Dice InferenceMean-field VI + normalizing flow0.6200.1800.4500.1402.20e+2Apr 2026

LVK — Level 1

ground networkfull extrinsic (11D)

Same setting as Level 0 with free extrinsic parameters (11D). Higher reference-sampler cost.

#TeamMethodC2STExpected coverageKL divergenceJSDIS-ESSDate
1[Demo] Ten-Dimensional DenParallel-tempered ensemble MCMC0.5200.0900.2100.0686.20e+2Apr 2026
2[Demo] Trans-D WalkersRJMCMC + surrogate likelihood0.5800.1400.3300.1103.80e+2Apr 2026

LISA — Level 0

preview · roadmap

Space-based LISA track, Level 0 definition in preparation; the public leaderboard will populate after the first evaluation window.

#TeamMethodC2STExpected coverageKL divergenceJSDIS-ESSDate
1[Demo] LISA Ladder LeagueSky-marginalized normalizing flow0.4900.0700.1500.0487.10e+2Apr 2026
2[Demo] Ringdown RandomistasHierarchical VB on MBHBs0.5700.1200.3100.1003.40e+2Apr 2026