Leaderboards

Public rankings by benchmark task. Sort any column — hover a metric header for a short description.

See Benchmarks for task definitions and Evaluation for metric definitions (tables list the scalar scores used here; P–P style diagnostics are on the Evaluation page as plots).

Illustrative rows only. Entries whose team name starts with [Demo] are seeded with placeholder scores so the layout is easy to review — they are not official benchmark submissions.

LVK — Level 0

ground networkintrinsic PE · fixed extrinsicbbh-pe-l0

Same Level-0 science target as the LVK board: five intrinsic BBH parameters with extrinsic quantities fixed by the benchmark; canonical ladder slug from release notes.

#	Team	Method	C2ST↓	Expected coverage↓	KL divergence↓	JSD↓	IS-ESS↑	Date
1	[Demo] Posterior Pioneers	Neural flow matching	0.480	0.060	0.110	0.035	9.40e+2	Apr 2026
2	[Demo] Nested Nebula Crew	MultiNest-style nested sampling	0.550	0.110	0.280	0.092	4.10e+2	Apr 2026
3	[Demo] Slice & Dice Inference	Mean-field VI + normalizing flow	0.620	0.180	0.450	0.140	2.20e+2	Apr 2026

LVK — Level 1

ground networkfull extrinsic (11D)

Same setting as Level 0 with free extrinsic parameters (11D). Higher reference-sampler cost.

#	Team	Method	C2ST↓	Expected coverage↓	KL divergence↓	JSD↓	IS-ESS↑	Date
1	[Demo] Ten-Dimensional Den	Parallel-tempered ensemble MCMC	0.520	0.090	0.210	0.068	6.20e+2	Apr 2026
2	[Demo] Trans-D Walkers	RJMCMC + surrogate likelihood	0.580	0.140	0.330	0.110	3.80e+2	Apr 2026

LISA — Level 0

preview · roadmap

Space-based LISA track, Level 0 definition in preparation; the public leaderboard will populate after the first evaluation window.

#	Team	Method	C2ST↓	Expected coverage↓	KL divergence↓	JSD↓	IS-ESS↑	Date
1	[Demo] LISA Ladder League	Sky-marginalized normalizing flow	0.490	0.070	0.150	0.048	7.10e+2	Apr 2026
2	[Demo] Ringdown Randomistas	Hierarchical VB on MBHBs	0.570	0.120	0.310	0.100	3.40e+2	Apr 2026