Changelog
All notable changes to this project will be documented in this file.
Version 0.2.2 (2026-04-28)
Added
Geometric evaluation metrics
geom_at_k()andgeom_at_k_ci()for questionwise geometric blends of Pass@k and Unanimous@k.geom_ds_at_k()andgeom_ds_at_k_ci()for dataset-level Pass/Unanimous endpoint blends.geo_spectrum_at_k()andgeo_spectrum_at_k_ci()for configurable GeoSpectrum metrics with threshold-spectrum weights.geo_spectrum_star_at_k()andgeo_spectrum_star_at_k_ci()for the default upper-half GeoSpectrum operating point.threshold_spectrum_at_k()andthreshold_spectrum_at_k_ci()for finite-bank threshold-spectrum summaries.
Version 0.1.0 (2025-12-15)
Initial release
Added
Pass@k metrics
Standard
pass_at_k()(at least one correct)pass_hat_k()/g_pass_at_k()(all correct)g_pass_at_k_tau()with threshold parametermg_pass_at_k()