Latest commit

loading…

Trends

Metrics are grouped by suite and shown with readable labels; hover any label (or the raw stem_us beneath it) for the dataset + query it measures. Every metric is smaller-is-better (customSmallerIsBetter); the Δ-vs-best pill compares the latest commit against the all-time minimum for that metric.

Scaling

How a metric scales with dataset size / depth, for suites run at multiple sizes (e.g. Deep Taxonomy depths, WatDiv scale factors). The size/depth axis is read from the metric name.