Live Dashboard & UX (v0.44.0)

21 cross-cutting UX features, +192 tests.

Highlights:

  • Format-gate row helpers for the eval-gate dashboard
  • HF resume: checkpoint-skipping optimization
  • Polish across run cost, replay, profiling, and the Textual TUI

See [observability](/docs/observability) for `soup why`, `soup tui`, and crash bundles.