Compares Python golden frames against the live WebGL pipeline, both rendered with the same parameters. A test passes when the mean ΔE is ≤ 5.0. This threshold applies only to non-shake, non-bloom tests; shake and bloom tests are flagged as approximate (math doc §9).
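The pass criterion can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the text does not say which ΔE formula the tool uses, so the sketch assumes CIE76 (Euclidean distance in CIELAB, D65 white point) on 8-bit sRGB pixels. The names `mean_delta_e`, `parity_verdict`, and the `approximate` flag are hypothetical and not taken from the tool itself.

```python
import math

PASS_THRESHOLD = 5.0  # mean ΔE limit for non-shake / non-bloom tests


def _srgb_to_linear(channel):
    """Undo the sRGB transfer curve for one 8-bit channel value."""
    c = channel / 255.0
    return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4


def srgb_to_lab(rgb):
    """Convert an (R, G, B) tuple of 0..255 values to CIELAB (D65)."""
    r, g, b = (_srgb_to_linear(v) for v in rgb)
    # Linear sRGB to CIE XYZ.
    x = 0.4124564 * r + 0.3575761 * g + 0.1804375 * b
    y = 0.2126729 * r + 0.7151522 * g + 0.0721750 * b
    z = 0.0193339 * r + 0.1191920 * g + 0.9503041 * b
    xn, yn, zn = 0.95047, 1.0, 1.08883  # D65 reference white

    def f(t):
        d = 6 / 29
        return t ** (1 / 3) if t > d ** 3 else t / (3 * d * d) + 4 / 29

    fx, fy, fz = f(x / xn), f(y / yn), f(z / zn)
    return (116 * fy - 16, 500 * (fx - fy), 200 * (fy - fz))


def mean_delta_e(golden, live):
    """Mean CIE76 ΔE over two equal-length sequences of RGB pixels."""
    if not golden or len(golden) != len(live):
        raise ValueError("frames must be non-empty and the same size")
    total = sum(
        math.dist(srgb_to_lab(a), srgb_to_lab(b)) for a, b in zip(golden, live)
    )
    return total / len(golden)


def parity_verdict(golden, live, approximate=False):
    """Return (verdict, score). `approximate` marks shake/bloom tests
    (assumed behavior: they are labeled rather than judged pass/fail)."""
    score = mean_delta_e(golden, live)
    if approximate:
        return "approximate", score
    return ("pass" if score <= PASS_THRESHOLD else "fail"), score
```

For example, `parity_verdict(golden_pixels, live_pixels)` on two flattened pixel lists returns the verdict together with the mean ΔE, so a borderline score can be inspected rather than hidden behind the pass/fail label.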