Researchers explored various spectral-whitening methods that outperform Adam at the Pareto frontier of compute, including Shampoo, SOAP, SPlus, and PSGD, which use different techniques to approximate the whitening metric. These optimizers reliably outperform Adam, with SOAP being the most effective per gradient-step, and Muon being particularly powerful due to its efficient computational properties.