Machine Learning (ML) Beyond Roofline: Mapping and smoothing GPU Performance Ruggedness by Intel A new Intel study shows GPU matrix-multiply performance is a rugged terrain, not a smooth ceiling and introduces "ruggedness analysis" to map it and an optimizer that makes it 30% faster.