Changes
Summary
- [X86][Costmodel] Now that `getReplicationShuffleCost()` is good, update `getInterleavedMemoryOpCostAVX512()` (details)
- [AArch64][SVE] Mark fixed-type FP extending/truncating loads/stores as custom (details)
- Use a deterministic order when updating the DominatorTree (details)
- fix typos in comments (details)
- [NFC][X86][LV][Costmodel] Add most basic test for masked interleaved load (details)