QEMU does not model caches at all, so you cannot use it to look at the performance of software in the way you are hoping to do. (In general, trying to estimate performance by running code on a software model is tricky at best, because the behaviour of software models is often significantly different from the behaviour of real hardware, especially for modern hardware which is significantly out-of-order, speculative and microarchitecturally complex. There are a lot of pitfalls for the unwary.)
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…