I've said it before, I'll say it again. The future is lower latency RAM. And stacked behind the CPU to minimise distance related latency.
Then there's no need for caches, prefetching, speculative execution, branch prediction etc. Just simple high speed in/out processing.
Assuming we're not all using quantum computers by then.