Why has it taken so long?
It's obviously not beyond the resources of an Apple or Samsung to design their own RISC cpu.
The real clever stuff comes in the process design which, at least on the higher end parts, you need to do yourself.
With opensource compilers the tool chain and so the experienced developers are there for you.
I always assumed it was that if you were an Apple or Samsung the royalties to ARM dropped to teh point where they were just less than creating your own architecture