
Wish I could upvote an article... trying to read this at 8:45am, whilst still half a sleep, left me going.. huh? what? huh? umm?? It did eventually make sense though :D
Oracle reckons Linux remote direct memory access (RDMA) implementations need features like high availability and load balancing, and hopes to sling code into the kernel to do exactly that. The problem, as Oracle Linux kernel developer Sudhakar Dindukurti explained in this post, is that performance and security considerations …
There are many different scenarios that may cause you to have to fail over, this addresses just one of those scenarios. If Oracle could maybe put a little more effort into updating their applications to be more HA then we’d likely get a heck of a lot more of those scenarios covered.
Wasn't HPs Moonshot kinda-sorta about accessing gobs and gobs of memory-speed storage using photonics as the medium? The reason I'm asking is because despite being a registered member of Team Lenovo, I recall being really inspired by HP's whole Moonshot architecture which, had it been built, would have rendered RDMA less important. It seems like we'll get to something like Moonshot, but incrementally and likely over decades.
ran RDMA over an abstracted network device, with the resilience built in to the underlying network. This allowed the network layer to adjust to failures without the RDMA setup being exposed to the changes.
Seemed to work quite well on AIX, and I believe that the Linux support (for the P7 775 9125-F2C) worked the same (or even better!). I'm pretty sure that IBM would have put their work back into the kernel.