Re: IBM's realised file systems are key.
"...With huge and increasing amounts of data, the simple filing systems of Linux/Mac/Windows are no longer adequate. Advanced database filing systems where files have extended metadata for both the user and the file data are sorely needed...."
Ok, and what advantage does the metadata you talk of, give us? Why do we need it? You only say it is needed, but why? Please enlighten us? :o)
.
.
How does this GPFS compare to Lustre + ZFS? The new coming IBM supercomputer at 20Teraflops will use Lustre + ZFS. It will have 55 Petabyte of data, and 0.5-1TB/sec bandwidth. They are porting ZFS to Linux Lustre, because ext does not scale.
http://zfsonlinux.org/docs/LUG11_ZFS_on_Linux_for_Lustre.pdf
http://www.youtube.com/watch?v=c5ASf53v4lI
Nor does ext protect against data corruption. All hard disks have checksums to detect and correct data corruption, but that is not enough. Every once in a while, the disks will encounter random bit flips that the checksums can not correct. Even worse, some of the errors are not even detectable by the disk. The only solution is to use End-To-End checksums, which ZFS does. Thus, ZFS detects and protects against data corruption. There is research on data corruption and ext / ntfs / XFS / JFS / etc here (research shows that ZFS does protect your data):
http://en.wikipedia.org/wiki/ZFS#Data_Integrity