gpfs

IBM goes for really, really, really big data

According to an article in this week's MIT Technology Review, IBM researchers are building a 120-petabyte data repository made up of 200,000 conventional hard disk drives working together. The giant data container is expected to store around 1 trillion files and should provide the space needed for more powerful simulations of complex systems, such as those used to model weather and climate.

The new system benefits from the General Parallel File System (GPFS), a file system developed at IBM Almaden to give supercomputers faster data access. It spreads individual files across multiple …
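
Spreading a single file across several disks is generally called striping. The sketch below is only a toy illustration of that idea, assuming fixed-size blocks dealt out round-robin. The function names, block size, and disk count are hypothetical and do not reflect how GPFS actually lays out data.

```python
def stripe(data: bytes, num_disks: int, block_size: int) -> list[list[bytes]]:
    """Split data into fixed-size blocks and deal them round-robin to disks.

    Hypothetical helper for illustration; not a GPFS API.
    """
    if num_disks < 1 or block_size < 1:
        raise ValueError("num_disks and block_size must be positive")
    disks: list[list[bytes]] = [[] for _ in range(num_disks)]
    for offset in range(0, len(data), block_size):
        block_index = offset // block_size
        # Block i goes to disk i mod num_disks.
        disks[block_index % num_disks].append(data[offset:offset + block_size])
    return disks


def reassemble(disks: list[list[bytes]]) -> bytes:
    """Read the blocks back in their original order."""
    blocks = []
    longest = max((len(d) for d in disks), default=0)
    for row in range(longest):
        for disk in disks:
            # Later disks may hold one block fewer than earlier ones.
            if row < len(disk):
                blocks.append(disk[row])
    return b"".join(blocks)


if __name__ == "__main__":
    payload = bytes(range(10))
    layout = stripe(payload, num_disks=3, block_size=2)
    for i, blocks in enumerate(layout):
        print(f"disk {i}: {[list(b) for b in blocks]}")
    print(reassemble(layout) == payload)
```

Because consecutive blocks sit on different disks, a large read can in principle pull from all of them at once, which is the intuition behind the faster data access mentioned above.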