If you have more data than memory...
The more data than memory problem is hard to solve without nearline-media. Since Big Data typically has non-relational data as a large component, perhaps they do lossless compression or invoke some kind of minimally lossy compression.
Lossy stuff can be novel compression or somehow suppressing duplicate (or near-duplicate) information, without losing any analytical capability. That is working rather well in many enterprise backup approaches...