IBM Research has come up with a new package it calls BLU Acceleration, intended to speed up (big) data analysis and reporting of data held in DB2 and Informix databases, amongst others. The BLU Acceleration package includes these handy features: Data skipping - an ability to skip data that doesn't need analysing, such as …
If you have more data than memory...
The more-data-than-memory problem is hard to solve without nearline media. Since Big Data typically has non-relational data as a large component, perhaps they use lossless compression, or invoke some kind of minimally lossy compression.
The lossy approach could be a novel compression scheme, or some way of suppressing duplicate (or near-duplicate) information without losing any analytical capability. That works rather well in many enterprise backup approaches...
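For what it's worth, here is a minimal sketch of the duplicate-suppression idea as backup products tend to do it: hash each chunk and store every distinct chunk only once. It is purely illustrative and says nothing about how BLU Acceleration works internally; the names `dedup_store` and `restore` are made up for the example. Note that exact-duplicate suppression like this is actually lossless; only the near-duplicate variant would throw information away.

```python
import hashlib

def dedup_store(chunks):
    """Store each distinct chunk once, keyed by its SHA-256 digest."""
    store = {}  # digest -> chunk bytes (one copy per distinct chunk)
    refs = []   # ordered digests needed to rebuild the original stream
    for chunk in chunks:
        digest = hashlib.sha256(chunk).hexdigest()
        store.setdefault(digest, chunk)  # keep only the first copy
        refs.append(digest)
    return store, refs

def restore(store, refs):
    """Rebuild the original chunk sequence from the reference list."""
    return [store[d] for d in refs]

# Hypothetical input: five chunks, three of them identical.
chunks = [b"alpha", b"beta", b"alpha", b"alpha", b"gamma"]
store, refs = dedup_store(chunks)
```

Five chunks go in, three get stored, and `restore(store, refs)` gives back the original sequence unchanged, so nothing an analysis might need has been lost.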