IBM Research has come up with a new package it calls BLU Acceleration, intended to speed up (big) data analysis and reporting of data held in DB2 and Informix databases, amongst others. The BLU Acceleration package includes these handy features: Data skipping - an ability to skip data that doesn't need analysing, such as …
If you have more data than memory...
The more-data-than-memory problem is hard to solve without nearline media. Since Big Data typically has non-relational data as a large component, perhaps they use lossless compression, or invoke some kind of minimally lossy compression.
The lossy part could be a novel compression scheme, or some way of suppressing duplicate (or near-duplicate) information without losing any analytical capability. That works rather well in many enterprise backup approaches...
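To make the duplicate-suppression idea concrete, here is a minimal Python sketch of the kind of chunk-level deduplication that backup products commonly use. It is only an illustration: nothing here is known to reflect how BLU Acceleration works, the function names `dedup_store` and `restore` and the tiny fixed chunk size are made up for the example, and it handles exact duplicates only, so it is lossless. Near-duplicate suppression would need something more elaborate.

```python
import hashlib


def dedup_store(data: bytes, chunk_size: int = 4):
    """Split data into fixed-size chunks and keep each distinct chunk once.

    Hypothetical helper for illustration; real systems use far larger
    (often variable-size) chunks.
    """
    store = {}   # SHA-256 digest -> chunk bytes, one entry per distinct chunk
    recipe = []  # ordered digests needed to rebuild the original input
    for i in range(0, len(data), chunk_size):
        chunk = data[i:i + chunk_size]
        digest = hashlib.sha256(chunk).hexdigest()
        store.setdefault(digest, chunk)  # repeated chunks are not stored again
        recipe.append(digest)
    return store, recipe


def restore(store, recipe) -> bytes:
    """Rebuild the original bytes by following the recipe of digests."""
    return b"".join(store[d] for d in recipe)


if __name__ == "__main__":
    data = b"ABCDABCDABCDWXYZ"
    store, recipe = dedup_store(data)
    print(len(recipe), "chunks referenced,", len(store), "chunks stored")
    print(restore(store, recipe) == data)
```

With repetitive input the store holds far fewer chunks than the recipe references, which is where the space saving comes from; the recipe itself still costs one digest per chunk.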