The Greenplum data analytics unit of disk storage giant EMC is tweaking its Data Center Appliances, not only offering a more modular architecture and pricing scheme that lets companies start small and grow their analytics, but also allowing for the mixing of the Greenplum parallel database with Hadoop nodes within the same …
At least one El Reg reader is interested in Greenplum...
..."last all the nodes together"...I suspect you mean "lash". We used GigE on our demo system. Not sure how much lashing was involved.
..."extract/test/load (ETL)"...try extract/transform/load, but you knew that ;-)
I have heard that Greenplum uses ZFS because its performance is so high that it moves an enormous number of bits. Statistically, at those speeds you would face data corruption roughly every 15 minutes. That is supposedly why they use ZFS: it protects against bit rot and data corruption.
Can anyone confirm?