The Greenplum data analytics unit of disk storage giant EMC is tweaking its Data Center Appliances, not only offering a more modular architecture and pricing scheme that lets companies start small and grow their analytics, but also allowing for the mixing of the Greenplum parallel database with Hadoop nodes within the same …
At least one El Reg reader is interested in Greenplum...
..."last all the nodes together"...I suspect you mean "lash". We used GigE on our demo system. Not sure how much lashing was involved.
..."extract/test/load (ETL)"...try extract/transform/load, but you knew that ;-)
I have heard that Greenplum use ZFS, because Greenplum has such a high performance that they move very many bits. Statistically, you face data corruption every 15 minutes at those high speeds. That is the reason they use ZFS, because ZFS protects against bit rot and data corruption.
Can anyone confirm?
- Vid Hubble 'scope snaps 200,000-ton chunky crumble conundrum
- Updated + vids WHOA: Get a load of Asteroid DX110 JUST MISSING planet EARTH
- 10 years of Facebook Inside Facebook's engineering labs: Hardware heaven, HP hell – PICTURES
- Very fabric of space-time RIPPED apart in latest Hubble pic
- Massive new AIRSHIP to enter commercial service at British dirigible base