EMC's Data Domain has got itself DD Boost software which pre-processes backup data on a media server to increase deduplication speed by up to 50 per cent. The software is installed on a media server and is integrated with either Symantec NetBackup or Backup Exec. It is a library and is used to identify segments in incoming data …
That is one hefty increase..
"Its DD880 is now rated at 8.8TB/hour because of DD Boost, up from the initial 5.4B/hour"
That's a speed increase of 1.55555556 × 10^12 times according to google.
There's no free lunch
I can’t speak for all the competition for Data Domain, but the introduction of DD Boost hasn’t caused anyone at Quantum to jump off a bridge. It essentially amounts to a rebranding of support for OST, as an extension of their Global Deduplication Array. Reducing redundancy at the media server as DD Boost requires could reduce the load on the network and the target, but it means that the media server is doing more. So the change is likely to mean customers will actually need a bigger media server – there’s no free lunch. Quantum’s mid-range DXi6500 family sees a 25 to 40% performance increase when we use the OST interface without changing anything on the media server. Do you suppose Quantum might extend that capability to enterprise products in the near future? Watch.
Customers should also consider the complexity of a distributed solution like DD Boost. It can be time-consuming to configure, may require hardware upgrades, and troubleshooting can become extremely cumbersome. The simplicity of DXi, whether with OST or not, provides increased speed and better economic value without the extra complexity of distributed processing.
Mark: I hear what you're saying and agree that DD Boost is just a rebrand of their DD OST - just added a BO to it.
But when you look at backup performance, purely, how can you reconcile the performance increase you state? The data is the data is the data and whether it is a full backup or an incremental backup that data, in its life size form, is being sent across the wire.
The only way i can see backup performance being truly increased as it relates to the wire transfer is if you're not sending all the data.
speeding up the destination target IS an important aspect to backup performance, but that's only half of the solution.
EMC's "claim" thus far in my opinion is an empty one. I've not seen any benchmarks or white paper results showing the facts.
Now, that's not to say that they won't do something interesting with Legato...but that's a while away yet.
DD Boost reminds me of the scene in the movie "Hook" when all the kids are "eating a huge buffet of food" but Robin Williams sees nothing in front of him. And the kids say "use your imagination Peter"...that's what I think DD boOST is - an imaginary buffet.
EMC / Data Domain is NOT faster than VTL
"Data Domain says that overall resource use on the media server is reduced by 20 to 40 per cent because of a reduced data copy overhead."
Data copy overhead? This is a patently stupid and nonsensical statement by EMC. There is no such thing as copy overhead - the whole backup process is a data movement process. The data moves through the media server whether it's raw or deduped. Deduplication on the media server creates ADDITIONAL overhead just as it does with the CommVault deduplication. This is basic backup physics. It's also the very same argument that the Avamar sales team at EMC claims when competing against CommVault's dedupe strategy. (They used to claim that you had to reduce data on the host to gain efficiency).
Here are the basics:
The whole point of using a solution like this is to reduce your backup window... so why would you add an additional process that INCREASES your backup window?
Inline deduplication is no where close to being as efficient as going straight to VTL. Many enterprise customers have tested it and inline dedupe (whether from EMC/Data Domain or others) loses every time in head-to-head tests. Why? Because the inline process creates REAL overhead and slows down the backup job. EMC has been getting killed in Enterprise accounts for this (and for the lack of global dedupe and lack of FC connectivity... but that's a different story). So what did EMC do to fix the problem? They split up the dedupe process between the media server and the appliance with beefy processors. The improvement they get is not from the design, but from the chips... and even then it's not as fast as writing at full speed to a VTL. The design is still flawed.
Any process that moves data between point A and point B is fastest without ADDITIONAL processes (like dedupe) in the way. If your backup job takes X amount of time to complete when writing directly to a VTL, any additional processes (no matter the speed of the processor) will incrementally add to the time to complete your backup job.
It's the very worst combination of what CommVault does with media server dedupe combined with the very worst of data domain - inline dedupe.
This new version is a band aid to mask the problems of inline dedupe which is why EMC is touting improvement speeds without comparing it to the speed of writing to multiple VTLs with global dedupe. This is how they are losing in the enterprise. Its why they always test these for customers using small data sets. When the customer wants to test a real full backup set, they balk.
EMC’s Brian Biles said in the DD Boost press release: “There has been no significant change in roles between backup clients, backup servers and target storage in the last 20 years of traditional backup software deployment architectures, until right now…DD Boost literally thinks outside the box.” Unfortunately this statement does not acknowledge significant advancements in the backup and storage industry. Symantec OST is one of those advancements, a technology on which DD Boost relies. Another is FalconStor’s Backup Accelerator, a technology that requires no software or load on the backup server, works with all backup applications, and accelerators backup and transfer of data to deduplication targets by 400 percent.