how about defining it first
what is big data? how big is big ?
I know some folks who claim be "big data" people but operate on only a few TBs worth of data.
Is big data more about say structured vs unstructured data rather than the volume of data? If you have a 5 node hadoop cluster with 50GB of data does that mean your running big data? How about how much active data vs inactive data ? You may have 600TB of data but maybe 95% of the jobs only operate on the most recent 5TB.
Perhaps big data is data that doesn't fit in a traditional old school SQL database? (there are of course mostly SQL compliant databases that can operate on massive amounts of data pretty easily).
From what I've heard the MySQL sharding system backed with FusionIO at facebook stores quite a bit of data, is that big data?
what is big data?!