Potential law suit over the name incoming in 10 ... 9 ... 8 ...
http://www.cs.wustl.edu/~schmidt/TAO.html (the name's a trademark).
Facebook has revealed details about Tao, its multi-petabyte data store for the company's social graph. Though Facebook's social network may have little relevance for IT pros, its internal infrastructure does, because here the social network is dealing with quantities of information so vast that it has to come up with new ways …
Posting Anon for some obvious reasons...
First TAO isn't really a data warehouse. Its a way of abstracting out the underlying MySQL as the pull data from MySQL stores in to HDFS which in itself isn't really a data warehouse.
From the article:
""The fact Tao is using MySQL is completely hidden away from the client," Facebook director of engineering Venkat Venkataramani, tells The Register. "We haven't found anything that is better than MySQL, we are constantly looking at that.""
This isn't totally true.
There are better solutions, however they don't want to pay for it and they will of course want an open source product which they can hack.
In truth, they will need to do a massive re-architecture of their infrastructure in order to replace their reliance on their MySQL sharded machines.
Because FB doesn't think beyond the short term fix... er hack as they call it... they are going to have problems trying to solve things for the long game.
There may be some truth to it. Think of it as AOL 2.0. Oh wait, AOL did a 2.0 upgrade so lets call it AOL 3.0
Unlike AOL, FB can be free because they've figured out a way to monetize their user base in to information.
But like AOL... unless they can constantly keep the attention of 200+ million human 'batteries', they will fail.
(That was a Matrix movie reference BTW)