1 post • joined Wednesday 8th February 2012 20:53 GMT
Apes certainly do have foxp2, and it isn't even that different.
The files are so large because they contain all shotgun reads, mapped to their apparent locations on the (I presume human) reference. The mappings are ~150GB compressed. The read information contains quality information as well as the actual base calls, inflating the size. Quality info is important, especially when the genome is from a dodgy old sample!