Misplaced Moaning
Out of 600 DIMMs, one failing every 2 months is 6 DIMMs/year. 1% failure rate doesn't seem outrageous. With ECC memory, the chances are that you'll get some warning in time to sort things out before the machine suffers total failure. And all your production servers are redundant, of course, and if one fails you have others that will transparently take over it's workload - right?
If you are bemoaning a 1%/year failure rate on DIMMs, it makes me wonder what your failure rate on disks is. I know that my %/year failure rate of disks is several times that of memory modules.
Re: fan failure rate - fans fail. Even decently branded ball bearing ones will get loose after a couple of years of 24/7 operation - cheap bundled fans much sooner than that. Having variable speed fans that slow down when the temperatures drop also helps to prolong their life. And you do have lm_sensors' sensord monitoring fan rpm rates and alerting you when they drop below a healthy minimum, so you can act on it before the machine overheats and crashes - right?
And Lacie NAS, you say? That well known enterprise brand?
Save the moaning - it sounds like the energy would be better spent on better forward planning, redundancy and monitoring. If those aren't covered, you only have yourself to blame.
