"Insufficient VM replicas."
I would go for poor infrastructure design and/or a failed or untested implementation as the most likely general cause, followed by inadequate backups or DR facilities and procedures if it takes a week or more to restore services...