Today's reddit meltdown did spawn some funny comments (like this), but did brought my attention to the fact (especially taking into account statements like this) – how much we do depend on the cloud provider? Is it fair to say, that the cloud is the new single point of failure?
I think – yes. The question is not AWS vs Azure vs Rackspace vs my own. The question is how I do balance the loss of one or more of these – yes, I'm speaking about cloud lock-in. So, take IoC to the next level – inject cloud specific implementations based on which cloud you run in. And on the how to manage this – who is watching the watchers? Let's take granted, that you trust the DNS (there is a reason why you need two servers in two subnets…), so it comes back to watching the capacity, the health of the cloud, the health of the other watchers, and the ability to switch quickly.
Is it easy to build such a thing? There are specific solutions for specific usecases, like GreenButton for MPI/Parallel/Map-Reduce cases, Dell Cloud Manager for managing VMs, but – and waiting for suggestions in the comments; do you know a good cloud agnostic solution?