Chaos Monkey Alternatives - Google Cloud Platform

2 minute read

Google Cloud Chaos Monkey

Google Cloud Chaos Monkey is an open-source tool written in F# that performs a few basic Chaos Experiments on Google Cloud instances. Instance target selection is pseudo-randomized and based on probabilities.

  • Reset: Resets the targeted instance.
  • Stop and Start: Stops the instance and then restarts it after 3 seconds.

Engineering Chaos Experiments on TensorFlow with Gremlin

It’s safe and easy to perform Chaos Experiments on Google Cloud Platform services like TensorFlow with Gremlin’s Failure as a Service. As a clustered configuration TensorFlow is ideal for running Chaos Experiments to test the fault tolerance and stability of each clustered node. Gremlin can execute a wide range of attacks on your TensorFlow cluster, including overloading CPU, memory, disk, and IO; killing processes; altering system time; modifying network traffic; and more.

Check out the full tutorial over on our community site.

Swapping Kubernetes Nodes in GKE with Chaos Toolkit

If you’re looking to get into Chaos Engineering with Kubernetes check out our How to Deploy Spinnaker on Kubernetes tutorial to get started with Amazon EKS. On the other hand, to induce Chaos on Google Cloud Kubernetes Engine clusters you can use the Chaos Toolkit Google Cloud driver. A relatively simple failure injection involves attacking a node pool with an HTTP load balancing library like Vegeta and then calling the swap_nodepool function to create a new node pool and drain the old pool. This forces pods to be moved to the new pool.

Below is a simple Action configuration to perform such an attack, which can be added to your configuration file before executing Chaos Toolkit.

{
    "type": "action",
    "name": "swap-out-nodepools",
    "provider": {
        "type": "python",
        "module": "chaosgce.nodepool.actions",
        "func": "swap_nodepool",
        "secrets": ["gce"],
        "arguments": {
            "body": {
                "nodePool": {
                    "config": { 
                        "oauthScopes": [
                            "gke-version-default",
                            "https://www.googleapis.com/auth/devstorage.read_only",
                            "https://www.googleapis.com/auth/logging.write",
                            "https://www.googleapis.com/auth/monitoring",
                            "https://www.googleapis.com/auth/service.management.readonly",
                            "https://www.googleapis.com/auth/servicecontrol",
                            "https://www.googleapis.com/auth/trace.append"
                        ]
                    },
                    "initialNodeCount": 3,
                    "name": "new-default-pool"
                }
            }
        }
    }
}

Head over to our Kubernetes Alternatives - Chaos Toolkit section for more details on the inner-workings and configuration of Chaos Toolkit. Additionally, check out this tutorial for more information on node swapping with Chaos Toolkit.