RabbitMQ Autocluster

What it Does

This plugin provides a mechanism for peer node discovery in RabbitMQ clusters. It also supports a few opinionated features around cluster formation and "permanently unavailable" node detection.

Nodes using this plugin discover their peers on boot and (optionally) register with one of the supported backends (AWS, Consul, DNS, etcd or Kubernetes).

If at least one peer node has been discovered, cluster formation proceeds as usual, otherwise the node is considered to be the first one to come up and becomes the seed node.

To avoid a natural race condition around seed node "election" when a newly formed cluster first boots, peer discovery backends use either randomized delays or a locking mechanism.

Some backends support node health checks. Nodes not reporting their status periodically are considered to be in an errored state. If the user opts in, such nodes can be automatically removed from the cluster. This is useful for deployments that use AWS autoscaling groups or similar IaaS features, for example.

This plugin only covers cluster formation and does not change how RabbitMQ clusters operate once formed.

Note: This plugin is not a replacement for first-hand knowledge of how to manually create a RabbitMQ cluster. If you run into issues using the plugin, try to create the cluster manually in the same environment in which you are using the plugin. For information on how to cluster RabbitMQ manually, please see the RabbitMQ documentation.

Current Maintainers

This plugin was originally developed by Gavin Roy at AWeber and is now co-maintained by several RabbitMQ core contributors. Parts of it were adopted into RabbitMQ core (as of 3.7.0).

Supported RabbitMQ Versions

There are two branches in this repository that target different RabbitMQ release series:

- stable targets RabbitMQ 3.6.x (current stable RabbitMQ branch)
- master targets RabbitMQ 3.7.x (current master RabbitMQ branch)

Please take this into account when building this plugin from source.

Please also note that key ideas of this plugin have been incorporated into the RabbitMQ master branch and will be included in 3.7.0. This plugin will therefore become a collection of backends (e.g. AWS and etcd) rather than a wholesale alternative cluster formation implementation.

Supported Erlang Versions

This plugin requires Erlang/OTP 17.5 or later. Also see the RabbitMQ Erlang version requirements guide.

Binary Releases

Binary releases of autocluster can be found on the GitHub Releases page.

The most recent release is 0.8.0, which targets RabbitMQ 3.6.10 or later.

Check for version compatibility in plugin release notes.

Installation

This plugin is installed the same way as other RabbitMQ plugins.

Place both autocluster-{version}.ez and the rabbitmq_aws-{version}.ez plugin files in the RabbitMQ plugins directory.
Enable the plugin, e.g. with rabbitmq-plugins enable autocluster --offline.
Configure the plugin.
Start the node.
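
The first two steps above can be sketched as a small shell function. This is only a sketch under assumptions: the function name `install_autocluster` and its two directory arguments are hypothetical, and only the `rabbitmq-plugins enable autocluster --offline` command is taken from the steps themselves.

```shell
# Hypothetical helper: copy the plugin archives and enable the plugin offline.
#   $1: directory holding the downloaded .ez files (assumption)
#   $2: the RabbitMQ plugins directory of your installation (assumption)
install_autocluster() {
    src_dir=$1
    plugins_dir=$2

    # Step 1: place both plugin archives in the plugins directory.
    cp "$src_dir"/autocluster-*.ez "$src_dir"/rabbitmq_aws-*.ez "$plugins_dir"/ || return 1

    # Step 2: enable the plugin while the node is stopped.
    rabbitmq-plugins enable autocluster --offline || return 1

    # Steps 3 and 4 (configure the plugin, start the node) depend on your
    # environment and are left to the operator.
}
```

It would be called with your own paths, for example `install_autocluster "$HOME/downloads" /path/to/rabbitmq/plugins`, where both paths are placeholders.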

Alternatively, there is a pre-built Docker image available on DockerHub as pivotalrabbitmq/rabbitmq-autocluster.

Note: As of version 0.5, the autocluster plugin does not have a default backend configured. See the Project Wiki for configuration details.

Configuration

General settings
Consul configuration
DNS configuration
- Example Configuration
- Troubleshooting
etcd configuration
K8S configuration
- Kubernetes Setup

General settings

Configuration for the plugin can be set in two places: operating system environment variables or the rabbitmq.config file under the autocluster stanza.
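
As a rough illustration of the second option, the snippet below writes an example file containing an autocluster stanza. The output path and the chosen values are assumptions made for this sketch; the keys are the setting keys documented in the tables that follow.

```shell
# Write an example config file with an autocluster stanza.
# The file name is a placeholder; point CONFIG_FILE at your real rabbitmq.config.
CONFIG_FILE="${CONFIG_FILE:-./rabbitmq.config.example}"

# Atoms (consul, ignore, true) are written bare; integers are written as-is.
cat > "$CONFIG_FILE" <<'EOF'
[
  {autocluster, [
    {backend, consul},
    {startup_delay, 5},
    {autocluster_failure, ignore},
    {cleanup_warn_only, true}
  ]}
].
EOF
```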

Available Settings

The following settings are available for all service discovery backends:

Backend Type: Which type of service discovery backend to use. One of aws, consul, dns, etcd or k8s.
Startup Delay: To prevent a race condition when creating a new cluster for the first time, each node sleeps for a random interval on startup so that nodes start at slightly different offsets from each other. This setting controls the maximum value of the startup delay.
Failure Mode: What behavior to use when the node fails to cluster with an existing RabbitMQ cluster or during initialization of the autocluster plugin. The two valid options are ignore and stop.
Log Level: You can set the log level via the environment variable AUTOCLUSTER_LOG_LEVEL or the autocluster.autocluster_log_level key (see below).
Longname (FQDN) Support: This is a RabbitMQ environment variable setting that is used by the autocluster plugin as well. When set to true, this will cause RabbitMQ and the autocluster plugin to use fully qualified names to identify nodes. For more information about the RABBITMQ_USE_LONGNAME environment variable, see the RabbitMQ documentation.
Node Name: Like Longname Support, Node Name is a RabbitMQ setting that is used by the autocluster plugin as well. The RABBITMQ_NODENAME environment variable explicitly sets the node name that is used to identify the node with RabbitMQ. The autocluster plugin will use this value when constructing the local part/name/prefix for all nodes in this cluster. For example, if RABBITMQ_NODENAME is set to bunny@rabbit1, bunny will be prefixed to all nodes discovered by the various backends. For more information about the RABBITMQ_NODENAME environment variable, see the RabbitMQ documentation.
Node Type: Define the type of node to join the cluster as. One of disc or ram. See the RabbitMQ Clustering Guide for more information.
Cluster Cleanup: Enables a periodic check that removes any nodes that are not alive in the cluster and are no longer listed in the service discovery list. This is a destructive action that removes nodes from the cluster. Nodes that are flapping and get removed will be re-added as if they were new, and their database, including any persisted messages, will be gone. To use this feature, you must not only enable it with this flag, but also disable the "Cleanup Warn Only" flag. Added in v0.5.
Note: This is an experimental feature and should be used with caution.
Cleanup Interval: If cluster cleanup is enabled, this is the interval that specifies how often to look for dead nodes to remove (in seconds). Added in v0.5.
Cleanup Warn Only: If set, the plugin will only warn about nodes that it would clean up and will not perform any destructive actions on the cluster. Added in v0.5.
HTTP Proxy: If set, the given HTTP URL will be used as a proxy to connect to the service discovery backend.
HTTPS Proxy: If set, the given HTTPS URL will be used as a proxy to connect to the service discovery backend.
No Proxy: List of host names which shouldn't use any proxy.
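
To make the Startup Delay setting above more concrete, here is a minimal sketch of a bounded random delay. It only illustrates the idea and is not the plugin's own code (the plugin is written in Erlang); the function name `random_startup_delay` is hypothetical, and treating the value as seconds is an assumption.

```shell
# Print a random whole number between 0 and $1 (inclusive).
# Illustration only; assumes the delay is measured in seconds.
random_startup_delay() {
    max=$1
    # Read two random bytes as one unsigned integer (0..65535).
    n=$(od -A n -N 2 -t u2 /dev/urandom | tr -d ' ')
    # Fold the value into the range 0..max.
    echo $(( n % (max + 1) ))
}

# Usage: wait for at most AUTOCLUSTER_DELAY (default 5) before joining.
# sleep "$(random_startup_delay "${AUTOCLUSTER_DELAY:-5}")"
```

Because every node draws its own delay, two nodes booting at the same moment are unlikely to both conclude that they are the first node of a new cluster.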

Setting	Environment Variable	Setting Key	Type	Default
Backend Type	`AUTOCLUSTER_TYPE`	`backend`	`atom`	`unconfigured`
Startup Delay	`AUTOCLUSTER_DELAY`	`startup_delay`	`integer`	`5`
Failure Mode	`AUTOCLUSTER_FAILURE`	`autocluster_failure`	`atom`	`ignore`
Log Level	`AUTOCLUSTER_LOG_LEVEL`	`autocluster_log_level`	`atom`	`info`
Longname	`RABBITMQ_USE_LONGNAME`		`bool`	`false`
Node Name	`RABBITMQ_NODENAME`		`string`	`rabbit@$HOSTNAME`
Node Type	`RABBITMQ_NODE_TYPE`	`node_type`	`atom`	`disc`
Cluster Cleanup	`AUTOCLUSTER_CLEANUP`	`cluster_cleanup`	`bool`	`false`
Cleanup Interval	`CLEANUP_INTERVAL`	`cleanup_interval`	`integer`	`60`
Cleanup Warn Only	`CLEANUP_WARN_ONLY`	`cleanup_warn_only`	`bool`	`true`
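
The general settings in the table above could be supplied through the environment as sketched below. The chosen values are illustrative, and `check_autocluster_env` is a hypothetical helper (not part of the plugin) that only checks values against the options listed in this section.

```shell
# Illustrative values; variable names come from the table above.
export AUTOCLUSTER_TYPE=consul
export AUTOCLUSTER_DELAY=10
export AUTOCLUSTER_FAILURE=stop
export RABBITMQ_NODE_TYPE=disc

# Cluster cleanup only removes nodes when it is enabled AND warn-only is off.
# This is destructive and experimental; keep CLEANUP_WARN_ONLY=true to observe first.
export AUTOCLUSTER_CLEANUP=true
export CLEANUP_INTERVAL=60
export CLEANUP_WARN_ONLY=false

# Hypothetical sanity check against the documented options.
check_autocluster_env() {
    case "$AUTOCLUSTER_TYPE" in
        aws|consul|dns|etcd|k8s) ;;
        *) echo "invalid AUTOCLUSTER_TYPE: $AUTOCLUSTER_TYPE" >&2; return 1 ;;
    esac
    case "$AUTOCLUSTER_FAILURE" in
        ignore|stop) ;;
        *) echo "invalid AUTOCLUSTER_FAILURE: $AUTOCLUSTER_FAILURE" >&2; return 1 ;;
    esac
    case "$RABBITMQ_NODE_TYPE" in
        disc|ram) ;;
        *) echo "invalid RABBITMQ_NODE_TYPE: $RABBITMQ_NODE_TYPE" >&2; return 1 ;;
    esac
}
```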

Environment Variable	Setting Key	Type	Default
`AWS_AUTOSCALING`	`aws_autoscaling`	`atom`	`false`
`AWS_EC2_TAGS`	`aws_ec2_tags`	`[string()]`
`AWS_USE_PRIVATE_IP`	`aws_use_private_ip`	`atom`	`false`

Environment Variable	Setting Key	Type	Default
`AWS_ACCESS_KEY_ID`	`aws_access_key`	`string`
`AWS_SECRET_ACCESS_KEY`	`aws_secret_key`	`string`
`AWS_DEFAULT_REGION`	`aws_ec2_region`	`string`	`us-east-1`
`AWS_DEFAULT_PROFILE`	N/A	`string`
`AWS_CONFIG_FILE`	N/A	`string`
`AWS_SHARED_CREDENTIALS_FILE`	N/A	`string`
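
As a hedged example of the AWS variables in the two tables above, the following selects the aws backend with autoscaling-based discovery. The values are illustrative. Credentials are deliberately not set here; this sketch assumes they are provided separately through the credential variables or files listed above.

```shell
# Select the AWS backend and discover peers via the autoscaling group.
export AUTOCLUSTER_TYPE=aws
export AWS_AUTOSCALING=true
export AWS_USE_PRIVATE_IP=true
export AWS_DEFAULT_REGION=us-east-1

# AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY (or a profile / credentials file)
# are expected to be supplied by your own secret management, not hard-coded.
```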

Setting	Environment Variable	Setting Key	Type	Default
Consul Scheme	`CONSUL_SCHEME`	`consul_scheme`	`string`	`http`
Consul Host	`CONSUL_HOST`	`consul_host`	`string`	`localhost`
Consul Port	`CONSUL_PORT`	`consul_port`	`integer`	`8500`
Consul ACL Token	`CONSUL_ACL_TOKEN`	`consul_acl_token`	`string`
Service Name	`CONSUL_SVC`	`consul_svc`	`string`	`rabbitmq`
Service Address	`CONSUL_SVC_ADDR`	`consul_svc_addr`	`string`
Service Auto Address	`CONSUL_SVC_ADDR_AUTO`	`consul_svc_addr_auto`	`boolean`	`false`
Service Auto Address by NIC	`CONSUL_SVC_ADDR_NIC`	`consul_svc_addr_nic`	`string`
Service Port	`CONSUL_SVC_PORT`	`consul_svc_port`	`integer`	`5672`
Service TTL	`CONSUL_SVC_TTL`	`consul_svc_ttl`	`integer`	`30`
Consul Use Longname	`CONSUL_USE_LONGNAME`	`consul_use_longname`	`boolean`	`false`
Consul Domain	`CONSUL_DOMAIN`	`consul_domain`	`string`	`consul`
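
A Consul setup based on the table above might look like the sketch below. The host name `consul.example.internal` is a made-up placeholder, and the composed `CONSUL_URL` is only a convenience for this example, not a variable the plugin reads.

```shell
export AUTOCLUSTER_TYPE=consul
export CONSUL_SCHEME=http
export CONSUL_HOST=consul.example.internal   # hypothetical host
export CONSUL_PORT=8500
export CONSUL_SVC=rabbitmq
export CONSUL_SVC_TTL=30

# Not read by the plugin: shows how scheme, host and port combine.
CONSUL_URL="${CONSUL_SCHEME}://${CONSUL_HOST}:${CONSUL_PORT}"
echo "Consul endpoint: ${CONSUL_URL}"
```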

Environment Variable	`AUTOCLUSTER_HOST`
Setting Key	`autocluster_host`
Data type	`string`
Default Value	`consul`

Setting	Environment Variable	Setting Key	Type	Default
etcd Scheme	`ETCD_SCHEME`	`etcd_scheme`	`list`	`http`
etcd Host	`ETCD_HOST`	`etcd_host`	`list`	`localhost`
etcd Port	`ETCD_PORT`	`etcd_port`	`integer`	`2379`
etcd Key Prefix	`ETCD_PREFIX`	`etcd_prefix`	`list`	`rabbitmq`
etcd Node TTL	`ETCD_TTL`	`etcd_ttl`	`integer`	`30`
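
For etcd, the equivalent environment could be sketched as follows. The host name is a hypothetical placeholder; the other values are the defaults from the table above.

```shell
export AUTOCLUSTER_TYPE=etcd
export ETCD_SCHEME=http
export ETCD_HOST=etcd.example.internal   # hypothetical host
export ETCD_PORT=2379
export ETCD_PREFIX=rabbitmq
export ETCD_TTL=30
```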

Setting	Environment Variable	Setting Key	Type	Default
K8S Scheme	`K8S_SCHEME`	`k8s_scheme`	`string`	`https`
K8S Host	`K8S_HOST`	`k8s_host`	`string`	`kubernetes.default.svc.cluster.local`
K8S Port	`K8S_PORT`	`k8s_port`	`integer`	`443`
K8S Token Path	`K8S_TOKEN_PATH`	`k8s_token_path`	`string`	`/var/run/secrets/kubernetes.io/serviceaccount/token`
K8S Cert Path	`K8S_CERT_PATH`	`k8s_cert_path`	`string`	`/var/run/secrets/kubernetes.io/serviceaccount/ca.crt`
K8S Namespace Path	`K8S_NAMESPACE_PATH`	`k8s_namespace_path`	`string`	`/var/run/secrets/kubernetes.io/serviceaccount/namespace`
K8S Service Name	`K8S_SERVICE_NAME`	`k8s_service_name`	`string`	`rabbitmq`
K8S Address Type	`K8S_ADDRESS_TYPE`	`k8s_address_type`	`string`	`ip`
K8S Hostname Suffix	`K8S_HOSTNAME_SUFFIX`	`k8s_hostname_suffix`	`string`
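
The K8S defaults above point at service account files mounted into the pod. A small pre-flight check, sketched below, can help when troubleshooting. `k8s_preflight` is a hypothetical helper and not part of the plugin; it only verifies that the three configured paths are readable.

```shell
export AUTOCLUSTER_TYPE=k8s
export K8S_SERVICE_NAME=rabbitmq
export K8S_ADDRESS_TYPE=ip

# Hypothetical helper: report any service account file that cannot be read.
# Falls back to the default paths from the table when a variable is unset.
k8s_preflight() {
    status=0
    for path in \
        "${K8S_TOKEN_PATH:-/var/run/secrets/kubernetes.io/serviceaccount/token}" \
        "${K8S_CERT_PATH:-/var/run/secrets/kubernetes.io/serviceaccount/ca.crt}" \
        "${K8S_NAMESPACE_PATH:-/var/run/secrets/kubernetes.io/serviceaccount/namespace}"
    do
        if [ ! -r "$path" ]; then
            echo "missing or unreadable: $path" >&2
            status=1
        fi
    done
    return "$status"
}
```

Running `k8s_preflight` inside the container before starting the node would show whether the token, CA certificate and namespace files are where the plugin is configured to look.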
