The following procedure lets you restore data to a new cluster to get a cluster clone from snapshots stored in a backup location.
New cluster with the same number of nodes as the source cluster.
Scylla Manager Agent installed on all the nodes.
Access to the backup location from all the nodes.
Ansible installed on your local machine.
Access to all the nodes over SSH from your local machine.
If you do not know the exact number of nodes of the source cluster, you can create a single node cluster and discover the number in the process. In fact you can use any Scylla node with Scylla Manager Agent installed that has access to the backup location for getting the parameters. Note that there is no need for Scylla Manager Server installation prior to restore.
Cloning the cluster is automated with Ansible playbook. The playbook has a readme that explains how to use it. For reader’s convenience it’s repeated here.
Clone repository from GitHub
git clone email@example.com:scylladb/scylla-manager.git
Go to playbook directory
All restore parameters shall be put to
vars.yaml and change parameters to match your clusters.
SSH to one of the nodes and execute the following command:
scylla-manager-agent download-files -L <backup-location> --list-nodes
it gives you information about all the clusters and nodes available in the backup location.
Cluster: prod (a9dcc6e1-17dc-4520-9e03-0a92011c823c) AWS_EU_CENTRAL_1: - 188.8.131.52 (7e68421b-acb1-44a7-a1a8-af7eaf1bb482) - 184.108.40.206 (adc0a3ce-dade-4672-981e-26f91a3d35cb) - 220.127.116.11 (2a575244-3e3c-44a1-a526-da4394f9525e) - 18.104.22.168 (82f0f486-370d-4cfd-90ac-46464c8012cb) - 22.214.171.124 (9ee92c19-5f78-4865-a287-980218963d96) - 126.96.36.199 (aff05f79-7c69-4ecf-a827-5ea790a0fdc6) Cluster: test (da5721cd-e2eb-4d10-a3a7-f729b8f72abf) AWS_EU_CENTRAL_1: - 188.8.131.52 (4001206a-3377-40cb-abd4-d38aad5dec41) - 184.108.40.206 (c6466011-02f9-49dd-8951-c32028dfc6f1) - 220.127.116.11 (bc39bb07-7a21-41cd-b576-51d44c1a694a)
For each node IP in the new cluster you need to assign a host ID (UUID in the listing above).
The mapping must be put into
host_id variable in
To list available snapshot tags for a node use
scylla-manager-agent download-files -L <backup-location> --list-snapshots -n <host-id>
You can filter snapshot tags containing a specific keyspaces or tables by using glob patterns.
-K, --keyspace <list of glob patterns to find keyspaces> lets you do that.
It works the same way as in scheduling backups or repairs with Scylla Manager.
The snapshot ID must be put into
snapshot_tag variable in
Put public IP addresses of all nodes to
Rut the playbook:
ansible-playbook -i hosts -e @vars.yaml restore.yaml