
Note

This technique requires the Content Gateway with S3 enabled.

Swarm builds and maintains your search data (index) through your Search Feed, and it regenerates the search index should it ever be lost. You can trigger this regeneration at any time by running the Refresh command for your search feed in the Swarm Storage UI (or legacy Admin Console, port 90). A complete refresh (which verifies all data) takes a long time, during which your listings are unavailable. 

If you need to ensure that listings are never offline, you can use an Elasticsearch plugin to take a snapshot of the index data so it can be restored for instant disaster recovery. Because the Gateway can function as an S3 Repository, you can leverage the AWS Cloud Plugin to get Snapshot and Restore capability for your search data (index). To snapshot is to back up your search data to a file system or to S3 (Swarm); to restore is to place a snapshot back into production.

These are key reasons for using the AWS Cloud plugin:

  • Search Index Restoration: If your Search cluster has problems and the search index is lost, you can restore a snapshot so applications that depend on listings and collections are not interrupted.

  • Usage Snapshot: The usage metering indices that Swarm writes are temporary. To avoid losing usage data written since the last backup, you can set up frequent snapshots.

  • Data Move: If you are making changes to your Search cluster, you can restore a snapshot to the new location to minimize disruption in services.

Important

Refresh the feed for the restored index, and allow time for Swarm to verify the index data. Until it completes, any objects created, changed, or deleted after the last snapshot may be missing or appear erroneously.

Configuring the Plugin

These are the required values:

  • Elasticsearch to back up: http://<elasticsearch-node>:9200
  • Content UI (Portal): http://<domain>/_admin/portal/
  • Domain: the <domain> in the destination Swarm storage cluster
  • Bucket: the <bucket> within that <domain>
  • S3 Endpoint: http(s)://<domain>:<S3-port>
  • Token ID (access key): a UUID
  • S3 Secret Key: generated when the token is created


Best practice

Although you can back up Elasticsearch to the same Swarm cluster that is using it, it is best to use a separate Swarm cluster.

  1. In the Content UI (Portal) on the Swarm cluster storing the Elasticsearch snapshots, create an S3 token.

    1. Create or select the domain.

    2. Open its Settings (gear icon) and select the Tokens tab.

    3. Create a token that includes an S3 key.

    4. Record both the access key (token ID) and the secret (S3) key.

  2. On each node in your Elasticsearch cluster, install the AWS Cloud Plugin:

    1. Log in to the node via ssh as the root user.

    2. Install the plugin: 

      sudo /usr/share/elasticsearch/bin/elasticsearch-plugin install --batch repository-s3
    3. To allow the keys to be specified in the repository settings, add the following JVM option to /etc/elasticsearch/jvm.options: 

      -Des.allow_insecure_settings=true
    4. Restart the Elasticsearch service.

  3. Configure an S3 repository using the token (see below).

  4. Test the plugin, as shown below (a verification sketch also follows this list):

    1. Take a snapshot.

    2. Delete the search index (which causes listings to fail).

    3. Restore your snapshot using the manifest file.

    4. Verify the listings are working again.
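Before taking the first snapshot, you can confirm the plugin is installed on every node and that the S3 token works against the Gateway endpoint. This is a minimal sketch; it reuses the hypothetical hostnames, bucket, and keys from the repository example below and assumes the aws CLI is available wherever you run the check.

    # Confirm the repository-s3 plugin appears for every Elasticsearch node
    curl -XGET 'http://elasticsearch:9200/_cat/plugins?v'

    # Confirm the token can list the snapshot bucket through the Gateway S3 endpoint
    AWS_ACCESS_KEY_ID=18f2423d738416f0e31b44fcf341ac1e \
    AWS_SECRET_ACCESS_KEY=BBgPFuLcO3T4d6gumaAxGalfuICcZkE3mK1iwKKs \
    aws s3 ls s3://essnapshots --endpoint-url 'http://mydomain.example.com/'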

Configure the S3 Repository

In these examples, the Elasticsearch repository is using S3 to store the search data snapshots. 

  1. Create the S3 repository using a command like the following. The base_path can be empty, or set it if this bucket is the backup destination for multiple Elasticsearch clusters.

    • The endpoint with the bucket in the host must be accessible from every Elasticsearch node (to verify, run 'curl -i http://essnapshots.mydomain.example.com/'). This requires either explicit /etc/hosts entries or wildcard DNS for the domain. If any node fails to contact the endpoint, you must delete the repo with "curl -XDELETE 'http://elasticsearch:9200/_snapshot/myRepo'" and PUT it again.

    • These configuration values (endpoint, access_key, secret_key) can be stored in the Elasticsearch configuration (elasticsearch.yml and the keystore) instead of the JSON body (see the Elasticsearch docs for the config names, and the sketch after this list).

      curl -XPUT -H 'Content-type: application/json' \
        'http://elasticsearch:9200/_snapshot/myRepo' \
        -d '{
          "type": "s3",
          "settings": {
            "bucket": "essnapshots",
            "region": null,
            "endpoint": "http://mydomain.example.com/",
            "protocol": "http",
            "base_path": "myswarmcluster",
            "access_key": "18f2423d738416f0e31b44fcf341ac1e",
            "secret_key": "BBgPFuLcO3T4d6gumaAxGalfuICcZkE3mK1iwKKs"
          }
        }'
  2. List information about the snapshot repository:                                                     

    curl -XGET 'http://elasticsearch:9200/_snapshot'
  3. Verify the repository is created successfully:                                                                      

    curl -XPOST 'http://elasticsearch:9200/_snapshot/myRepo/_verify'
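
As noted above, the client settings can live in the Elasticsearch configuration instead of the repository JSON. This is a minimal sketch, assuming Elasticsearch 6.x with the repository-s3 plugin (setting names differ in other versions): the endpoint and protocol go in elasticsearch.yml, the keys go into the Elasticsearch keystore on every node, and the PUT body then omits endpoint, access_key, and secret_key (the allow_insecure_settings JVM option is also unnecessary in that case).

    # /etc/elasticsearch/elasticsearch.yml (on every node)
    s3.client.default.endpoint: mydomain.example.com
    s3.client.default.protocol: http

    # Add the token keys to the keystore on every node, then restart Elasticsearch
    sudo /usr/share/elasticsearch/bin/elasticsearch-keystore add s3.client.default.access_key
    sudo /usr/share/elasticsearch/bin/elasticsearch-keystore add s3.client.default.secret_key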

Creating a Snapshot in the S3 Repository

  1. Create a new snapshot in the S3 repository, setting it to wait for completion:  

    curl -XPUT 'http://elasticsearch:9200/_snapshot/myRepo/snapshot_20201031?wait_for_completion=true'

    If needed, you can restrict the indices (such as to Search only, if Metrics backups are not needed); see the sketch after this list and the Elasticsearch documentation for details on restricting indices.

  2. Allow several hours for this to complete, especially for an initial snapshot of an Elasticsearch cluster with many large indices.
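
For example, to restrict a snapshot to specific indices and monitor its progress while it runs, you can use something like the following sketch (the index pattern and snapshot name are only illustrations; match them to your cluster's actual search index names):

    # Snapshot only matching indices, skipping cluster state and any missing indices
    curl -XPUT -H 'Content-type: application/json' \
      'http://elasticsearch:9200/_snapshot/myRepo/snapshot_20201101' \
      -d '{
        "indices": "index_*",
        "ignore_unavailable": true,
        "include_global_state": false
      }'

    # Check the progress of a running (or completed) snapshot
    curl -XGET 'http://elasticsearch:9200/_snapshot/myRepo/snapshot_20201101/_status'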

Restoring from a Snapshot

  1. Best practice: Always test restoring a backup before it is needed. Delete the search index in a test or staging environment to simulate a situation where the Elasticsearch data must be restored:

    curl -XDELETE 'http://elasticsearch:9200/_all'  # do not do this in Production!
  2. In the Storage UI:

    1. Open Cluster > Feeds, open the Swarm search feed, and select Actions (gear icon) > Pause to prevent a new index from being created.

    2. Open Settings > Cluster, Metrics and temporarily disable Swarm metrics (metrics.targets set to nothing) to prevent those indices from being created during restore.

  3. Restore the search index, renaming indices if they exist and are locked:

    curl -XPOST "elasticearch:9200/_snapshot/myRepo/snapshot_20201031/_restore" 
      -H Content-type: application/json	
      -d '{
        "rename_pattern": "(.+)",
        "rename_replacement": "restored_$1"
      }'
  4. In the Storage UI:

    1. Open Settings > Cluster, Metrics and re-enable Swarm metrics (restore metrics.targets to its prior value).

    2. Open Cluster > Feeds, open your Swarm search feed, and select Actions (gear icon) > Unpause to reactivate the feed.

  5. Verify the listings are working as before:

    curl -iL 'http://swarm:80/?domains&format=json'
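
To double-check the restore at the Elasticsearch level, you can also list the indices; the restored copies carry the restored_ prefix applied by the rename_replacement above:

    curl -XGET 'http://elasticsearch:9200/_cat/indices?v'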