Table of Contents
minLevel1
maxLevel2
outlinefalse
styledisc
typelist
printabletrue

Overview

Listing Cache (LC) is a performance optimization feature designed to improve the speed of listing large datasets within Swarm storage. It works by caching pseudo-folder listings, reducing the time and resource consumption required to fetch and display object listings repeatedly.

The Listing Cache solves a scalability problem with the gateway's delimited folder listing functionality. To determine if a folder has subfolders, an Elasticsearch query has to enumerate all objects with the folder name as a prefix to their object names. This can run into the millions of objects for large buckets. When such queries are issued repeatedly and at high frequencies, the resulting CPU use brings an entire Elasticsearch cluster to a halt.
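
For a sense of what this means in practice, a delimited "folder" listing like the one below is the kind of request the Listing Cache accelerates; without the cache, every such request forces Elasticsearch to enumerate all objects sharing the prefix. The bucket, prefix, and endpoint names are placeholders, not values from your deployment.

Code Block
# Hypothetical delimited listing against an S3 domain served by the gateway.
aws s3api list-objects-v2 \
    --bucket demo \
    --prefix "backups/" \
    --delimiter "/" \
    --endpoint-url https://domaina.acme.com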

Limitations

  • Client-specific binding: Bound to a dedicated client, with no cross-gateway sharing allowed. Once one or more domains are served by a listing-cache-enabled gateway, that gateway must serve all requests to those domains exclusively. This is achieved by configuring your load balancer with dedicated host-based traffic redirection rules.

  • Non-persistent cache: The disk/memory cache is discarded by default on restart.

  • Limited lifecycle and recursive deletion support: No support for bucket lifecycle policies, delete lifepoints, or recursive deletes. All writes and deletes must originate from the gateway.

  • Memory constraints: Caching large volumes of data can quickly consume system memory. Misconfiguring cache sizes can lead to memory exhaustion or excessive eviction, reducing cache effectiveness.

  • Delimiter support: Custom delimiters are not yet supported; only the forward slash "/" is supported.

  • Replication support: Do not set up replication when LC is enabled.

  • Unsupported functionality: Custom delimiters, S3 lifecycle policies, and recursive deletes.

Prerequisites

The Listing Cache can be enabled on gateway 8.1.2 or above. Ensure the following prerequisites are met before deploying Listing Cache:

  • Hardware Requirements:

    • 8 vCPUs

    • 16GB RAM

    • 200GB dedicated partition formatted with XFS

  • Load Balancing Configuration:

    • Hardcode domains to a single gateway with Listing Cache (LC).

Info

Shared gateway support is currently not available.

Assuming you are using recommended settings, you will need to do the following:

Set Java Memory Heap

Panel
bgColor#DEEBFF

vim /etc/sysconfig/cloudgateway

HEAP_MIN="12288m"
HEAP_MAX="12288m"

Create disk cache partition

Panel
bgColor#DEEBFF

vgcreate swarmspool /dev/sdb
lvcreate -L 195G -n diskcache swarmspool
mkfs.xfs /dev/swarmspool/diskcache
mount /dev/swarmspool/diskcache /var/spool/caringo/

Persist the mount by adding the following line at the end of /etc/fstab:

/dev/mapper/swarmspool-diskcache /var/spool/caringo xfs defaults 0 0
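
A quick sanity check of the new partition can be run before starting the gateway; the commands below only assume the mount point created above.

Panel
bgColor#DEEBFF

# Mount everything listed in /etc/fstab, then confirm size and filesystem type.
mount -a
df -h /var/spool/caringo
xfs_info /var/spool/caringo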

Do Not Use Listing Cache If:

  1. You use multipart S3 operations.

  2. You use custom delimiters in search queries.

  3. You need the ability to do recursive deletes of domains and buckets.

  4. You use S3 lifecycle policies.

  5. You need support for delete lifepoints.

  6. You do not use pseudo folders or all objects are in a single pseudo folder.

How to Enable Listing Cache

The procedure to enable Listing Cache in Swarm is outlined below:

  1. Add the following to /etc/caringo/cloudgateway/gateway.cfg:

Code Block
[storage_cluster]
disableListingCache=false
  2. After testing in a staging environment, roll out the Listing Cache to production by deploying the necessary configuration changes.

  3. Monitor performance impact closely during the rollout phase.

  4. Optional. Pre-warm the cache with commonly accessed listings before enabling it in production, so the initial requests are served from the cache (see the example after this list).
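
A minimal sketch of steps 1 and 4, assuming the gateway service is named cloudgateway and that an S3 profile and a commonly listed bucket/prefix already exist (all names below are placeholders):

Code Block
# Restart the gateway so the gateway.cfg change takes effect.
# Service name assumed from /etc/sysconfig/cloudgateway; adjust if different.
systemctl restart cloudgateway

# Pre-warm: issue delimited listings for frequently accessed folders so
# their per-folder databases are primed before the production cutover.
aws s3 ls s3://demo/backups/ \
    --endpoint-url https://domaina.acme.com \
    --profile swarm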

How Does Listing Cache Work

  • Ensure Sufficient Disk Space: Listing Cache stores each folder in a separate SQLite database, which consumes disk space. Provide ample disk space to avoid frequent evictions of folder databases, as this impacts performance.

  • Automatic Folder Detection: Listing Cache automatically learns about folders through ongoing list, write, and delete requests. No manual intervention is required to create or manage databases for each folder.

  • Monitor Cache Population: Initially, for any new folder, the cache starts with an "infinite gap," meaning it has no data cached and queries Elasticsearch. Over time, as more listings are cached, the gap reduces until the folder is fully cached and can be served without querying Elasticsearch.

  • Real-Time Cache Updates: Ongoing write and delete requests are intercepted and used to keep the folder databases updated, ensuring the cache remains consistent with the actual data.

  • LRU-Based Eviction: The system automatically evicts the least recently used (LRU) databases when disk space is full. If a folder's database is evicted and later requested, the cache process restarts for that folder.

  • Disk Space Directly Impacts Performance: The more disk space available, the fewer evictions occur, allowing more folders to remain fully cached and reducing the need for frequent Elasticsearch queries (see the disk-usage check after this list).

  • Prepare for Elasticsearch Querying: In case of cache misses or folder database evictions, Elasticsearch will be queried. Ensure that Elasticsearch is properly configured to handle such requests, especially during periods of high cache turnover.
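
One way to keep an eye on disk sizing is to watch the disk-related metrics listed in the Metrics section below. This sketch assumes the gateway's Prometheus metrics are reachable through your telemetry setup; the address used here is a placeholder.

Code Block
# Placeholder metrics endpoint; substitute the address your telemetry stack scrapes.
METRICS_URL="http://gateway.example.com:9100/metrics"

# Bytes of folder databases on disk versus free space as seen by the Listing Cache.
curl -s "$METRICS_URL" | grep -E 'caringo_listingcache_(disk_cached_bytes|disk_free)'

# A steadily climbing eviction counter means folders are being dropped and
# re-primed from Elasticsearch; consider adding disk space.
curl -s "$METRICS_URL" | grep 'caringo_listingcache_disk_evicted'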

How to Determine If the Listing Cache is Working Correctly

  1. Monitor Cache Hit Rate

    • If you have telemetry and Grafana available, check the Listing Cache dashboard (see also the raw-metric check after this list).

  2. Check Response Time

    • Compare the response time before and after enabling the Listing Cache. Reduced response times, particularly for frequently requested folder listings, indicate the cache functions correctly.

  3. Resource Utilization

    • Monitor memory usage and CPU utilization. Increased memory usage and steady CPU activity are normal in a caching system, but excessively high CPU or memory usage may indicate misconfiguration.
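
If Grafana is not available, the same signal can be read from the raw metrics (same placeholder endpoint assumption as above): listings served from the per-folder databases should grow while Elasticsearch query activity flattens out.

Code Block
METRICS_URL="http://gateway.example.com:9100/metrics"   # placeholder address

# Request counts (and errors) handled by the Listing Cache.
curl -s "$METRICS_URL" | grep 'caringo_listingcache_request'

# Elasticsearch queries still issued for priming/listing; these should level
# off as folders become fully cached.
curl -s "$METRICS_URL" | grep 'caringo_listingcache_backend_query'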

Deployment Steps

Follow these steps to deploy the Listing Cache:

Step 1: Prepare the Environment

  1. Provision a server with the specified hardware requirements.

  2. Ensure the server’s 200GB partition is formatted with the XFS file system.

  3. Verify network connectivity to other components of the S3 environment.

Step 2: Configure Load Balancer

  1. Modify load-balancing rules to hardcode domains to a single gateway.

  2. Ensure that all LC-enabled domains point to the appropriate gateway.

  3. Test the load balancer configuration to confirm proper routing.

Step 3: Install and Configure Listing Cache

  1. Download the LC installation package from the designated repository.

  2. Install the package on the prepared server.

  3. Configure LC settings according to your environment’s specifications:

    • Set up domain-specific configurations.

    • Enable pseudo folder support as required.

Step 4: Validate Deployment

  1. Perform basic functionality tests:

    • Verify data retrieval and storage through LC.

    • Test operations within pseudo folders.

  2. Check system logs for any errors or warnings.

  3. Monitor performance metrics to ensure the hardware is sufficient.

Step 5: Go Live

  1. Enable LC for production workloads.

  2. Monitor system performance and address any issues promptly.

Post-deployment Recommendations

  • Regularly monitor LC’s performance and resource utilization.

  • Plan for updates as new features and improvements are released.

  • Document any environment-specific configurations for future reference.

HAProxy Configuration for LC-Enabled Gateway

Below is a suggested generic HAProxy configuration for a Listing Cache (LC)-enabled gateway.

  1. This configuration is designed for HAProxy version 2.2 and higher.

  2. Failover without failback is enabled for Listing Cache. Since restarting LC clears its cache, it is optimal to fail over only when the gateway becomes unavailable.

  3. SCSP traffic is not routed to the Listing Cache. The configuration is primarily intended for handling S3 traffic.

  4. Specific domains are redirected to the LC-enabled gateway, while all other traffic is routed to the regular non-cached pool.

The following is an example /etc/haproxy.conf file.

Code Block
global
    log 127.0.0.1 local2 alert
    chroot /var/lib/haproxy
    stats socket /var/lib/haproxy/stats mode 660 level admin
    stats timeout 30s
    user haproxy
    group haproxy
    daemon

    ca-base /etc/pki/ca-trust/
    crt-base /etc/haproxy/certs

    ssl-default-bind-ciphers ECDH+AESGCM:DH+AESGCM:ECDH+AES256:DH+AES256:ECDH+AES128:DH+AES:RSA+AESGCM:RSA+AES:!aNULL:!MD5:!DSS:!3DES
    ssl-default-bind-options no-sslv3
    maxconn 2048
    tune.ssl.default-dh-param 2048

defaults
    log     global
    mode    http
    option  forwardfor
    option  httplog
    option  dontlognull
    timeout connect 5000
    timeout client  50000
    timeout server  130000

frontend HTTP_IN
    bind *:80 name *:80
  option http-keep-alive
  acl acl_is_http req.proto_http
    http-request redirect scheme https if acl_is_http

frontend stats
    mode http
    bind 0.0.0.0:8404
    stats enable
    stats uri /stats
    stats refresh 10s
    stats admin if LOCALHOST

frontend HTTPS_IN
    bind *:443 name *:443 ssl alpn h2,http/1.1 crt /etc/haproxy/certs/wildcard.acme.com.pem
  mode http
  option http-keep-alive
  option httplog

    acl acl_is_content_ui path -m beg /_admin/portal
  acl acl_awsauth hdr_sub(Authorization) -i AWS
    acl acl_aws url_reg -i (?<=[?&])(AWSAccessKeyId|X-Amz-Credential)=
    # Define an acl per domain you want to send to LC
  acl acl_is_domain_a hdr(host) -i  domaina.acme.com

  use_backend POOL-S3-listingcache if acl_is_domain_a
  use_backend POOL-S3 if acl_awsauth || acl_aws
  use_backend POOL-scsp if acl_is_content_ui

backend POOL-scsp
    mode http
    balance leastconn
    stick-table type ip size 50k expire 30m  
    stick on src
    http-reuse safe 
    server GW01 10.11.21.33:8080 check inter 10s
    server GW02 10.11.21.34:8080 check inter 10s

backend POOL-S3-listingcache
     balance source
   stick-table type ip size 50k expire 24d  
     stick on src

   option httpchk
     http-check connect
     http-check send meth HEAD uri / ver HTTP/1.1 hdr Host haproxy-healthcheck
     http-check expect status 403

   server GW03 10.11.21.35:8090 check inter 10s fall 3 rise 2  
     server GW04 10.11.21.36:8090 check inter 10s fall 3 rise 2  backup

backend POOL-S3
    balance leastconn
    stick-table type ip size 50k expire 30m  
    stick on src  

    option httpchk
    http-check connect
    http-check send meth HEAD uri / ver HTTP/1.1 hdr Host haproxy-healthcheck
    http-check expect status 403

    server GW01 10.11.21.33:8090 check inter 10s fall 3 rise 2
    server GW02 10.11.21.34:8090 check inter 10s fall 3 rise 2
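
Once the configuration is loaded, the host-based rule can be spot-checked from any client: a request carrying the LC domain in its Host header should be served by the POOL-S3-listingcache backend (visible on the HAProxy stats page, port 8404, path /stats). The load-balancer hostname below is a placeholder.

Code Block
# Expect an S3-style 403 (no credentials supplied); then confirm on the stats
# page that the request was served by POOL-S3-listingcache.
curl -sk -o /dev/null -w '%{http_code}\n' -H 'Host: domaina.acme.com' https://lb.acme.com/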

Metrics

Code Block
caringo_listingcache_request (Summary)
        Request counts and latencies for write/delete/list, versioned/nonversioned.
        Labels: method=[write, delete, list], mode=[V, NV]

caringo_listingcache_request_errors (Counter)
        Request error counts for write/delete/list, versioned/nonversioned.
        Labels: method=[write, delete, list], mode=[V, NV]

caringo_listingcache_listed_recs (Counter)
        Total number of records returned by the cache, versioned/nonversioned.
        Labels: method=["list", "prime", "reconciliation"], mode=[V, NV]

caringo_listingcache_backend_query (Summary)
        Counts and latencies of ES queries for priming/listing, versioned/nonversioned.
        Labels: method=["list", "prime"], mode=[V, NV]

caringo_listingcache_backend_query_recs (Counter)
        Number of ES records queried for priming/listing, versioned/nonversioned.
        Labels: method=["list", "prime"], mode=[V, NV]

caringo_listingcache_cache_query (Summary)
        Counts and latencies of SqliteDB queries for priming/listing, versioned/nonversioned.
        Labels: method=["list", "prime", "reconciliation"], mode=[V, NV]

caringo_listingcache_cache_query_recs (Counter)
        Number of SqliteDB records queried for priming/listing, versioned/nonversioned.
        Labels: method=["list", "prime", "reconciliation"], mode=[V, NV]

caringo_listingcache_flushes_pending (Gauge)
        Folder updates pending flush to SqliteDB disk cache.

caringo_listingcache_flushes_done (Counter)
        Folder updates flushed to SqliteDB disk cache.

caringo_listingcache_trims_pending (Gauge)
        Folders pending trim in memory cache.

caringo_listingcache_trims_done (Counter)
        Folders trimmed in memory cache.

caringo_listingcache_folder_pulls_pending (Gauge)
        Folders marked to be internally pulled into cache.

caringo_listingcache_folder_pulls_done (Counter)
        Folders internally pulled into cache.

caringo_listingcache_mem_cached (Gauge)
        Folders currently in memory cache.

caringo_listingcache_mem_evicted (Counter)
        Folders evicted from memory cache.

caringo_listingcache_dbhandle_cached (Gauge)
        SqliteDB handles currently in memory cache.

caringo_listingcache_dbhandle_evicted (Counter)
        SqliteDB handles evicted from memory cache.

caringo_listingcache_disk_cached (Gauge)
        SqliteDBs currently in disk cache.

caringo_listingcache_disk_evicted (Counter)
        Folders evicted from disk cache.

caringo_listingcache_disk_cached_bytes (Gauge)
        Size in bytes of SqliteDBs currently in disk cache.

caringo_listingcache_disk_evicted_bytes (Counter)
        Size in bytes of SqliteDBs evicted from disk cache.

caringo_listingcache_reconciliations_done (Counter)
        Number of cache records reconciled (versionid mismatches corrected based on etag).
        Labels: origin=[backend,cache]

caringo_listingcache_memory_used (Gauge)
        Memory use as perceived by the listing cache.

caringo_listingcache_disk_free (Gauge)
        Disk free space as perceived by the listing cache.