[Cache] Caching Strategies
1) Lazy Loading
- Load data into cache only when necessary.
- Ask Cache - if unavailable, Ask Database.
1.1) Cache Hit
- When your data is in cache and not expired:
- Application requests data from the cache.
- Cache returns data to the application.
1.2) Cache Miss
- Application requests data from the cache.
- Cache doesn't have requested data and returns null.
- Application requests and receives data from database.
- Application updates cache with requested data for quicker access next time.
1.3) Pros & Cons
1.3.1) Pros
- Only requested data is cached.
- Most data are never requested, lazy loading avoids filling up your cache with data that isn't requested.
- Node Failures are not fatal
- When node fails and is replaced, new node continues to function as requests are being made. (as opposed to that in Write Through)
1.3.2) Cons
- Cache Miss Penalty
- Each cache miss results in 3 trips:
- Initial data request to cache
- Database query
- Writing data to node
- Stale data
- If data is only written to the cache when there is a cache miss, data in the cache can become stale because there is no updates to the cache when data is changed in the database.
2) Write Through
Adds/updates data in cache whenever data is written to database.
2.1) Pros & Cons
2.1.1) Pros
- Data in cache is never stale
- Data stored is always the most updated.
- Write penalty vs Read penalty
- Every write involves two trips:
- A write to cache
- A write to database
- This adds latency to the process. But end users are generally more tolerant of latency when updating data than wen retrieving data.
2.1.2) Cons
- Missing Data
- When spinning up a new node (eg. due to node failure), there is missing data which continues to be missing until it is added/updated on database.
- Cache Churn
- Since most data is never read, there can be a lot of data in cluster that is never read - waste of resource.
- Adding TTL helps minimize wasted space.
3) Adding TTL
TTL = Time to Live. This indicates how long a data can live before it is removed.
In the case of Lazy Loading, this prevents data from being too stale.
In the case of Write Through, it reduces the problem of too much unread data in cluster.
Comments
Post a Comment