Logo

dev-resources.site

for different kinds of informations.

Large documents in redis: is it worth compressing them (Part 1)

Published at
5/28/2021
Categories
redis
compression
performance
benchmark
Author
ghusse
Author
6 person written this
ghusse
open
Large documents in redis: is it worth compressing them (Part 1)

At Forest Admin, we build admin panels for which we need to compute and cache large JSON documents. These documents are stored in redis and retrieved from this storage in order to be as fast as possible.

🐘 The problem with large JSON documents: latency

Some of these JSON document can weight more than 20MB. Storing and retrieving such large document introduce latency in our services, as the data needs to be uploaded or downloaded through the network.

So, every time we need to retrieve these documents from the server, we download 20MB (in worst cases) from redis and parse the received json.

Looking at performance logs in production, I could see that this operation of storing or retrieving these documents costed us some significant amount of time.

I wanted to test if would worth to upload compressed json documents instead of plain json content in redis.

That's why I created a repository on github with some code to run benchmarks.

πŸ•— Protocol

I want to evaluate the total time of pushing a document and retrieving it from redis, with 5 different implementations:

  • none: JSON.stringify() + push the result to redis
  • brotli: JSON.stringify() + brotli + push the buffer to redis
  • deflate: JSON.stringify() + deflate + push the buffer to redis
  • gzip: JSON.stringify() + gzip + push the buffer to redis

  • msgpack: msgpack.pack() + push the buffer to redis

Each compression algorithm will be evaluated with every possible compression level:

  • brotli: 1 to 11
  • deflate: -1 to 9
  • gzip: -1 to 9

Json documents will be retrieved from a redis server, based on a key pattern. For my own test, I will use real JSON documents of various sizes that we handle in production.

The benchmark is run against a redis server, and in my case, it'll be a local server running in docker.

Finally, these tests are run on my Macbook Pro 2,8 GHz Intel Core i7, on node 14.

πŸ“ˆ Results

Brotli compression

Overall download duration with brotli

We can see that a compressed document with brotli will always be faster to retrieve, and decompress than a large uncompressed document.

Overall upload duration with brotli

These first results show that ⚠ brotli is very slow during compression, when used with quality levels of 10 and 11. That's something worth noting as 11 is the default value in node 14.

Using brotli with its default value is actually slower than not doing anything for document upload.

I plotted another graph without these 2 values for brotli compression, to be able to compare quality levels.

Overall upload duration with brotli (without 10 and 11)

With this 3rd plot, we can see that any other quality level allows to have better upload time of our documents to redis.

We also can see that the minimal compression level has the best possible performance in the conditions of my experiments. It might change for instance with a remote redis server.

To summarize

  • Download:
    • πŸ… brotli-11 for download performance
    • βœ… every compression level has better performance
  • Upload:
    • πŸ… brotli-1 for upload performance
    • βœ… compression levels up to 9 are faster than no compression
    • ❌ brotli-10 and brotli-11 are slower than no compression
  • Potential winner:
    • πŸ… brotli-1

Deflate compression

Overall download duration with deflate

Same results here for deflate than brotli: it's always faster to retrieve compressed documents from redis when they are compressed.

Overall upload duration with deflate

On the other hand, it's also always faster to upload compressed documents to redis with deflate, whatever the compression level we choose.

We can see that deflate 3 seems to be the faster when used to upload documents.

To summarize

  • Download:
    • βœ… Results are very similar across compression levels, and we cannot spot a clear winner from the results
  • Upload:
    • πŸ… deflate-3 seems to be slightly faster than other compression levels
    • βœ… all compression levels are faster to use than nothing at all
  • Potential winner:
    • πŸ… deflate-3

Gzip compression

Overall download duration with gzip

With gzip also, it's always faster to retrieve compressed documents from redis than uncompressed documents.

Overall upload duration with gzip

Using gzip to upload documents is also always faster, but we can see a clear difference between stronger compression levels (8 & 9) that seem a little slower.

To summarize

  • Download:
    • βœ… Results are very similar across compression levels, and we cannot spot a clear winner from the results
  • Upload:
    • πŸ… gzip-3 seems to be slightly faster than other compression levels
    • βœ… all compression levels are faster to use than nothing at all
  • Potential winner:
    • πŸ… gzip-3

msgpack

The library msgpack allows to serialize javascript objects in a smaller form that the standard JSON.stringify. It is documented as slower than JSON, but I wanted to measure if the time to serialize was compensated by a faster transfert.

It also has the advantage to transform objects into a buffer in one operation, in contrary to the solution using a compression algorithm:

  • msgpack transforms an object into a buffer
  • solutions with compression are requiring 2 steps: one for transformation to JSON, another for the compression of the resulting string.

Overall download duration with msgpack

Results are not good for msgpack in my local environment: it seems always slower to use msgpack than just sending plain JSON documents to my redis server.

⚠ This library can still produce interesting results when the network gets slower. In my situation, it cannot be faster than solutions based on compression, because the compression level obtained with msgpack (around 60%) is a lot less interesting than the compression obtained with compression algorithms (more than 90%).

Overall upload duration with msgpack

Upload results are not good either, for msgpack.

To summarize

❌ msgpack is slower than the original implementation both for upload & download

Final match: brotli-1 vs deflate-3 vs gzip-3

Overall download duration with best compression algorithms

Results here are very similar between compression algorithms.

Overall upload duration with best compression algorithms

Regarding upload time, brotli-1 seems to have a clear advantage over other algorithms.

And the winner is: πŸ… brotli-1

Next step: validate these results on an solution that is closer to the production:

  • on an instance that has similar performance to production instances
  • use a remote redis instance, similar to the one used in production.

These results will be shared in a part 2 of this series!

compression Article's
30 articles in total
Favicon
AVIF File Format: The Evolution in Web Image Compression
Favicon
Lightweight, Transparent, Animated: Get All of Them by WebP Format
Favicon
Flutter Image Compression: Ensuring High Quality
Favicon
Compression Is About Knowing Your Data
Favicon
How to use compression in Node.js server for better bandwidth ?
Favicon
Dall.E Image Gen, And Size Comparison Of Image Formats
Favicon
When life gives you lemons ...
Favicon
Image Compression in JavaScript/TypeScript
Favicon
How to Use IMGCentury For Bulk & Unlimited Image Compressions?
Favicon
Created simple phone number compression functions.
Favicon
Exploring the LZ77 Compression Algorithm
Favicon
Ultimate 3D Compression: echo3D Introduces Compression Tools for 3D Content to Accelerate 3D Development
Favicon
Reduce Network Usage - With This 1 Simple Trick!
Favicon
How Finding the Right Compression Level Can Speed Up your Website
Favicon
Client-side image compression with Firebase and Compressor.js
Favicon
Client-side image compression with Supabase Storage
Favicon
COS Cost Optimization Solution
Favicon
Is there any way to compress the data while using mongo persistence with NEventStore?
Favicon
Search safe number compression algorithm(feat. Python)
Favicon
Golang HTTP Handler With Gzip
Favicon
How to use AVIF today!
Favicon
Large documents in redis: is it worth compressing them (Part 1)
Favicon
Open or create TAR files in C# with Aspose.ZIP
Favicon
Supercharge your API with Compression
Favicon
Django 3.2 - News on compressed fixtures and fixturesΒ compression
Favicon
How I made my own file compressor using Node.js
Favicon
Compressed GraalVM Native Images: the best startup for Java apps comes in tiny packages
Favicon
What is lossy and lossless compression
Favicon
did you ever ask yourself how file compression works ? the maths behind greedy algos
Favicon
Huffman coding from scratch with Elixir

Featured ones: