Configuring Cache Control for S3 Hosted Resources

Configuring Cache Control for S3 Hosted Resources

When doing web development, one of the most important aspects is to get a site performing well with SEO (Search Engine Optimisation). One contributing factor to this (there are many), is effective resource caching. In this article, I'm going to show how to achieve this when resources are stored within a S3 bucket.

Within Chrome, we can generate a Lighthouse report which will identify any images that are not cached correctly.

Lighthouse Report

In this image, we can see that the default "cache policy" applied to resources in a S3 bucket is to tell the browser to not cache the resource.

To cache an resource, we need to send a Cache-Control HTTP header with a suitable timeout to cache. The timeout is measured in seconds and is identified as the max-age of the resource.

So for example, to tell the browser to cache resources for 1 day, we would use a max age of 60s 60m 24h = 86400. To tell the browser to cache for 1 week, we would use a max-age of 60s 60m 24h * 7d = 604800.

So, how do we configure S3 to specify a cache time ?

Within the S3 Console, we can select either an individual resource, or an entire directory and specify this value. To do this, we select to edit the Metadata of the selected objects.

Edit Metadata Report

On the resulting screen, we select the Add Metadata section and add a System Defined piece of metadata with a key of Cache-Control and a value of max-age=<number of seconds>

Add Metadata Report

In this image, you can see that I've set the max-age to 7 days (60s 60m 24h * 7d).

That's all there is to it! Save the changes to the metadata and S3 will start hosting your resources with an effective cache control.

Credits

Photo by Marc-Olivier Jodoin on Unsplash.