Feat: add serve static with by ross-byrne · Pull Request #113 · gleam-wisp/wisp

ross-byrne · 2025-03-12T14:25:03Z

Resolves #103

Adds a new function serve_static_with which takes options for enabling etags and setting response headers for specific file types.

Context

There are two main use cases this PR is trying to address and they are, from what I can tell, the two more common scenarios where you'd want caching. Caching it too complicated for us to support every possible option.

Use Case 1

A user is serving a site, webapp or SPA that is built using a build tool (bias: my use case). The build tool fingerprints the files, so the names change when the contents change. This seems to be normal for a lot of frameworks. In this case, you can safely tell the browser to cache the JS and CSS for the longest possible time, as long as you don't cache your index.html or equivalent entry point. You can do this by setting the cache-control header to something like: cache-control: max-age=31536000, immutable, private. This provides the best performance because the browser will instantly use the cached JS/CSS and will only request new versions if something changes in the index.html file.

Use Case 2

The second use case is, you don't have unique filename fingerprinting but you still want to cache assets to speed up page loads. This is where you reach for Etags. You get most of the performance of not having to re-download all the JS on page reload but without having to use a build tool to fingerprint the file names. The downside is, the browser will still have to hit the server to check the Etag values. So you don't save as much bandwidth, time and compute as the first option but still save a lot compared to not caching anything. The use case for supporting both Etags AND setting custom headers is, you can configure how long the browser will wait to re-validate assets with an Etag or use cache-control: stale-while-revalidate=<some_value> to allow the browser to use old assets while waiting for the validation to happen.

Conclusion

I hope the reasoning here is a bit clearer. It's a messy topic that has more than one way of solving, so I tried to go with the most flexible option. I did also take inspiration from other frameworks that allow setting headers separately from toggling Etags, such as hono, express. There was a bigger list in the issue as well.

ross-byrne · 2025-03-12T14:28:50Z

@lpil Here's the PR for #103. Feedback is probably needed here, particularly around naming and the new types being introduced.

Thanks 👍

ross-byrne · 2025-03-12T14:30:01Z

Question: do we want to add an example for this or will the documentation be enough?

lpil

Thank you!

Do we not want to always use etags?

What's the use case for setting headers, and the use case for setting headers only for certain file extensions?

ross-byrne · 2025-03-14T11:23:33Z

@lpil Great questions. Apologies in advance for the short novel, I should have included more context in the PR description.

Do we not want to always use etags?

No, not necessarily. Using etags as a caching strategy is a trade off, like most things. Etag generation is not that expensive but it's not free. The browser will also always have to hit the server to make sure the resource is still valid. That is faster than downloading the file again but slower than the browser using the cached version without verifying it first.

What's the use case for setting headers, and the use case for setting headers only for certain file extensions?

The use case for setting headers only for certain file extensions is, generally you don't want to cache markup files for too long but you do want to cache things like JS and CSS. As an example, say you have an index.html that references different JS and other files. You might want to cache those other resources but not the markup, so when you update the index.html and it references the new updated files, the browser will pull them down instead of using the cached versions (if the names are different). If the markup is cached, you basically can't get changes to the user without coding some specific solution.

To take a step back, there are two main use cases this PR is trying to address and they are, from what I can tell, the two more common scenarios where you'd want caching. Caching it too complicated for us to support every possible option.

Use Case 1

A user is serving a site, webapp or SPA that is built using a build tool (bias: my use case). The build tool fingerprints the files, so the names change when the contents change. This seems to be normal for a lot of frameworks. In this case, you can safely tell the browser to cache the JS and CSS for the longest possible time, as long as you don't cache your index.html or equivalent entry point. You can do this by setting the cache-control header to something like: cache-control: max-age=31536000, immutable, private. This provides the best performance because the browser will instantly use the cached JS/CSS and will only request new versions if something changes in the index.html file.

Use Case 2

The second use case is, you don't have unique filename fingerprinting but you still want to cache assets to speed up page loads. This is where you reach for Etags. You get most of the performance of not having to re-download all the JS on page reload but without having to use a build tool to fingerprint the file names. The downside is, the browser will still have to hit the server to check the Etag values. So you don't save as much bandwidth, time and compute as the first option but still save a lot compared to not caching anything. The use case for supporting both Etags AND setting custom headers is, you can configure how long the browser will wait to re-validate assets with an Etag or use cache-control: stale-while-revalidate=<some_value> to allow the browser to use old assets while waiting for the validation to happen.

Conclusion

I hope the reasoning here is a bit clearer. It's a messy topic that has more than one way of solving, so I tried to go with the most flexible option. I did also take inspiration from other frameworks that allow setting headers separately from toggling Etags, such as hono, express. There was a bigger list in the issue as well.

lpil · 2025-03-14T13:16:57Z

Thank you!

Etag generation is not that expensive but it's not free. The browser will also always have to hit the server to make sure the resource is still valid. That is faster than downloading the file again but slower than the browser using the cached version without verifying it first.

Avoiding a single file_info call doesn't seem like a good justification, especially since the previous implementation already did it once to determine if the file exists, and the new one does it twice. We could use that first read for both, and then always set etags.

To take a step back, there are two main use cases this PR is trying to address and they are, from what I can tell, the two more common scenarios where you'd want caching. Caching it too complicated for us to support every possible option.

I can see why you'd want to add cache headers, but the API given seems either too general or too restrictive. I think we should either have specific support for caching headers, or it should be a much more flexible API than having a fixed set of any headers to be attached to a single fixed set of file extensions.

How about we be conservative with the scope of this PR and focus on adding etags, and then add add a default cache-control, similar to how Plug.Static does it https://hexdocs.pm/plug/1.15.3/Plug.Static.html#module-cache-mechanisms.

To make the API forwards-compatible let's use a builder API for the static asset configuration. Then we can add more functionality to it in future without doing a major version bump.

ross-byrne · 2025-03-14T13:56:23Z

Avoiding a single file_info call doesn't seem like a good justification, especially since the previous implementation already did it once to determine if the file exists, and the new one does it twice. We could use that first read for both, and then always set etags.

A fair point, I agree. My main motivation was more, I want to set the cache-control headers directly to avoid the issue. Perhaps following Plug and making etags a default that can be overridden in the future is the best path forward.

How about we be conservative with the scope of this PR and focus on adding etags, and then add add a default cache-control, similar to how Plug.Static does it https://hexdocs.pm/plug/1.15.3/Plug.Static.html#module-cache-mechanisms.

I think this is a good compromise. I wasn't aware of Plug but I like how they handle the config. I do still think it's important to have control over setting response headers and controlling what files get served, but I agree my proposed solution isn't totally satisfying either. I think we could use Plug as an example rather than trying to re-invent. Let's stick with Etags for now and figure out the rest after.

To make the API forwards-compatible let's use a builder API for the static asset configuration. Then we can add more functionality to it in future without doing a major version bump.

I hadn't considered the builder pattern but that's a great idea. In terms of inspiration, would it be good to look at how POG handles config?

lpil · 2025-03-14T15:00:15Z

Pog would be a good one to look at, aye! I'm not sure if it would be in a different module or not though.

ross-byrne · 2025-03-14T16:23:34Z

I'm not sure if it would be in a different module or not though

I was wondering the same. It would be cleaner to have it as a module but maybe we can leave it in the main wisp module and see what it looks like? Then we can move it pretty easily if we want before merging.

So to recap on what the plan is. We're still adding the serve_static_with function but it's going to take a StaticConfig (or something like that) that can be set using a builder pattern. The only setting being etag generation for now, which is on by default.

I'm imagining something roughly like the following:

use <- wisp.serve_static_with(
     req,
     under: "/static",
     from: priv,
     config: wisp.default_static_config()
)

or manually setting the config like:

let config = wisp.default_static_config()
  |> wisp.with_etag_generation

use <- wisp.serve_static_with(req, under: "/static", from: priv, config:)

Thoughts on the above before I start working on it?

lpil · 2025-03-18T16:13:40Z

So to recap on what the plan is. We're still adding the serve_static_with function but it's going to take a StaticConfig (or something like that) that can be set using a builder pattern. The only setting being etag generation for now, which is on by default.

No, we want to always have etag generation on as we've not found any reason to not have it. If we have further configuration we want to have a builder API, not a configuration record as a final argument to a new _with function.

ross-byrne · 2025-03-18T16:26:01Z

Ah OK, I misunderstood. So I'll just add etag generation to the existing serve_static function and not add anything new?

If we have further configuration we want to have a builder API, not a configuration record as a final argument to a new _with function.

OK sure. I don't have a clear picture of what that would look like right now but it's not important for this PR.

ross-byrne · 2025-03-19T09:06:24Z

@lpil I've made that change now

lpil

Thank you! This is still reading for file information twice though, let's make it do it only once as discussed by removing the is_file check. Inlining the generate_etag function may make it clearer.

Could you update the changelog also please 🙏

ross-byrne · 2025-03-19T14:06:21Z

Updated. I think we should be more or less there now.

lpil

Looks great! One last thing

lpil · 2025-03-19T15:22:05Z

src/wisp.gleam

-      case simplifile.is_file(path) {
-        Ok(True) ->
+      case simplifile.file_info(path) {
+        Ok(file_info) -> {


Could you add a test to make sure that if the file is a directory then it returns a 404 🙏

Great catch. This actually didn't work the way it looked like it did. We still have to check file_info for if it's a file or directory.

I added the extra check, it's not as clean looking as before but file_info_type just does a bitwise op on file_info.mode. So we're still only reading the file info once. Let me know what you think.

Also, based on the current functionality, serve_static returns OK with an empty body if the file can't be found. So I added a test to make sure the same happens with a directory.

lpil · 2025-03-20T12:19:50Z

test/wisp_test.gleam

+
+  // Get a directory
+  let response =
+    testing.get("/stuff/", [])


This directory doesn't exist, so this test doesn't test what it claims to test. It's instead a file-not-found test.

Please move this into a new test (as each test should only test one thing) and also assert that the directory exists, to catch this problem should directory not exist by mistake.

The directory does exist. In this test /stuff is mapped to the root directory. Removing the file check causes this to fail, confirming it.

But that aside, fair point. I've moved the directory test to its own test and added a check to assert what's being requested is actually a directory. I also changed it to request the /test directory to be more similar to the other tests, which request files inside that one.

Oh! Sorry about that, I misread the code

lpil

Thank you so much!!!

ross-byrne added 4 commits March 10, 2025 19:39

Added generate etag internal function

e0c3624

Added serve_static_with

37b4435

Cleaned up handle_etag function

07f813f

Added tests

35277b5

lpil reviewed Mar 13, 2025

View reviewed changes

ross-byrne added 3 commits March 19, 2025 09:59

Revert changes and add etag generation to serve_static

167a425

Removed unused function

7ff4dd9

clean up tests

3a7d96b

lpil reviewed Mar 19, 2025

View reviewed changes

No longer reading file twice for information. Updated changelog.

53b5797

lpil reviewed Mar 19, 2025

View reviewed changes

Correctly handling directories passed to serve_static

2e7dc13

lpil reviewed Mar 20, 2025

View reviewed changes

Updated directory test

c2e5d64

lpil approved these changes Mar 21, 2025

View reviewed changes

lpil merged commit 6083d4c into gleam-wisp:main Mar 21, 2025
1 check failed

Conversation

ross-byrne commented Mar 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Use Case 1

Use Case 2

Conclusion

Uh oh!

ross-byrne commented Mar 12, 2025

Uh oh!

ross-byrne commented Mar 12, 2025

Uh oh!

lpil left a comment

Choose a reason for hiding this comment

Uh oh!

ross-byrne commented Mar 14, 2025

Use Case 1

Use Case 2

Conclusion

Uh oh!

lpil commented Mar 14, 2025

Uh oh!

ross-byrne commented Mar 14, 2025

Uh oh!

lpil commented Mar 14, 2025

Uh oh!

ross-byrne commented Mar 14, 2025

Uh oh!

lpil commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ross-byrne commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ross-byrne commented Mar 19, 2025

Uh oh!

lpil left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ross-byrne commented Mar 19, 2025

Uh oh!

lpil left a comment

Choose a reason for hiding this comment

Uh oh!

lpil Mar 19, 2025

Choose a reason for hiding this comment

Uh oh!

ross-byrne Mar 20, 2025

Choose a reason for hiding this comment

Uh oh!

ross-byrne Mar 20, 2025

Choose a reason for hiding this comment

Uh oh!

lpil Mar 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ross-byrne Mar 20, 2025

Choose a reason for hiding this comment

Uh oh!

lpil Mar 21, 2025

Choose a reason for hiding this comment

Uh oh!

lpil left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ross-byrne commented Mar 12, 2025 •

edited

Loading

lpil commented Mar 18, 2025 •

edited

Loading

ross-byrne commented Mar 18, 2025 •

edited

Loading

lpil left a comment •

edited

Loading

lpil Mar 20, 2025 •

edited

Loading