Releasing Frontend Changes Without Breaking Everything

You've just pushed a new version of your frontend. The build passed, the tests are green, and the deployment pipeline says "success." But when you check the production metrics, something is off. Some users are seeing a broken layout. Others report that a button doesn't work. And a few users are still on the old version anyway, because their browser hasn't picked up the new files yet.

This is the reality of frontend releases. Unlike backend services where you can restart a server and instantly switch versions, frontend code lives in the user's browser. You can't force every user to reload. You can't control when their cache expires. And if something goes wrong, you can't just hit a rollback button and expect everyone to get the fix immediately.

Why Frontend Releases Are Different

The core problem is that frontend assets are distributed across thousands of browsers, each with its own cache state. When you deploy a new version, some users get it right away, some get it after their cache expires, and some might be stuck with a broken version until they manually refresh.

For static frontends served through a CDN, the challenge is that you're not deploying to a server you control. You're uploading files to a storage bucket and letting the CDN distribute them. The CDN might serve old files to some users and new files to others, depending on cache headers and edge node behavior.

For server-side rendered (SSR) frontends, the challenge is different. You're dealing with actual servers that need to be updated without dropping active connections. A user filling out a form or navigating between pages shouldn't lose their session because you switched versions.

The following flowchart helps you choose the right release strategy based on your frontend type and risk tolerance:

flowchart TD
    A[Frontend type?] --> B[Static]
    A --> C[SSR]
    B --> D[Staged rollout via CDN]
    D --> E[Monitor errors]
    E --> F{Errors low?}
    F -->|Yes| G[Increase traffic]
    F -->|No| H[Rollback: switch pointer]
    G --> I[100% rollout]
    C --> J{Risk tolerance?}
    J -->|Low| K[Canary release]
    J -->|Very low| L[Blue-green deployment]
    K --> M[Deploy to few instances]
    M --> N[Monitor health]
    N --> O{Healthy?}
    O -->|Yes| P[Gradually add instances]
    O -->|No| Q[Remove new instances]
    L --> R[Deploy green env]
    R --> S[Switch load balancer]
    S --> T{Issues?}
    T -->|Yes| U[Flip back to blue]
    T -->|No| V[Keep green]

Staged Rollouts for Static Frontends

The safest way to release a static frontend is to control how many users see the new version. Many CDNs and edge platforms support weighted routing, where you send a configurable percentage of requests to the new files while the rest continue to receive the old ones.

Here's how it typically works:

  1. Upload the new version to your storage bucket with a unique path, like app-v2/ or using content-hashed filenames.
  2. Configure your CDN to route 5% of requests to the new files.
  3. Monitor error rates, page load times, and user-reported issues.
  4. If everything looks good, gradually increase the percentage to 25%, then 50%, then 100%.
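If your platform exposes an edge function or routing hook, the percentage split can also be made deterministic, so a given user always lands on the same version across requests instead of flipping back and forth. A minimal sketch, assuming a hypothetical edge handler and the versioned paths from step 1 (the bucketing scheme and path names are illustrative):

```typescript
// Map each user to a stable bucket in [0, 100) so the rollout
// percentage translates directly into a share of users, and a given
// user keeps seeing the same version as the percentage grows.
function bucketFor(userId: string): number {
  let hash = 0;
  for (const ch of userId) {
    hash = (hash * 31 + ch.charCodeAt(0)) >>> 0;
  }
  return hash % 100;
}

// Pick the versioned path for this request based on the current
// rollout percentage. "app-v1" / "app-v2" match the unique upload
// paths described in step 1.
function versionFor(userId: string, rolloutPercent: number): string {
  return bucketFor(userId) < rolloutPercent ? "app-v2" : "app-v1";
}
```

Raising the rollout from 5% to 25% then only moves users whose bucket falls in the new range; everyone already on the new version stays there.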

You can also use feature flags on the client side. Serve the same application shell to everyone, but conditionally load new components or features based on a cookie, URL parameter, or user ID. This gives you finer control over who sees what, without needing CDN-level traffic splitting.
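A minimal sketch of that client-side check, assuming a hypothetical ff_new_checkout cookie plus a percentage rollout keyed on user ID (the cookie name, flag name, and bucketing are all illustrative, not tied to any particular flag service):

```typescript
// Decide whether to load the new component for this user.
// An explicit cookie wins (useful for QA and opt-in testing);
// otherwise fall back to a stable percentage bucket.
function isFlagEnabled(
  cookies: Record<string, string>,
  userId: string,
  percent: number
): boolean {
  if (cookies["ff_new_checkout"] === "1") return true;  // forced on
  if (cookies["ff_new_checkout"] === "0") return false; // forced off
  let hash = 0;
  for (const ch of userId) hash = (hash * 31 + ch.charCodeAt(0)) >>> 0;
  return hash % 100 < percent;
}
```

In the application shell this would gate a dynamic import, e.g. `if (isFlagEnabled(cookies, userId, 5)) { /* load new checkout */ }`, so both code paths ship in one deployment and the split needs no CDN support.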

The key advantage of staged rollouts is that you can stop at any point. If error rates spike at 10%, you drop the percentage back to zero. Users who already loaded the new version might still see issues, but you've limited the blast radius.

Canary Releases for SSR Frontends

For SSR frontends, the equivalent of a staged rollout is a canary release, which works much like backend canary deployments. You run multiple instances of your application behind a load balancer. When you want to release a new version, you deploy it to one or two instances while the rest keep running the old version.

The load balancer is configured to send a small percentage of traffic to the new instances. If health checks fail or error rates increase, those instances are removed from rotation. Rollback is as simple as shutting down the new instances and letting all traffic return to the old ones.

One caveat: SSR applications often maintain state on the server side. If a user's session lives in the memory of a specific instance, switching traffic between versions will log users out, so session data needs to be stored in a shared location like Redis or a database.
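A toy simulation of that failure mode, with Map-backed stores standing in for instance memory on one hand and a shared store like Redis on the other (the classes and names are illustrative):

```typescript
// An "Instance" is a stand-in for one app server; its session store
// is either private to it or shared with other instances.
class Instance {
  sessions: Map<string, string>;
  constructor(sessions: Map<string, string>) {
    this.sessions = sessions;
  }
  login(user: string): void {
    this.sessions.set(user, "session-data");
  }
  isLoggedIn(user: string): boolean {
    return this.sessions.has(user);
  }
}

// Instance-local stores: the canary instance never saw the login,
// so moving the user's traffic there logs them out.
const oldLocal = new Instance(new Map());
const newLocal = new Instance(new Map());
oldLocal.login("alice");

// Shared store: both versions read the same session data, so the
// user survives the traffic switch.
const shared = new Map<string, string>();
const oldShared = new Instance(shared);
const newShared = new Instance(shared);
oldShared.login("alice");
```

The same reasoning applies to blue-green deployments below: any switch that moves live users between environments assumes the session store is outside both of them.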

To automate this process, you can define a canary rollout in your deployment configuration. Here's an example using Argo Rollouts on Kubernetes:

apiVersion: argoproj.io/v1alpha1
kind: Rollout
metadata:
  name: frontend-ssr
spec:
  replicas: 10
  selector:
    matchLabels:
      app: frontend-ssr
  template:
    metadata:
      labels:
        app: frontend-ssr
    spec:
      containers:
        - name: frontend-ssr
          image: frontend-ssr:v2   # example image tag
  strategy:
    canary:
      steps:
        - setWeight: 10
        - pause: { duration: 5m }
        - setWeight: 50
        - pause: { duration: 5m }
        - setWeight: 100
      analysis:
        templates:
          - templateName: success-rate
        startingStep: 0
        args:
          - name: service-name
            value: frontend-ssr
---
apiVersion: argoproj.io/v1alpha1
kind: AnalysisTemplate
metadata:
  name: success-rate
spec:
  args:
    - name: service-name
  metrics:
    - name: error-rate
      successCondition: result[0] < 0.01
      provider:
        prometheus:
          address: http://prometheus.monitoring.svc:9090   # example address
          query: |
            sum(rate(http_requests_total{service="{{args.service-name}}",status=~"5.."}[5m])) /
            sum(rate(http_requests_total{service="{{args.service-name}}"}[5m]))

This configuration routes 10% of traffic to the new version, waits 5 minutes to monitor error rates, then proceeds to 50% and finally 100% only if the error rate stays below 1%.

Blue-Green Deployments for Zero Downtime

If you need to guarantee that no user experiences downtime during a release, blue-green deployment is a solid choice. You maintain two identical environments: blue (current version) and green (new version). All traffic goes to blue. Once green is ready and passes all health checks, you switch the load balancer to send all traffic to green.

If something goes wrong after the switch, you flip back to blue. The entire process takes seconds, and users see no interruption as long as session state is handled properly.
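The switch itself is essentially a pointer flip guarded by health checks. A toy sketch, with a simulated health table standing in for real health endpoints (all names illustrative):

```typescript
type Env = "blue" | "green";

// In practice these values would come from hitting each environment's
// health endpoint; here they are simulated.
const healthy: Record<Env, boolean> = { blue: true, green: true };

// The "load balancer" is modeled as a single variable naming the
// environment that receives all traffic.
let active: Env = "blue";

// Switch traffic to the target environment, but refuse the switch
// if the target fails its health checks.
function cutOver(target: Env): Env {
  if (!healthy[target]) return active;
  active = target;
  return active;
}
```

Rolling back is the same operation in the other direction: `cutOver("blue")`. Because both environments stay running, the flip takes effect as fast as the load balancer can apply it.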

The catch is that you need twice the infrastructure. For small teams or low-traffic applications, this might be overkill. But for high-traffic production systems where even a few seconds of downtime costs money or trust, blue-green is worth the overhead.

The Real Challenge: Rolling Back Static Frontends

Here's where things get tricky. Rolling back a static frontend is technically simple but practically messy. You can revert the files in your CDN to the old version, but users who already downloaded the new files won't automatically get the old ones. Their browser cache still holds the new version, and it will keep serving that until the cache expires.

The solution is to never overwrite files. Every version should have a unique path, typically based on a content hash. When you deploy, you upload new files alongside old ones. The CDN serves the version that the application points to. To roll back, you change the pointer from the new version to the old version. Users who already have the new version cached will keep using it until their cache clears, but new users and users who refresh will get the old version.
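The pointer mechanism can be sketched in a few lines, with a plain object standing in for the KV entry or small JSON file your edge layer would actually read (the version names are illustrative content hashes):

```typescript
// The pointer: which content-hashed directory is "current".
// In production this would live in a KV store or config file read
// by the CDN edge, not in application memory.
const releases = { current: "app-7f3a9c" };

// Every asset request resolves under the currently pointed-at version.
function assetUrl(path: string): string {
  return "/" + releases.current + path;
}

// Rollback changes only the pointer; the old files were never
// overwritten, so they are still sitting in the bucket.
function rollbackTo(version: string): void {
  releases.current = version;
}
```

Because old versions stay in the bucket, the same mechanism also gives you roll-forward: pointing at any previously uploaded hash is one write, not a redeploy.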

This approach means you need to think about rollback before you ever need it. If your pipeline is designed to overwrite files in place, you're setting yourself up for a painful recovery when something goes wrong.

Practical Checklist for Frontend Releases

Before you push that next release, run through this quick checklist:

  • Are all file paths unique per version (content-hashed or versioned)?
  • Can you route a small percentage of traffic to the new version?
  • Do you have a way to monitor error rates and page load times in real time?
  • Is your rollback plan tested, not just documented?
  • For SSR: is session state stored outside the application server?
  • For static: do you have a mechanism to switch the version pointer without redeploying?

What Matters Most

Staged rollouts and rollback strategies are not features you can add after the pipeline is built. They need to be designed into the deployment process from the start. If your pipeline only knows how to deploy one version to one location, you will struggle to do canary releases or blue-green deployments. If you don't have traffic management at the CDN or load balancer level, you have no way to limit the impact of a bad release.

The goal is not to prevent all problems. Problems will happen. The goal is to make sure that when a problem happens, you can respond without panic. A well-designed release strategy gives you control over the blast radius, a clear rollback path, and the confidence to ship changes frequently. Without it, every release feels like a gamble.