What Happens After You Hit Deploy: Checking That Your New Version Actually Works

The deploy button has been pressed. The pipeline shows green. Your team lets out a collective breath. But if you think the work is done, you are setting yourself up for a rude awakening.

I have seen this pattern play out in countless teams. A new version goes live, everyone celebrates, and then thirty minutes later the first support ticket arrives. The feature works in staging, but in production it is eating memory. The API responds fine to your test calls, but real user traffic exposes a race condition nobody caught.

Release is not the finish line. It is the moment when the real test begins.

Why Staging Is Not Enough

Your team probably ran tests before release. Unit tests passed. Integration tests were green. The staging environment looked healthy. All of that is good, but it is not sufficient.

Staging has no real users. It has no real data volume. It does not have the unpredictable traffic patterns that production handles daily. A database query that runs in 50 milliseconds on staging can take five seconds on production because the production table has millions of rows. A feature that works fine with test accounts can break when hit with authentication tokens from a dozen different identity providers.

This gap between staging and production is why post-release checking exists. You need to confirm that the version running on production actually works the way you expect, under real conditions.

Start With a Smoke Test

The first thing to do after release is a smoke test. The name comes from electronics: when you power on a circuit board for the first time, you check whether smoke comes out. No smoke means the board is not burning up, and you can proceed with deeper testing.

In software, a smoke test is a quick check to see if the new version is alive. You hit the main page and confirm it loads. You call a critical API endpoint and check that it returns a response. You verify that the database connection is established. You make sure the login flow does not immediately crash.

Smoke tests are shallow by design. You are not testing every feature or edge case. You are answering one question: did the deployment actually work, or is the application broken on arrival?

The following flowchart maps the post-release verification process described in this article, including the decision points for rollback.

flowchart TD
    A[Release] --> B[Smoke Test]
    B -->|Pass| C[Verification]
    B -->|Fail| D[Rollback]
    C -->|Pass| E[Health Monitoring]
    C -->|Fail| D
    E -->|Anomaly Detected| F{Severity?}
    F -->|Minor| G[Hotfix]
    F -->|Major| D
    E -->|Normal| H[Continue Monitoring]
    G --> E

Here is a simple bash command you can run right after deployment to perform a smoke test:

curl -f https://prod.example.com/health && echo "Smoke test passed" || echo "Smoke test failed"

A good smoke test takes less than two minutes. If you have automated it, it runs in seconds. If you are doing it manually, keep the checklist short. Five to ten checks maximum. Anything longer and you are drifting into verification territory.
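If you automate it, keep the script as boring as the checklist. Here is a minimal sketch in bash; the base URL matches the earlier example, and the endpoint paths are placeholders you should swap for your own critical routes.

#!/usr/bin/env bash
# Minimal post-deploy smoke test: a handful of shallow checks, fail fast.
# BASE_URL and the paths below are placeholders, not real endpoints.
set -euo pipefail

BASE_URL="https://prod.example.com"

check() {
  local name="$1" url="$2"
  if curl -sf --max-time 10 "$url" > /dev/null; then
    echo "PASS  $name"
  else
    echo "FAIL  $name"
    exit 1
  fi
}

check "Main page loads"        "$BASE_URL/"
check "Health endpoint"        "$BASE_URL/health"
check "Critical API responds"  "$BASE_URL/api/v1/status"

echo "Smoke test passed"

Wire it into the deployment pipeline so it runs on its own after every release instead of waiting for someone to remember it.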

Move Into Verification

Once the smoke test passes, you need to verify that the new version behaves correctly. Verification is more systematic. You compare the actual behavior of the system against what you expected.

This is where your existing test suite becomes useful again. Run the critical automated tests against production. Not the full regression suite, but the subset that covers the features you just changed. If you added a new checkout flow, verify that orders can be placed. If you changed the search algorithm, verify that search results make sense.
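How you select that subset depends on your test runner. As one hedged example, if your suite tags tests by feature (say, with pytest markers), the invocation could look like the line below; the marker name, the environment variable, and the test path are all placeholders.

# Hypothetical invocation: run only the tests tagged for the changed feature
# (here a "checkout" marker) against production, stopping at the first failure.
# The marker name, TARGET_ENV, and the test path are illustrative, not standard.
TARGET_ENV=production pytest -m checkout --maxfail=1 -q tests/e2e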

Verification also includes manual checks for things that are hard to automate. Check the logs for unexpected errors. Look at the data that was processed after the release. Confirm that the new feature is reachable through the normal user interface.
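For the log check, something as simple as the sketch below is often enough. It assumes a systemd service called myapp and a deploy timestamp you fill in yourself; if your logs live in Kubernetes or a log aggregator, run the equivalent query there instead.

# Scan the service logs for new errors since the deployment.
# "myapp" and the timestamp are placeholders; adapt them to where your logs actually live.
DEPLOY_TIME="2024-05-01 14:30"
journalctl -u myapp --since "$DEPLOY_TIME" --no-pager \
  | grep -iE "error|exception|traceback" \
  | head -n 20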

The key difference from smoke testing is depth. Smoke testing asks "is it alive?" Verification asks "is it working correctly?"

Watch the Health Signals

Smoke tests and verification are active checks. You send requests and observe responses. But there is another layer of checking that happens passively: monitoring health signals.

Health signals are the metrics that tell you whether your system is in good shape. CPU usage, memory consumption, request rate, error rate, response time, database connection pool utilization, queue depth. These numbers change continuously as users interact with your system.

After a release, you want to watch these signals for anomalies. A sudden spike in error rate is a red flag. A gradual increase in memory usage might indicate a leak. Response time that climbs steadily could mean the new code is slower under load.

The difference between health signals and active testing is important. Active testing tells you what happens when you specifically ask. Health signals tell you what is happening naturally as users use the system. Both are necessary.

Set up dashboards that show these metrics side by side with the previous version's baseline. Better yet, configure alerts that trigger when metrics cross thresholds. You do not want to discover a problem by staring at a graph for an hour. You want the system to tell you when something is wrong.
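In practice this logic belongs in your alerting rules, but a quick sketch shows the idea. The example below assumes a Prometheus-compatible API, a conventional request counter metric, and a 1% error-rate threshold; all three are assumptions to replace with whatever your monitoring stack actually exposes.

#!/usr/bin/env bash
# Sketch of a post-release threshold check against a Prometheus-compatible API.
# The Prometheus URL, the metric name, and the 1% threshold are assumptions.
PROM_URL="https://prometheus.example.com"
QUERY='sum(rate(http_requests_total{status=~"5.."}[5m])) / sum(rate(http_requests_total[5m]))'

error_rate=$(curl -sf --get --data-urlencode "query=$QUERY" "$PROM_URL/api/v1/query" \
  | jq -r '.data.result[0].value[1] // "0"')

# Alert when the 5xx share of traffic crosses the agreed threshold.
if awk -v rate="$error_rate" 'BEGIN { exit !(rate > 0.01) }'; then
  echo "ALERT: 5xx error rate $error_rate is above the 1% threshold"
else
  echo "OK: 5xx error rate $error_rate"
fi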

How Long Should You Check?

The duration of post-release checking depends on the change.

For a small bug fix or a minor feature addition, a few minutes of smoke testing and health signal observation is enough. If nothing breaks in the first ten minutes, the change is probably safe.

For a major feature, a database migration, or an infrastructure change, you need longer. Plan to monitor for several hours. Some teams keep a close watch for a full business day after a large release. The reason is that certain problems take time to surface. A memory leak might not show up until the system has processed thousands of requests. A slow query might only become visible when concurrent users reach a certain threshold.

The important thing is to define the checking period before the release. Agree on what you will check, for how long, and what threshold triggers a rollback. Do not make these decisions in the middle of an incident.
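One way to make that agreement concrete is a small watch script with the window and the checks baked in. The sketch below assumes a hypothetical check_error_rate.sh that exits non-zero when one of the pre-agreed thresholds is crossed; the sixty-minute window is just an example.

#!/usr/bin/env bash
# Time-boxed post-release watch. WATCH_MINUTES and check_error_rate.sh are
# placeholders for the window and threshold checks your team agreed on before the release.
WATCH_MINUTES=60
end=$(( $(date +%s) + WATCH_MINUTES * 60 ))

while [ "$(date +%s)" -lt "$end" ]; do
  if ! ./check_error_rate.sh; then
    echo "Threshold crossed: follow the rollback runbook"
    exit 1
  fi
  sleep 60
done

echo "Watch window ended with no threshold breaches"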

When Things Go Wrong

If your post-release checks find a problem, you need to decide what to do. The decision usually comes down to two options: hotfix or rollback.

A hotfix is appropriate when the problem is small, isolated, and can be fixed quickly. A typo in a configuration value. A missing permission on a new endpoint. A CSS glitch that only affects one browser. You can patch it and deploy again without disrupting users.

A rollback is appropriate when the problem is serious. Data corruption. A security vulnerability. A feature that breaks the entire user flow. A performance regression that makes the system unusable. In these cases, the fastest way to restore service is to revert to the previous version.
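What a rollback looks like mechanically depends on your platform. On Kubernetes, for example, it can be as short as the two commands below, where the deployment name web is a placeholder; on other platforms the equivalent might be redeploying the previous image tag or flipping a blue-green switch.

# Rollback sketch for a Kubernetes deployment named "web" (a placeholder).
kubectl rollout undo deployment/web
# Wait until the previous version is fully rolled out before re-running the smoke test.
kubectl rollout status deployment/web --timeout=120s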

The decision should be made before the release, not during the crisis. Define what counts as a hotfix-worthy problem and what counts as a rollback-worthy problem. Document it. Make sure everyone on the team knows the criteria.

A Practical Post-Release Checklist

Here is a short checklist you can adapt for your team. Adjust the items based on your application and risk tolerance.

  • Smoke test: main page loads, critical API responds, database connected
  • Automated verification: run the test subset that covers changed features
  • Manual verification: check logs, confirm new feature is accessible
  • Health signals: review error rate, response time, resource usage
  • Alert configuration: verify that monitoring alerts are active and thresholds are set
  • Rollback criteria: confirm the team knows what triggers a rollback and who decides

The Real Takeaway

Post-release checking is not optional. It is the step that separates teams who ship confidently from teams who ship and pray. The goal is not to eliminate all production issues; that is impossible. The goal is to catch problems quickly, before they affect many users, and to have a clear plan for responding when you find them.

Your deployment is not complete when the pipeline turns green. It is complete when you have confirmed that the new version is running correctly in production, under real traffic, and your health signals look normal. That is the moment you can truly say the release was successful.