26-5 · Chapter 26 · 6 min read

Why Terraform Needs a State File (And Why You Should Never Store It on Your Laptop)

You just ran terraform apply and successfully created a server and a database. Everything works. You close your laptop, satisfied. Tomorrow, you need to

Why Terraform Needs a State File (And Why You Should Never Store It on Your Laptop)

You just ran terraform apply and successfully created a server and a database. Everything works. You close your laptop, satisfied. Tomorrow, you need to add a load balancer. You open the configuration, add the resource, and run terraform plan.

Terraform stares back at you with a plan that wants to recreate everything from scratch.

What happened? Terraform has no memory of what it built yesterday. It doesn't know that server exists. It doesn't know that database is running. It has no record of the infrastructure it just created.

This is the moment most teams realize they need a state file.

What a State File Actually Does

A state file is Terraform's working memory. It records every resource Terraform has created, including unique IDs, the last known configuration, and how resources relate to each other. When you run terraform plan, Terraform doesn't go back to your cloud provider to ask "what exists?" That would be slow, inconsistent, and unreliable across different providers. Instead, it reads the state file and compares it against your current configuration.

Think of it as a receipt. After every purchase, you get a receipt that says what you bought, when, and for how much. Without that receipt, you'd have to inspect your entire house to figure out what you own. Terraform is the same. Without a state file, it has to scan your entire cloud account to figure out what it manages. And even then, it can't be sure which resources it created versus which ones someone else created manually.

Here is a visual comparison of the two workflows:

flowchart TD subgraph WithStateFile[With State File] A1[Read state file] --> B1[Compare state vs config] B1 --> C1[Produce accurate plan] C1 --> D1[Apply changes safely] end subgraph WithoutStateFile[Without State File] A2[Scan entire cloud account] --> B2[Guess which resources Terraform owns] B2 --> C2[Uncertain plan: risk of duplicates] C2 --> D2[Apply may recreate everything] end WithStateFile --> |Reliable| Outcome1[Predictable infrastructure] WithoutStateFile --> |Risky| Outcome2[Duplicates, conflicts, high bills]

The state file contains detailed information about each resource. For a cloud server, that includes the server ID, IP address, machine type, availability zone, and network configuration. For a database, it includes the endpoint, port, storage size, and backup settings. This information is the source of truth for Terraform's next operation.

The Laptop Problem

Many developers start by storing the state file locally. It's the easiest thing to do. Terraform creates a file called terraform.tfstate in your working directory, and you move on.

This works until:

Your laptop dies or gets stolen.
You accidentally delete the file.
You switch computers and forget to copy the file.
A teammate runs Terraform from their machine and creates a different state file.

When the state file disappears, Terraform loses all knowledge of your infrastructure. It thinks nothing exists. If you run terraform apply again, it tries to create everything from scratch. You end up with duplicate resources, conflicting configurations, and a cloud bill that suddenly doubles.

Even worse, if someone manually changes a resource in the cloud console without updating the state file, Terraform becomes desynchronized. It thinks a server has 4GB of RAM, but the actual server has 8GB. The next terraform plan produces a plan that doesn't match reality.

Remote State: The Only Sensible Approach

The fix is straightforward: store the state file in a shared location that everyone on your team can access. This is called remote state.

Remote state means the state file lives in a storage service like Amazon S3, Azure Storage, or Google Cloud Storage. Every team member and every CI/CD pipeline reads from and writes to the same state file. This ensures everyone works from the same source of truth.

Setting up remote state is usually a one-time configuration in your Terraform backend block:

terraform {
  backend "s3" {
    bucket = "my-team-terraform-state"
    key    = "production/terraform.tfstate"
    region = "us-east-1"
  }
}

Once configured, Terraform automatically fetches the state file from remote storage before every plan and pushes updates after every apply. No one needs to manually copy files or worry about version conflicts.

State Locking: Preventing Collisions

With remote state, multiple people can run Terraform at the same time. This introduces a new problem: two people running terraform apply simultaneously can corrupt the state file or create inconsistent infrastructure.

State locking prevents this. When one process starts terraform apply, it locks the state file. Other processes must wait until the lock is released. Most remote state backends support locking. S3 uses DynamoDB for locking. Azure Storage has built-in lease support. Google Cloud Storage uses object generation numbers.

Without locking, you risk scenarios like:

Two pipelines deploying different changes at the same time.
A developer running a manual apply while a CI/CD pipeline is mid-deployment.
Partial updates where only half the changes get recorded.

State locking is not optional for team environments. It's a requirement.

Sensitive Data in State Files

State files contain everything Terraform knows about your infrastructure. That includes sensitive information like database passwords, API keys, and connection strings. Terraform stores these values in plain text inside the state file.

This means your state file is a security risk. Anyone with access to the state file can read your secrets. Remote state backends typically support encryption at rest and in transit. S3 supports server-side encryption with KMS. Azure Storage supports encryption at rest by default. Always enable encryption for your state storage.

Some teams also separate state files by environment. Development, staging, and production each get their own state file stored in separate locations. This prevents changes in one environment from affecting another. It also limits the blast radius if a state file gets compromised.

How State Files Work in CI/CD

In a CI/CD pipeline, the pipeline needs access to the state file to run Terraform. The pipeline authenticates to the remote state backend using credentials or IAM roles. It fetches the state file, runs terraform plan, and if approved, runs terraform apply and pushes the updated state back.

This is why remote state is essential for automation. A CI/CD pipeline running in an ephemeral container cannot rely on a state file stored on a developer's laptop. It needs a consistent, accessible location that exists independently of any individual machine.

Practical Checklist for State File Management

Store state files in remote storage, not on local machines.
Enable encryption at rest and in transit for the storage backend.
Configure state locking to prevent concurrent modifications.
Use separate state files for each environment.
Restrict access to state files using IAM policies or access controls.
Never commit state files to version control.
Back up state files regularly, especially before major changes.

What This Means for Your Team

State files are not an implementation detail you can ignore. They are the backbone of Terraform's ability to manage infrastructure reliably. Without a properly managed state file, your Terraform workflows will produce unpredictable results, duplicate resources, and hard-to-debug failures.

The moment you have more than one person running Terraform, or the moment you add a CI/CD pipeline, move your state file to remote storage. It takes ten minutes to configure and saves you from hours of recovery work when something goes wrong.

Your infrastructure is too important to trust to a file on your laptop.