What is the right scope for a Terraform module — how broad or narrow should it be?

A module should manage resources that must always be provisioned together and share a strong lifecycle coupling. For example, a GCP VPC and its subnets belong together, as do a Cloud SQL instance and its IAM bindings. Avoid modules that are either too narrow (a thin wrapper around a single resource) or too broad (an entire environment stack), as both extremes hurt testability and reusability.

Why should Terraform modules always over-output resource IDs and names?

Module outputs become the glue between modules — you pass the VPC ID from your networking module into your compute module, and the database connection string from your database module into your application module. If an output is omitted initially, you must modify the module later and update every caller that references it. The recommendation from production experience is to output every resource ID and name from day one, even if they are not immediately needed.

Why should you pin module versions to Git tags instead of the main branch?

Referencing a module from the Git main branch means any merged change — including breaking changes — immediately affects all environments that use that module. Tagging releases (e.g. v1.0.0, v1.1.0) and pinning each environment to a specific tag ensures changes are deliberate and controlled. Breaking changes get a major version bump, so environments update on their own schedule rather than automatically.

How should Terraform variable validation blocks be used in production modules?

Validation blocks enforce constraints at plan time, catching misconfigurations before any infrastructure is touched. They should be used aggressively — for example, restricting the prod environment to use only db-custom-4-8192 or larger database machine types, which prevents accidental under-provisioning. Required security settings such as encryption and backup retention should have no default values, forcing each environment to make an explicit decision.

When should you use community Terraform Registry modules versus writing your own?

Community modules from the Terraform Registry — such as the Google Network and Google Cloud SQL modules — are well-maintained and save significant development time for common patterns. Custom internal modules are warranted only when community modules do not match specific requirements, such as particular IAM policies, naming conventions, or multi-region configurations for specific regions. The practical rule is to use community modules as starting points, fork and customize only when necessary, and contribute improvements back upstream when possible.

Building Reusable Terraform Modules: A Production IaC Guide

Terraform without modules is copy-paste infrastructure. You define the same VPC, the same compute instances, and the same database configurations in every environment — and then spend hours reconciling drift when you change something in one environment but forget another. I learned this the hard way at Commsult Indonesia, where we had three near-identical GCP environments (dev, staging, prod) with manually synchronized configurations. After the third incident where staging and prod drifted apart, I rewrote everything as composable Terraform modules. This guide shows the exact patterns I use.

What a Good Terraform Module Actually Is

A Terraform module is a container for multiple resources used together. The key word is 'together' — a module should represent a coherent infrastructure component, not a single resource type or an entire application stack. A module for a VPC makes sense. A module for a 'database with monitoring and alerting and backups configured' makes sense. A module that's just a thin wrapper around google_compute_instance does not — you've added abstraction without adding value. The standard module structure from HashiCorp is clear: main.tf for resource definitions, variables.tf for inputs, outputs.tf for values the caller needs, and a README.md that makes the module externally consumable.

The DRY Principle Applied to Infrastructure

The Don't Repeat Yourself principle from software engineering applies equally to infrastructure code. When you define a GCP Cloud SQL instance in main.tf of your dev environment and separately in staging and prod, you have three sources of truth — and they will diverge. A module collapses those three definitions to one, with environment-specific values passed as variables. The module enforces constraints — the prod environment can only use db-custom-4-8192 or larger, enforced by validation blocks in variables.tf. No individual can accidentally provision an undersized database in production.

Module Scope: Narrow vs Wide

The most common mistake with Terraform modules is scope creep — making modules too broad. A 'production environment' module that provisions VPCs, databases, compute instances, load balancers, and DNS records in one block is hard to test, hard to version, and hard to reuse. The right scope: a module should manage resources that must always be provisioned together and have strong lifecycle coupling. A GCP VPC and its subnets are tightly coupled — provision them together. A GCP Cloud SQL instance and its IAM bindings are coupled — provision them together. A GCP Load Balancer and the backend services it routes to — probably separate modules composed at the environment level.

From my experience: always output every resource ID and name from your modules, even if you don't use them immediately. In practice, module outputs become the glue between modules — you pass the VPC ID from your networking module into your compute module, and the database connection string from your database module into your application module. If you don't output it initially, you have to modify the module later and update all callers. Over-output from day one.
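As a rough sketch of what over-outputting looks like in practice, a networking module can expose every ID and name it creates, and the environment root then wires those outputs into the next module. The module paths, the resource names google_compute_network.this and google_compute_subnetwork.this, and the compute module's inputs are hypothetical; the subnet output assumes the subnets are created with for_each or count.

```hcl
# modules/vpc/outputs.tf (hypothetical module; resources assumed to be named "this")
output "network_id" {
  description = "ID of the VPC network"
  value       = google_compute_network.this.id
}

output "network_name" {
  description = "Name of the VPC network"
  value       = google_compute_network.this.name
}

output "network_self_link" {
  description = "Self link of the VPC network"
  value       = google_compute_network.this.self_link
}

output "subnet_ids" {
  description = "Map of subnet name to subnet ID"
  value       = { for s in google_compute_subnetwork.this : s.name => s.id }
}

# environments/prod/main.tf
module "network" {
  source = "../../modules/vpc"
  # module inputs omitted for brevity
}

module "compute" {
  source     = "../../modules/compute" # hypothetical module
  network_id = module.network.network_id
  subnet_ids = module.network.subnet_ids
}
```

Not every output here is consumed yet (network_name and network_self_link are unused), which is the point: a later caller can pick them up without a module release.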

Module Structure and File Layout

A well-structured Terraform module repository separates root modules (environment definitions that call child modules) from child modules (reusable building blocks). The convention I use at Commsult Indonesia places child modules under modules/ and environment root configs under environments/dev, environments/staging, environments/prod. Each child module has its own README.md, variables.tf with descriptions and validation blocks, outputs.tf, and main.tf. Child modules never contain provider configurations — they inherit from the root. This keeps modules environment-agnostic and truly reusable.
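One way to keep provider configuration out of child modules, sketched under assumptions: the child module only declares which provider it needs, and the root module is the single place a provider block appears. The version constraints, project ID, and region below are illustrative choices, not values from the setup described here.

```hcl
# modules/cloud-sql/versions.tf
# Child module: declares the provider requirement but does not configure it.
terraform {
  required_version = ">= 1.3" # assumed minimum

  required_providers {
    google = {
      source  = "hashicorp/google"
      version = ">= 5.0" # assumed constraint
    }
  }
}

# environments/prod/providers.tf
# Root module: configures the provider once; child modules inherit it.
provider "google" {
  project = "my-prod-project" # hypothetical project ID
  region  = "asia-southeast2" # Jakarta; an assumed choice
}
```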

# modules/cloud-sql/variables.tf
variable "instance_name" {
  description = "Cloud SQL instance name"
  type        = string
}

variable "machine_type" {
  description = "Cloud SQL machine type"
  type        = string
  default     = "db-custom-2-8192"

  validation {
    condition     = contains(["db-custom-2-8192", "db-custom-4-16384", "db-custom-8-32768"], var.machine_type)
    error_message = "Machine type must be one of the approved sizes."
  }
}

variable "environment" {
  description = "Deployment environment"
  type        = string
  validation {
    condition     = contains(["dev", "staging", "prod"], var.environment)
    error_message = "Environment must be dev, staging, or prod."
  }
}

# modules/cloud-sql/outputs.tf
output "instance_connection_name" {
  description = "Cloud SQL instance connection name"
  value       = google_sql_database_instance.this.connection_name
}

output "private_ip_address" {
  description = "Cloud SQL private IP address"
  value       = google_sql_database_instance.this.private_ip_address
}

# environments/prod/main.tf
module "database" {
  source        = "../../modules/cloud-sql"
  instance_name = "prod-db-01"
  machine_type  = "db-custom-4-16384"
  environment   = "prod"
}
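
The listing above references google_sql_database_instance.this without showing it. A minimal sketch of what modules/cloud-sql/main.tf might contain follows; the database version and region are assumptions, and the precondition is one possible way to enforce the "prod needs a larger machine type" rule, which the variable validation alone does not cover. Note that private_ip_address stays empty unless private networking is configured, which this sketch leaves out.

```hcl
# modules/cloud-sql/main.tf (sketch; values marked "assumed" are illustrative)
resource "google_sql_database_instance" "this" {
  name             = var.instance_name
  database_version = "POSTGRES_15"     # assumed engine and version
  region           = "asia-southeast2" # assumed region

  # Guard every environment except dev against accidental deletion.
  deletion_protection = var.environment != "dev"

  settings {
    tier = var.machine_type

    user_labels = {
      environment = var.environment
    }
  }

  lifecycle {
    # Evaluated at plan time: prod may not use the smallest approved size.
    precondition {
      condition     = var.environment != "prod" || var.machine_type != "db-custom-2-8192"
      error_message = "The prod environment requires a 4-vCPU or larger machine type."
    }
  }
}
```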

Variable Validation and Type Constraints

Terraform's validation blocks let you enforce constraints at plan time rather than apply time. Use them aggressively in your modules to catch misconfiguration before any resource is touched. Type constraints in variables.tf define what callers can pass — object types with required keys prevent callers from omitting critical configuration. Combine with default values only for truly optional settings — required security settings (encryption, backup retention) should have no defaults to force explicit decisions per environment.
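A minimal sketch of an object-typed input with required keys and no default, assuming a hypothetical backup variable on the cloud-sql module. Because there is no default, terraform plan fails until each environment passes a value, and the threshold of 7 is only an example policy.

```hcl
# modules/cloud-sql/variables.tf (hypothetical additional variable)
variable "backup" {
  description = "Backup settings. No default, so every environment must decide."

  # Both keys are required; omitting either one is a type error at plan time.
  type = object({
    enabled          = bool
    retained_backups = number
  })

  validation {
    condition     = var.backup.retained_backups >= 7
    error_message = "The backup.retained_backups value must be at least 7."
  }
}

# environments/prod/main.tf
module "database" {
  source        = "../../modules/cloud-sql"
  instance_name = "prod-db-01"
  machine_type  = "db-custom-4-16384"
  environment   = "prod"

  backup = {
    enabled          = true
    retained_backups = 30
  }
}
```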

Early in my Terraform adoption I referenced internal modules directly from the Git main branch: source = "git::https://github.com/myorg/tf-modules.git//modules/vpc". This seemed convenient until a team member merged a breaking change to main that immediately broke our staging apply. The fix: tag module releases (v1.0.0, v1.1.0) and reference specific versions: source = "git::...?ref=v1.1.0". Breaking changes get a major version bump. All environments pin to specific versions and update deliberately, not automatically.
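Written out in full, pinning might look like the sketch below, using the repository URL quoted above. The two blocks live in separate root modules, so staging can move to a new tag before prod does. The version argument only works for registry sources, so Git sources are pinned through ?ref=.

```hcl
# environments/staging/main.tf
module "vpc" {
  # "?ref=" accepts a tag, branch, or commit SHA; a tag keeps releases readable.
  source = "git::https://github.com/myorg/tf-modules.git//modules/vpc?ref=v1.1.0"
}

# environments/prod/main.tf (a separate root module, still on the older tag)
module "vpc" {
  source = "git::https://github.com/myorg/tf-modules.git//modules/vpc?ref=v1.0.0"
}
```

After changing a ref, run terraform init -upgrade in that environment so the new module version is downloaded.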

Testing Terraform Modules

Untested Terraform modules accumulate bugs silently. The minimum testing approach is running terraform validate and terraform plan against a real (but ephemeral) environment for every pull request. For more comprehensive testing, Terratest (a Go-based testing library) lets you write tests that provision real infrastructure, assert on state and outputs, and destroy everything after the test. For modules that manage expensive resources, keep unit-level tests cheap: LocalStack can emulate many AWS services, but GCP only ships emulators for a handful of services (Pub/Sub and Firestore, for example) and none for Cloud SQL or VPC networking, so plan-only checks are usually the practical option there. Reserve Terratest for integration tests that run on merge to main.
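Terraform's built-in terraform test command is not part of the workflow described above, but it is one way to get plan-only checks without cloud credentials. The sketch below assumes Terraform 1.7 or later (for mock_provider), a hypothetical test file inside the cloud-sql module, and a google_sql_database_instance.this resource whose settings.tier is set from var.machine_type.

```hcl
# modules/cloud-sql/tests/validation.tftest.hcl (hypothetical test file)
# Run from modules/cloud-sql with: terraform init && terraform test

# Replace the real google provider with a mock so no credentials are needed.
mock_provider "google" {}

run "accepts_approved_machine_type" {
  command = plan

  variables {
    instance_name = "test-db"
    machine_type  = "db-custom-4-16384"
    environment   = "dev"
  }

  assert {
    condition     = google_sql_database_instance.this.settings[0].tier == "db-custom-4-16384"
    error_message = "The instance tier should match the machine_type input."
  }
}

run "rejects_unapproved_machine_type" {
  command = plan

  variables {
    instance_name = "test-db"
    machine_type  = "db-f1-micro"
    environment   = "dev"
  }

  # The test passes only if the validation block on machine_type fails.
  expect_failures = [
    var.machine_type,
  ]
}
```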

┌─────────────────────────────────────────────────┐
│        Terraform Module Repository Layout       │
├─────────────────────────────────────────────────┤
│  tf-modules/                                    │
│  ├── modules/           (child modules)         │
│  │   ├── vpc/                                   │
│  │   │   ├── main.tf                            │
│  │   │   ├── variables.tf                       │
│  │   │   └── outputs.tf                         │
│  │   ├── cloud-sql/                             │
│  │   └── cloud-run/                             │
│  └── environments/      (root modules)          │
│      ├── dev/                                   │
│      ├── staging/                               │
│      └── prod/                                  │
└─────────────────────────────────────────────────┘

Remote State and State Locking

Modules without proper state management cause race conditions and corruption. Every environment root module should store state remotely with locking enabled. On GCP, use a GCS bucket with versioning enabled as the backend. The gcs backend handles state locking automatically by writing a lock file next to the state object in the bucket; this is unrelated to GCS retention policies or Bucket Lock. Each environment gets its own state file, so dev, staging, and prod state are completely isolated. Cross-environment state references use terraform_remote_state data sources (use sparingly, because this creates tight coupling between environments).
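A minimal sketch of this setup, assuming a single hypothetical bucket with one prefix per environment (separate buckets per environment would work equally well). The bucket name, the prefixes, and the network_id output are illustrative.

```hcl
# environments/prod/backend.tf
terraform {
  backend "gcs" {
    bucket = "myorg-tf-state" # hypothetical bucket with versioning enabled
    prefix = "env/prod"       # one prefix per environment keeps state isolated
  }
}

# environments/prod/data.tf
# Read-only access to the outputs of another root module's state.
data "terraform_remote_state" "shared_network" {
  backend = "gcs"

  config = {
    bucket = "myorg-tf-state"
    prefix = "env/shared-network" # hypothetical
  }
}

# Referenced elsewhere in the root module as:
#   data.terraform_remote_state.shared_network.outputs.network_id
```

Only root-level outputs of the other configuration are visible through the data source, which is another reason to output IDs and names generously.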

My Take: Module Registry vs Internal Modules

The Terraform Registry has hundreds of community modules for AWS, GCP, and Azure. I use them for common patterns where I do not have specific requirements — the Google Network module and the Google Cloud SQL module are well-maintained and save significant development time. I write custom internal modules only when community modules do not match our specific requirements (usually around IAM policies, naming conventions, or multi-region configurations specific to Indonesia/Singapore regions). The rule: use community modules as starting points, fork and customize only when necessary, and contribute improvements upstream when you can.
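For reference, consuming the community Google Network module from the registry might look like the sketch below. The version constraint, project ID, names, and CIDR range are assumptions; check the registry page for the current release and full input list before copying.

```hcl
# environments/prod/network.tf
module "network" {
  source  = "terraform-google-modules/network/google"
  version = "~> 9.0" # assumed constraint; pin to whatever release you have tested

  project_id   = "my-prod-project" # hypothetical
  network_name = "prod-vpc"

  subnets = [
    {
      subnet_name   = "prod-subnet-jakarta"
      subnet_ip     = "10.10.0.0/20"
      subnet_region = "asia-southeast2"
    },
  ]
}
```

Registry sources take a version argument, so the pinning advice for Git tags applies here through version constraints instead of ?ref=.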
