You audit the source code. You review the commit history. You check the licenses. Then you download a pre-built binary and trust that it was actually built from that source code. That last step is where the entire software supply chain verification model breaks down.
Reproducible builds address this directly: given the same source code, build environment, and build instructions, the output should be bit-for-bit identical every time. If it is, anyone can verify that a distributed binary corresponds to its claimed source. If it isn't, you're trusting the build infrastructure — and the SolarWinds attack proved exactly how dangerous that trust can be.
The Verification Gap
Consider the typical open source distribution model:
1. Maintainer writes code and pushes to a public repository
2. Maintainer (or a CI system) builds the code into a distributable artifact
3. The artifact is published to a package registry (npm, PyPI, Maven Central)
4. Consumers download and use the artifact
Steps 1 and 4 are visible. Steps 2 and 3 are opaque. The build might happen on the maintainer's laptop, on a CI server, or on compromised infrastructure. The registry distributes whatever it receives. Nobody independently verifies that the published artifact matches the public source.
This gap has been exploited in practice:
- SolarWinds/SUNBURST — Attackers compromised the build system and injected malicious code during compilation. The source repository was clean; the distributed binary was not.
- event-stream — The malicious flatmap-stream payload was published to npm; part of it existed only in the minified package, not in the repository people could review, demonstrating how build artifacts are consumed without scrutiny.
- Codecov — The Bash uploader script distributed to users was modified on the distribution server, not in the source repository.
Reproducible builds would have made each of these attacks detectable. If independent parties could rebuild from source and compare the result to the distributed artifact, any discrepancy would be a red flag.
What Makes Builds Non-Reproducible
Achieving bit-for-bit reproducibility is harder than it sounds. Common sources of non-determinism in build processes:
Timestamps
Compilers, archivers, and packaging tools frequently embed build timestamps in their output. Two builds of the same source at different times produce different artifacts. ZIP, JAR, and tar.gz formats often include file modification timestamps.
Fix: Use SOURCE_DATE_EPOCH — a standardized environment variable that tools can use instead of the current time. Set it to the timestamp of the last commit.
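As a sketch, assuming a git checkout and GNU tar (which honors --mtime and --sort), wiring SOURCE_DATE_EPOCH into an archive step might look like:

```shell
# Derive SOURCE_DATE_EPOCH from the last commit's timestamp;
# fall back to 0 outside a git checkout (illustrative fallback).
export SOURCE_DATE_EPOCH="$(git log -1 --pretty=%ct 2>/dev/null || echo 0)"

# GNU tar can clamp every entry's mtime to that value and fix entry
# order, so the archive no longer depends on when it was built.
mkdir -p src && printf 'hello\n' > src/file.txt
tar --mtime="@${SOURCE_DATE_EPOCH}" --sort=name \
    --owner=0 --group=0 --numeric-owner -cf out.tar src/
```

Running the same command twice should now produce byte-identical archives.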
File System Ordering
When a build tool traverses a directory, the order of files depends on the filesystem. ext4, NTFS, and APFS may return different orderings. If the build output depends on iteration order — for example, the order of files added to an archive — the result is non-deterministic.
Fix: Sort directory listings explicitly. Build tools should never depend on filesystem enumeration order.
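A minimal illustration, assuming GNU find, sort, and tar: enumerate files explicitly, sort with a fixed locale, and feed the stable list to the archiver instead of letting it walk the directory itself.

```shell
# Create a small tree (illustrative names).
mkdir -p pkg/a pkg/b
printf 'A\n' > pkg/a/one.txt
printf 'B\n' > pkg/b/two.txt

# Enumerate and sort with a fixed locale so ordering never depends
# on filesystem enumeration or the user's collation settings.
find pkg -type f | LC_ALL=C sort > filelist.txt

# Archive exactly that list, in exactly that order.
tar --mtime=@0 --owner=0 --group=0 --numeric-owner \
    --no-recursion -cf pkg.tar -T filelist.txt
```

LC_ALL=C matters: locale-aware sorting can itself differ between machines, reintroducing the non-determinism you just removed.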
Build Paths
Compilers often embed the build path in debug information, symbol tables, or error messages. Building in /home/alice/project produces a different artifact than building in /home/bob/project.
Fix: Use path remapping flags (-ffile-prefix-map in GCC, --remap-path-prefix in Rust) or build in a canonical path.
Randomness and Hashing
Some build tools use random seeds for hash table layouts, symbol ordering, or optimization decisions. Different random seeds produce different outputs.
Fix: Pin random seeds where possible. Use deterministic algorithm variants.
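Python's per-process hash randomization is a handy demonstration of seed pinning (assuming python3 is available): set iteration order can vary between interpreter runs unless PYTHONHASHSEED is fixed.

```shell
# With the hash seed pinned, set iteration order is stable across
# runs; left unset, it can change from run to run.
run_once() { PYTHONHASHSEED=0 python3 -c "print(list({'gcc', 'make', 'tar'}))"; }
first="$(run_once)"
second="$(run_once)"
echo "$first"
echo "$second"
```

The same principle applies to any build tool with a seedable source of randomness: pin the seed as part of the build configuration, not the environment.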
Compiler and Toolchain Versions
Different versions of the same compiler produce different output. Even minor version differences can change optimization decisions and code generation.
Fix: Pin exact toolchain versions. Use containerized build environments with locked tool versions.
Parallelism and Race Conditions
Parallel builds can produce different output depending on the order in which parallel tasks complete. If build step order affects the final artifact (such as archive entry ordering), parallelism introduces non-determinism.
Fix: Ensure that build step ordering doesn't affect the final artifact. Post-process outputs to normalize ordering.
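One concrete normalization, assuming GNU binutils: collect the member files in sorted order and use ar's deterministic mode (the D modifier), which zeroes per-member timestamps, UIDs, and GIDs, so the library is identical no matter which parallel compile finished first.

```shell
# Stand-ins for objects produced by parallel build steps
# (plain files here; ar archives any file contents).
mkdir -p build
printf 'obj1\n' > build/a.o
printf 'obj2\n' > build/b.o

# Sort the member list explicitly, then archive in deterministic
# mode: 'D' zeroes timestamps/uid/gid so only content matters.
find build -name '*.o' | LC_ALL=C sort | xargs ar rcD libdemo.a
```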
The State of Reproducible Builds
Several major projects and ecosystems have made significant progress toward reproducible builds:
Debian
The Debian project has been the leader in reproducible builds for Linux distributions. As of 2022, over 95% of Debian packages build reproducibly. The project developed many of the core tools and techniques used by other ecosystems, including the diffoscope tool for identifying reproducibility differences and the SOURCE_DATE_EPOCH specification.
Bitcoin Core
Given the stakes — billions of dollars in cryptocurrency secured by the Bitcoin Core software — reproducible builds are a security-critical requirement. Bitcoin Core uses Guix-based builds that produce deterministic outputs across independent build environments. Multiple developers independently verify each release by rebuilding and comparing hashes.
Tor Browser
The Tor Browser has used reproducible builds since 2013, motivated by the need to distribute trusted software to users in adversarial environments. Users in authoritarian regimes need confidence that the browser they download hasn't been backdoored — and reproducible builds provide that assurance without requiring trust in the build infrastructure.
Android Verified Boot
Google has invested in reproducible builds for Android system components, allowing device manufacturers and security researchers to verify that pre-installed system software matches its source.
Language Ecosystems
Progress varies significantly across programming language ecosystems:
- Go — Relatively good reproducibility due to static linking and deterministic compilation, though cgo introduces challenges
- Rust — Active work on reproducibility with --remap-path-prefix and deterministic build flags
- Java — Maven and Gradle builds are improving, but JAR file metadata remains a common source of non-determinism
- JavaScript — npm packages are typically distributed as source, which simplifies the problem but doesn't eliminate it (minified/bundled artifacts can differ)
- Python — Wheel builds have reproducibility challenges around metadata and compilation of C extensions
Implementing Reproducible Builds
Start with Detection
Before fixing reproducibility, measure it. Build the same commit twice in the same environment and compare the outputs:
```shell
# Build twice
make clean && make > /dev/null 2>&1
cp -r dist/ dist-first/
make clean && make > /dev/null 2>&1
cp -r dist/ dist-second/

# Compare
diff -r dist-first/ dist-second/

# For binary comparison:
sha256sum dist-first/* dist-second/*
```
If the hashes differ, use diffoscope to identify exactly what changed:
```shell
diffoscope dist-first/app.jar dist-second/app.jar
```
diffoscope recursively unpacks archives and compares contents, identifying the specific sources of difference — timestamps, build paths, ordering differences, etc.
Containerize Build Environments
Docker or OCI containers provide a consistent, versionable build environment:
```dockerfile
# Pin the base image by digest, not by tag
FROM ubuntu:22.04@sha256:abc123...
RUN apt-get update && apt-get install -y \
    gcc=12.2.0-14 \
    make=4.3-4.1build1
ENV SOURCE_DATE_EPOCH=0
WORKDIR /build
```
Pin the base image by digest (not tag) and pin every tool to an exact version. This ensures that the build environment is identical across machines and over time.
Normalize Build Outputs
Post-processing steps can normalize non-deterministic elements:
- Strip timestamps from ZIP/JAR entries
- Sort archive contents alphabetically
- Normalize line endings
- Remove or canonicalize debug information
The strip-nondeterminism tool from the Debian project handles many common cases automatically.
Publish Build Attestations
Once your builds are reproducible, publish attestations that allow others to verify them:
- Publish the hash of the build output alongside the artifact
- Document the exact build environment (container image digest, tool versions)
- Provide build instructions that anyone can follow to reproduce the artifact
- Use in-toto or SLSA attestation formats for machine-readable build provenance
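The first two bullets can be as simple as a checksums file generated at release time (filenames here are illustrative); consumers re-run the check after downloading:

```shell
# Producer side: hash every distributed artifact.
mkdir -p dist
printf 'binary-contents\n' > dist/app.bin
( cd dist && sha256sum app.bin > SHA256SUMS )

# Consumer side: verify the download against the published hashes.
( cd dist && sha256sum -c SHA256SUMS )   # prints "app.bin: OK"
```

This only proves the download matches what the publisher hashed; it is reproducibility that lets third parties check the hash against a rebuild from source.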
The Relationship to SLSA
The SLSA (Supply-chain Levels for Software Artifacts) framework explicitly addresses build integrity:
- SLSA Level 1 — Documentation of the build process
- SLSA Level 2 — Build service generates authenticated provenance
- SLSA Level 3 — Hardened build platform with tamper-resistant provenance
- SLSA Level 4 — Hermetic builds with two-person review; reproducible builds are strongly encouraged (this level comes from the original v0.1 spec, while SLSA v1.0 defines only Levels 1 through 3)
Reproducible builds are the capstone of SLSA's build integrity track. They're the mechanism that makes all the lower levels verifiable. Without reproducibility, provenance attestations are assertions you have to trust. With reproducibility, they're assertions you can verify.
Practical Trade-offs
Reproducibility is a spectrum, not a binary: full bit-for-bit identity across every platform is the far end, not the starting requirement. Some pragmatic considerations:
- Start with your most critical outputs. The signed artifact you distribute to customers matters more than internal development builds.
- Reproducibility within a defined environment is valuable. If your build is reproducible within a specific container image, that's meaningful even if it's not reproducible across arbitrary Linux distributions.
- Track reproducibility as a metric. Some non-determinism is hard to eliminate. Track your reproducibility rate and improve it over time rather than treating it as all-or-nothing.
- Use CI to enforce reproducibility. Build the same commit twice in CI and fail the pipeline if the outputs differ.
How Safeguard.sh Helps
Safeguard integrates with your build pipeline to verify build artifact integrity as part of your software supply chain security program. By generating SBOMs at build time and correlating them with published artifacts, Safeguard provides the transparency layer that connects source code to distributed software. The platform's continuous monitoring detects unexpected changes in build outputs, flagging potential tampering or non-deterministic build issues before compromised artifacts reach your customers. Combined with Safeguard's vulnerability tracking across your full dependency graph, reproducible builds become part of a comprehensive verification chain — from source code through dependencies through build process to final artifact.