31 releases
0.8.4 | Nov 6, 2024 |
---|---|
0.8.3 | Jul 23, 2024 |
0.7.7 | May 27, 2024 |
0.7.0 | Dec 9, 2023 |
0.4.3 | Apr 16, 2021 |
#108 in Build Utils
207 downloads per month
Used in 2 crates
32KB
456 lines
git warp-time
A CLI utility (and Rust library) that resets the timestamps of files in a Git repository working directory to the exact timestamp of the last commit which modified each file.
For use as a Rust library, include in your Cargo.toml
as documented on the crates.io listing and use per the API documentation.
For use as a CLI utility, first check whether your distro has packages (e.g. Arch Linux).
Otherwise you can run this repository as a Nix Flake with nix run github:alerque/git-warp-time
, install just plain binary with cargo install git-warp-time
, or download the latest source release and use ./configure; make; make install
for a full installation that includes autcompletion for Zsh, Bash, Fish, Elvish, and PowerShell.
CLI usage
Run from inside any Git working directory after clone, after any checkout operation that switches branches, after rebases, etc.
$ git clone ‹your project›
$ cd ‹your project›
$ git-warp-time
For more usage see the --help
output:
$ git-warp-time --help
CLI utility that resets the timestamps of files in a Git repository working
directory to the exact timestamp of the last commit which modified each file
Usage: git-warp-time [OPTIONS] [PATHS]...
Arguments:
[PATHS]... Optional list of paths to operate on instead of default which is
all files tracked by Git
Options:
-d, --dirty Include files tracked by Git but modifications in the
working tee
-i, --ignored Include files tracked by Git but also ignored
-o, --ignore-older Only touch files that are newer than their history, ignore
ones that are older
-q, --quiet Don't print any output about files touched or skipped
-h, --help Print help
-V, --version Print version
Library Usage
In your Cargo.toml
file.
[dependencies]
git-warp-time = "0.8"
Then use the crate functions and types in your project something like this:
use git_warp_time::{get_repo, reset_mtimes};
use git_warp_time::{FileSet, Options};
let repo = get_repo().unwrap();
let mut paths = FileSet::new();
paths.insert("foo.txt".into());
let mut opts = Options::new();
opts.verbose(true).paths(Some(paths));
let files = reset_mtimes(repo, opts).unwrap();
println!("Actioned files: {:?}", files);
CI Usage
This may me run in a CI workflow on almost any CI platform either as a binary or using the Docker container found in the GitHub Container Registry (docker run ghcr.io/alerque/git-warp-time:latest
).
It is important to note that the Git repository needs to be checked out with depth.
A shallow clone will cause all timestamps to be as new as the oldest commit in the clone, i.e. newer than actual.
For GitHub Actions this looks like so:
steps:
- name: Checkout
uses: actions/checkout@v4
with:
fetch-depth: 0
- name: Git Warp Time
uses: alerque/git-warp-time:latest
The story
Whenever you git clone
a project or git checkout
a different branch, Git will write all the relevant files to your system at the moment you run the Git command.
Logical enough.
Git is doing the right thing.
For most use cases there is nothing wrong with the latest modification timestamp of a file to be the last time its state changed on your disk.
However many build systems rely on file modification timestamps to understand when something needs to be rebuilt. GNU Make is one example that relies entirely on timestamps, but there are many others. A few rely on checksums (e.g. SCons) and keep a separate database of ‘last seen’ file states, but since this requires extra storage most build systems use what is available. What is available without the build system storing its own state is your file system's meta data.
The rub happens when you take advantage of Git's cheap branching model.
Many workflows branch early and branch often.
Every time you git checkout <branch>
, your local working tree will be updated with the time of your checkout.
In some cases this will cause unnecessary rebuilds.
For many projects these rebuilds are required: if the state of all files at once doesn't match your project won't be built right. However for some projects, particularly those with multiple outputs, this might be a lot of wasted work.
A case study
I have one project with many hundreds of LaTeX files. The projects output is a directory of PDFs in a shared file repository (Nextcloud). There are currently 5.7 Gigs of PDF files. Each week this collection grows. Most of the files are completely independent of each other, but they all use a common template and some other includes. Most weeks I just add new files and building the project just adds a few more files to the output. Periodically I will change something in the template that will cause the entire output file set to regenerate. That process takes about 20 hours to complete.
Git is a distributed version control system and I should be able to work on this project from anywhere, but there is a problem.
If I clone the project to a new system, the source files are all newer than the existing outputs and the build system can't figure out what actually needs to be rebuilt.
One solution is to touch
every file in the project after a clone with a very old date.
That sledge hammer approach works well enough for clones, but any time I work in a feature branch things get messed up.
Returning from a feature branch that messes with the template to the master branch will cause the template file to be ‘new’ again and the whole project tries to rebuild.
This utility is a more elegant solution.
Running git warp-time
after any clone, checkout, rebase, or similar operation will reset all the timestamps to when they were actually last touched by a commit.
The result is project portability. I can clone the project on a new system and without any build state data except the existing output files the project knows what it does and doesn't need to rebuild.
When not to use this
Not all Git projects will benefit from git warp-time
.
- If your build system doesn't use timestamps, this won't help you.
- If your project generates a single output such as a binary, you probably need to rebuild when any of the inputs change so this won't help much.
- If your build system supports incremental builds and you do creative things in your branches, you might completely confuse your build system and cause incomplete builds.
Dependencies
~11–23MB
~394K SLoC