18 stable releases (3 major)

Uses old Rust 2015

3.4.0-alpha0 Mar 23, 2017
3.3.0 Mar 23, 2017
3.2.2 Jan 7, 2017
3.0.0 Dec 17, 2016
0.1.0 Sep 13, 2016

#627 in Filesystem

Apache-2.0

2MB
14K SLoC

Contains (ELF exe/lib, 5.5MB) etc/legacy-wrappers/rzbackup

RZBackup

http://rzbackup.com

James Pharaoh james@pharaoh.uk

This is a partial Rust clone of ZBackup, along with some unique features of its own.

This project is free software and available under the [Apache 2.0 licence] (https://www.apache.org/licenses/LICENSE-2.0).

Binaries for ubuntu are available can be downloaded [here] (https://dist.wellbehavedsoftware.com/rzbackup/).

Online documentation is automatically generated by docs.rs (https://docs.rs/rzbackup/).

List of features

  • Rust library for access to ZBackup repositories
  • Supports encrypted and non encrypted formats
  • Multi-threaded restore and configurable, multi-tier chunk cache
  • Client/server utilities to efficiently restore multiple backups with a shared chunk cache
  • RandomAccess implements Read and Seek to provide efficient random access for rust applications
  • Balance tools to redistribute data in index and bundle files, typically useful to obtain a smaller number of consistently-sized files
  • Thorough garbage collection tools to clean up indexes and chunks which are no longer in use
  • Check tools, to verify the integrity of the repository in various ways, and repair simple problems
  • Command line decrypt utility, mostly useful for debugging

Notable missing features

  • No facility to create backups, these must be performed with the original ZBackup tool

Library usage

In cargo.toml:

[dependencies]
output = "0.5"
rzbackup = "3.3"

A basic example (from examples/restore.rs):

extern crate output;
extern crate rzbackup;

use std::env;
use std::ffi::OsString;
use std::io;

use rzbackup::Repository;

fn main () {

	let output =
		output::open ();

	let arguments: Vec <OsString> =
		env::args_os ().collect ();

	let repository =
		Repository::open (
			& output,
			Repository::default_config (),
			& arguments [1],
			if arguments [2] != "" {
				Some (& arguments [2])
			} else { None },
		).unwrap ();

	let stdout =
		io::stdout ();

	let mut stdout_lock =
		stdout.lock ();

	repository.restore (
		& output,
		arguments [3],
		& mut stdout_lock,
	).unwrap ();

}

Command usage

Restore

The restore command is able to perform a one-off restore. It is basically equivalent to ZBackup's own restore command. The restored content is piped to stdout.

rzbackup restore \
	--repository REPOSITORY \
	[--password-file PASSWORD-FILE] \
	--backup-name BACKUP-NAME

Server

The server process listens for client connections and streams backups over a socket. It has a large cache and so will be more efficient than running separate restore processes for backed up data with lots of shared deduplicated content.

rzbackup server \
	--listen-address HOST:PORT \
	--repository REPOSITORY \
	[--password-file PASSWORD-FILE] \
	... (lots more options, try --help)

Client

The client connects to the server and streams a backup to standard output. It can also tell the server to reload its indexes, which will be necessary if new backups have been made.

rzbackup client reindex \
	--server-address HOST:PORT
rzbackup client restore \
	--server-address HOST:PORT \
	--backup-name BACKUP-NAME

Convert

The convert tool makes low-level changes to the repository. It is able to balance both index and bundle files, changing the number of entries they contain. It can perform garbage collection, removing index entries and chunk data which is no longer referenced by any backups. It is able to perform various consistency checks.

Balance bundles

rzbackup convert balance-bundles \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE \
    [--chunks-per-bundle CHUNKS-PER-BUNDLE] \
    [--fill-factor FILL-FACTOR]

The balance-bundles tool balances the number of chunks contained in bundle files. This consists of reading all index files in order to determine which bundles contain more or less than the desired number of chunks, then reading those bundles and writing them out to new bundles with the required number of chunks in each one.

This writes out a new index file for each bundle, currently, so you may want to follow this by running balance-indexes.

The number of chunks per bundle is configurable on the command line, this currently defaults to 256, which is considerably higher than zbackup's own 32. The fill factor defaults to 25, this is a percentage of the chunks per bundle which will determine the minimum percentage of the total chunks which a bundle should have in order to be left as-is.

Balance indexes

rzbackup convert balance-indexes \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE \
    [--bundles-per-index BUNDLES-PER-INDEX]

The balance-indexes tool balances the number of bundles indexed in individual index files. This consists of reading all indexes to determine which have more or less than the desired number of bundles, then writing out the information about those bundles to new index files with the desired number of bundles in each one.

The number of bundles per index is configurable on the command line, this currently defaults to 16384. ZBackup defaults to writing out a single index file every time a new backup is created (unless there are no bundles written), so index files vary in size and there are typically a large number of small indexes and a small number of large ones in a well-used repository.

Since, in general, every index file is loaded before doing any work on a repository, it makes sense to have fewer files, in order to reduce the overhead of opening a large number of files. This also makes synchronisation, eg to a remote location, more efficient.

Check backups

rzbackup convert check-backups \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE \
    [--move-broken] \
    [--backup-name-hash-prefix] PREFIX

The check-backups command verifies all backups can be restored, by verifying that all the required chunks are present in the indexes. It can optionally move any backups which cannot be restored from backups to backups-broken, and it can operate on a subset of backups by specifying a prefix which is matched against the SHA1 sum of the backup name, eg /dir/filename for a backup in /backups/dir/filename.

It is a good idea to run rebuild-indexes before running this command, so that the state of the indexes is an accurate representation of the available chunks.

Check bundles

rzbackup convert check-bundles \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE \
    [--bundle-name-prefix] PREFIX

The check-bundles tool simply reads every bundle in its entirety, applying consistency checks to the data they contain. This will verify the adler32 checksums after the bundle info at the start of the file and at the end of the file, and also checks that each chunk's data matches its truncated SHA1 sum which makes up the first 16 bytesof its chunk id. It also, implicitly, verifies that the compression and encryption are intact, since these are required to read the bundle.

Since this can take a very long time, a bundle name prefix can be specified, and only bundles whose names start with this string will be checked. This can be used, for example, to check a subset of bundles on a nightly basis.

This is useful to detect corruption in bundle files, which can then be restored from backups, if those are available. If backups are not available, then bundles can be removed, and these can be removed from the indexes using rebuild-indexes, preventing any future backups from being creating referencing these missing chunks.

Backups which are already broken can be identified by running check-backups after running rebuild-indexes. backups can be identified.

Check indexes

rzbackup convert check-indexes \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE \
    [--repair]

The check-indexes tool reads each backup file, building an index of all chunks which they reference, and then reads index files, reporting any missing or duplicated chunks. If the --repair option is specified, it will rewrite the index files to remove these chunks.

Garbage collect indexes

rzbackup convert gc-indexes \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE

The gc-indexes tool performs garbage collection on index files. This consists of identifying all the chunks which are referenced by backups, then rewriting the index files to remove any chunks which are no longer needed.

Note that this will not make any changes to bundles, so you probably want to then run gc-bundles, to remove the chunks which are no longer needed.

Garbage collect bundles

rzbackup convert gc-bundles \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE

The gc-bundles tool performs garbage collection on bundles. This consists of identifying all chunks which are referenced in index files, then rewriting the bundles to remove any chunks which are no longer needed.

This will also remove any chunks which are duplicated. These will not be removed from the index, which can cause problems with restore operations which reference the duplicated chunks, but this can be remedied with the rebuild-indexes tool.

Rebuild indexes

rzbackup convert rebuild-indexes \
    --repository REPOSITORY \
    --password-file PASSWORD-FILE

The rebuild-indexes tool scans all bundle files, reading only the chunk info in their header, then builds a completely new set of index files based on the information contained there. This is a useful way to ensure that the index files are correct, and can fix a lot of problems caused by various types of corruption.

This is important to ensure that restore operations can take place correctly when corruption has occurred, and also may be required for some of the other tools to work correctly, since they mostly rely on the index files only.

Decrypt

This is mostly useful for debugging. It allows you to show the decrypted contents of any backup, index or bundle file in a ZBackup repository.

rzbackup decrypt \
	--repository REPOSITORY \
	--password-file PASSWORD-FILE \
	--encrypted file ENCRYPTED-FILE

Dependencies

~9.5MB
~165K SLoC