Take care: Work In Progress here. This article might be incomplete or contain errors. Know something we don't? Help us! Contribute here on GitHub

rsync

rsync is one of the best ways to deploy code fast and without hassle. It's also an often overlooked option. Let's change this! This article gives you some direction on how to use it in general and especially here on fortrabbit.

About rsync

rsync is a shorthand for remote synchronization. It's a command line tool to synchronize files over the network. It's open source. It's old but really good and it's is up to 10 times faster than FTP as it uses compression and diffs to only transfers changes. It works on top of SSH, so it's also secure and extra convenient when using SSH key auth. Usually, like most deployment related tasks here, you will use rsync from your local machine, not on your fortrabbit App directly.

Installing rsync

Chances are that you already have it: rsync is built-in with Linux and macOS. Check if it is installed. Run this command in the Terminal of your local machine:

$ which rsync
# If installed, it will output the version number.

For Windows 10 we recommend to install the Linux subsystem (WSL). For Windows 7 or even below you might use cwRsync which also requires Cygwin. There are some other clones and desktop GUI clients around as well. But don't be afraid of the Terminal, it's easier than you might think.

Use cases

There are multiple ways to deploy here on fortrabbit: with Git, with SFTP and/or SSH. You can hook in rsync in most deployment work-flows, either as an enhancement or as a replacement. It just doesn't really work well with Professional Apps as those have ephemeral storage and no direct SSH access. These are your main options for using rsync to deploy code to fortrabbit Universal Apps:

rsync instead of SFTP

Not using Git and still using SFTP? Consider rsync as a replacement. With SFTP - unless your SFTP client has some kind of synchronization method (which still will be slower) - you will copy each files manually, one by one. This is mundane and can also be dangerous when forgetting to copy critical files. rsync can work as a two way street directly on the file system. Easily synchronize files up and down from your local development to the App.

rsync in addition to Git

Using Git to deploy? Consider rsync as an essential addition. Why? Your dependencies are managed with Composer and thus excluded from Git. They will be installed and managed with Composer. So you are keeping your Git repo clean by just including the source files of your very own code. But there is more. Your project includes run time data and static assets:

Deploy bundled frontend builds with rsync

You are likely making use of some kind of frontend bundling process - a build tool like webpack, Brunch, Parcel, browserify, gulp.js or alike. That includes compiling, concatenating and minifying of Javascript, SASS, LESS, stylus and maybe also images. Most modern build tools are based on Javascript and Node.js. fortrabbit Apps do not support to trigger such a frontend builds. So we advice to run the production build process locally. That has the advantage that you can optimally debug errors. But you also need to copy those files to your remote App somehow!

Option 1: Just include built files in Git! You can just include the uglyfied files withing Git and deploy it along with the rest of the code. That's not clean, but can be practical when your build process is not that complex. Mind that you might have a development and a production build tasks.

Option 2: Deploy separately. Ideally, those bundled binary-like files should be excluded from Git. Deployment is faster when the Git repo is small. And only uncompressed text can be "diffed". Those one-line files can not be tracked for changes. You can upload those files manually or even better use rsync to upload those bundles.

See our old "I love assets" blog post on that matter as well.

Synchronize uploaded assets with rsync

Also, there likely will be files, like image uploads and alike on the Apps file system itself which will not be reflected in Git. The only way to get those files back into your local development so far was to SFTP or SSH in and grab the files manually. So, here, assuming that you have a live application where content editors are changing files on the App itself, you likelly would like to download certain files from the App into your local development.

PRO TIP: For Craft CMS users we have developed Craft Copy, which besides other nice features, also incorporates rsync to keep the assets in sync.

Using rsync

Hooked? So let's jump into your local terminal:

The rsync command structure

# sync all files UP
# 
#      options
#       ─┴─
$ rsync -av ./ {{app-name}}@deploy.{{region}}.frbit.com:~/
#           ─┬─ ──────────────────┬────────────────────────
#          source           destination
  • options: How to sync, see below.
  • source: This is your local source directory.
  • destination: The target URL, where the files should end up.

Common usage

Here are the most common rsync commands. Likely you will even come by using only the first two:

# DOWN: from remote to local
$ rsync -av {{app-name}}@deploy.{{region}}.frbit.com:~/ ./

# UP: from local to remote
$ rsync -av ./ {{app-name}}@deploy.{{region}}.frbit.com:~/



# LOCAL: two local folders
$ rsync -av ~/projects/{{app-name}}/ ~/projects/{{app-name}}.copy/

# REMOTE TO REMOTE: from App to App
$ rsync -av {{app-name}}@deploy.{{region}}.frbit.com:~/ {{app-name-2}}@deploy.{{region}}.frbit.com:~/

Remote paths

Remote URLs consist of {{user}}@{{host}}:{{folder}}. In the examples here fortrabbit App placeholders are used.

Local paths

rsync accepts all kind of defining local paths. ./ will translate to the current directory. You can also use absolute paths like /home/your-user/projects/{{app-name}} or relative paths like ../{{app-name}}.

Options

Here are some common options to alter the sync mode:

Option Description
-a Shorthand for --archive, like the set of options: -rlptgoD.
-v Verbose output shows all transmitted files and statistical data. Increase verbosity using -vv or -vvv
-n Shorthand for --dry-run. See below.
-r Recursive, so all files and directories below the source directory.
-l Keep symbolic links.
-p Permissions will be synchronized.
-t Preserve modification times.
-g Set Unix group of file/folder on destination to group in source. Also: use group as check criteria
-o Set Unix group of file/folder on destination to group in source. Also: use group as check criteria
-c Instead of modification time and size, use checksum of the file contents. Use with caution when modification time on destination is not reliable.
-C Shorthand for --cvs-exclude. Exclude version control files like .git, .hg, .svn.
-h Human readable output: display byte sizes in MiB, GiB instead of plain bytes.
-delete Remove unused files. See below
-exclude Omit files from being synced. See below

For an exhaustive list of all the possible options and more in depth info on the above options, check out the official rsync man page. Mind that rsync options can be chained, rsync -av combines the two flags -a and -v.

Previewing changes

The handy --dry-run option can be shortened to just -n and also be merged with other options to something like -avn. It will give you a preview of what will happen before doing anything:

$ rsync -avn ./ {{app-name}}@deploy.{{region}}.frbit.com:~/
sending incremental file list
./
index.php
wp-content/themes/your-theme/404.php
wp-content/themes/your-theme/archive.php
wp-content/themes/your-theme/content-link.php

sent 39,119 bytes  received 196 bytes  11,232.86 bytes/sec
total size is 23,325,044  speedup is 593.29 (DRY RUN)

Now, running this will print out everything that rsync would transfer - without doing anything. Better always execute a dry run before actually syncing. Once you're sure, that only files which you want to transfer are in the change set, you can just remove the n again from the options and execute it normally.

Syncing single folders

This is the example for using rsync as an addition when primarily working with Git. In this case you only want to include a specific folder and it's contents. You might want to "upload" your dist folder with your compiled CSS, JS and images. Or you want to "download" the assets folder, when an editor has uploaded new images to the CMS. This is the basic command:

$ rsync -av --include='/{{folder}}/***' --exclude='*' ./ {{app-name}}@deploy.{{region}}.frbit.com:~/

With the --include parameter you can specify the path to include, still you need to exclude everything else. The include exclude syntax is flexible as you can include patterns and multiple folders at once. Alternatively you can also just define the local folder and the remote folder.

Excluding files

Sometimes you want to omit files from being synced. You can just add --exclude=path/to/file. Say we don't want the 404.php from the previous example to be transferred, you would just do:

# use absolute path, as viewed from the source root
$ rsync -avC --exclude wp-content/themes/your-theme/404.php ./ {{app-name}}@deploy.{{region}}.frbit.com:~/

The value of --exclude is a pattern. It can be matched against the files to be transferred. In this case, the following patterns will have the same effect:

# use partial path
$ rsync -avC --exclude themes/your-theme/404.php ./ {{app-name}}@deploy.{{region}}.frbit.com:~/

# use smallest possible partial path
$ rsync -avC --exclude 404.php ./ {{app-name}}@deploy.{{region}}.frbit.com:~/

NOTE: The initial / character is important. --exclude 404.php and --exclude /404.php are not the same. The former means: Any path, which contains "404.php" is to be excluded. The latter means: Any path, which starts with "/404.php" is to be excluded.

Advanced exclude patterns

You can also use wildcard characters. For example:

$ rsync -av --exclude "*.jpg" --exclude "*.jpeg"` ...
#  exclude all JPEG files

$ rsync -av --exclude "/wp-content/themes/*/404.php" ...
# exclude all files, which start with `/wp-content/themes`, followed by an arbitrary name (no slashes! so only one level of sub directory!) and ending in `404.php`. So basically: All `404.php` files of all themes.

$ rsync -av --exclude "themes/**/*.css" ...
# exclude all files, which path name contains `themes/` then followed by anything (including any amount of sub directories) and ending in `.css`. So all `.css` files in all Themes.

Also, as you can see, in the JPEG example, you can add any amount of --exclude options to the command.

Remembering excludes in a file

If you have a set of files which you always want to exclude, you can create an file containing all exclusions and then use it via --exclude-from <file>:

# add two excludes to a plain text file named "excludes"
$ echo 404.php >> .rsyncignore
$ echo something-else.php >> .rsyncignore

# run rsync, using the excludes file
$ rsync -av --exclude-from .rsyncignore ./ {{app-name}}@deploy.{{region}}.frbit.com:~/

NOTE: The file name .rsyncignore is just an example name, use any name you want for your excludes.

There is still a lot more you can do with exclude and filtering. Not only is there --include, which allows to granulate previous --exclude patterns, but there is also --filter. Here is a very interesting blog post by Ira Cooke.

Dealing with obsolete files

rsync will work in a non-destructive way per default, like our overwrite but not delete strategy. So new files will be written and replaced, but old files will not get deleted. Sometimes that's not what you want. Sometimes you want an exact copy - a mirror.

So, how to remove obsolete files on the remote? The short answer is: add the option --delete to your command line and you are done. To give you and example, using the WordPress setup from before: say you deleted this pesky 404.php file locally. Now, if you run rsync without the --delete option (and no other added or modified files), rsync would tell that it will do nothing:

$ rsync -av ./ {{app-name}}@deploy.{{region}}.frbit.com:~/
sending incremental file list
wp-content/themes/your-theme/

sent 39,037 bytes  received 162 bytes  15,679.60 bytes/sec
total size ...

Although it marks the folder wp-content/themes/your-theme/, as there have been changes (the removal of 404.php), but no changes which rsync is gonna apply. Now, running with the --delete option, then the file will be removed from destination. This is a feature, not a bug, meaning: rsync won't let you down by deleting files without your say-so. Either way, the first delete run, as always, using the condensed form -n of the --dry-run option, will show you exactly what would be deleted:

$ rsync -avn --delete ./ {{app-name}}@deploy.{{region}}.frbit.com:~/
sending incremental file list
deleting wp-content/themes/your-theme/404.php

sent 39,062 bytes  received 231 bytes  15,717.20 bytes/sec
total size ...

After you confirm that rsync would only delete, what you want (otherwise: --exclude works also to exclude files which are not in your local file set but remote, and you don't want to remove them from remote), you can go ahead and remove the -n option and run again.

Now, rsync wouldn't be if it would give you not at least four different ways to handle deletes: Besides the --delete flag, there is also --delete-before, --delete-after, --delete-during and --delete-delay (and --delete-excluded, but that's another special case in it's own). Those four variants of --delete just let you control when files are remove. This is actually quite handy: When thinking larger amounts of changed files to a live website, you might want to use --delete-after instead of --delete-before, so that first all new files are in place, then obsolete files are removed, which makes it more likely that your website is not "interrupted", when handling a request during the synchronization, which relies on files which would be removed.. I think you get the gist.

Advanced topics

SSH edge-cases

Should you need to set specific SSH options, for example, if you need to provide a specific private key, then you can use the --rsh option, which stands for "remote shell" and can be shortened to -e. Here an example:

# use specific private key
$ rsync -av -e 'ssh -i /path/to/your/key' {{app-name}}@deploy.{{region}}.frbit.com:~/ ./

# enforce password authentication
$ rsync -av -e 'ssh -o PreferredAuthentications=password' {{app-name}}@deploy.{{region}}.frbit.com:~/ ./

How rsync transfers only changes

Say you have changed ten files in your local code set and want to deploy them now. rsync will first build a local set of files and directories and a remote set of files and directories. For each item in either set it will generate a check value. This check value, can be either the timestamp of the last change of a file, the size of a file, the current permissions or even a checksum (think MD5) of the file contents. Or any combination of those. Using the -a option rsync is gonna use timestamp + file size which is a good balance between performance and accuracy.

Further readings

This article is loosely based on our still popular blog post which is even more complete and includes more details. Also of interest:

Further readings

Craft CMS

Install guides

Develop & deploy

Teamwork

Platform

Stacks

Need individual help?
Learn about Company plans ›
Looking for an old article?
See the full list of articles ›
Found an error?
Contribute on GitHub ›