Getting Started with Singularity and Singularity Hub
So you want to put your scientific analysis in a nice package and run it on a cluster? You’ve probably heard of the container technology called Docker?
…too bad you can’t use it on your research cluster, because it introduces huge security issues. You’ve probably also heard of virtual machines, but most clusters won’t let you run those either. What options does this leave us? Oh wait, duh, you are reading this website. You already know the answer to this question.
Getting Started
Install Singularity
The easiest thing to do is to install Singularity on your local workstation:
sudo apt-get update
sudo apt-get -y install build-essential curl git sudo man vim autoconf libtool
git clone https://github.com/singularityware/singularity.git
cd singularity
./autogen.sh
./configure --prefix=/usr/local
make
sudo make install
If you are using a Mac, or just need a virtual machine, then you will want to follow the instructions here. Basically, you need to install vagrant, virtual box, and then do this:
vagrant init ubuntu/trusty64
vagrant up
vagrant ssh -c /bin/sh <<EOF
sudo apt-get update
sudo apt-get -y install build-essential curl git sudo man vim autoconf libtool
git clone https://github.com/singularityware/singularity.git
cd singularity
./autogen.sh
./configure --prefix=/usr/local
make
sudo make install
EOF
vagrant ssh
Once you are in your Virtual Machine, or have Singularity up and running? Well, it’s time to go NUTS of course!
A little about Singularity Hub
Singularity Hub is an online registry for images. This means that you can connect a GitHub repo containing a build specification file to this website, and the image is going to build for you automatically, and be available programmatically! We can talk more about how that happens later. If you want some quick details, you should check out the Usage Docs on Singularity Hub.
Make and run containers
Run an image
For this little preview, we are going to be first running an image, directly from Singularity Hub. This image is called vsoch/singularity-images and it’s associated with the equivalent GitHub repository.
singularity run shub://vsoch/singularity-images
In the above, we use the Singularity Hub “unique resource identifier,” or uri
, shub://
which tells the software to run an image from Singularity Hub.
Create an image
Running is great, but what if we want to mess around on the command line, using an image we’ve created ourselves? We can do that by creating an image:
sudo singularity create analysis.img
sudo singularity import analysis.img docker://ubuntu:latest
singularity shell analysis.img
In the above, we use the docker “unique resource identifier,” or uri
, docker://
which tells the software to import a docker image.
If we wanted to shell into the image and make it writable, meaning that we can write files and save changes, we would do this:
sudo singularity shell --writable analysis.img
Note that we need sudo, and also note that you wouldn’t be able to do this on a research cluster, because you don’t have sudo.
Create a reproducible image
The problem with create an image, and then maybe writing stuff to it with --writable
is that your work isn’t properly saved anywhere. You COULD ship and share the entire image, but that still doesn’t help to say what was done to it, and this is problematic. To help with this, we encourage you to create a build specification file, a file called Singularity
. There are a few important sections you should know about. First, let’s look at a very simple file:
Bootstrap: docker
From: ubuntu:latest
%runscript
echo "I can put here whatever I want to happen when the user runs my container!"
exec echo "Hello Monsoir Meatball" "$@"
%post
echo "Here we are installing software and other dependencies for the container!"
apt-get update
apt-get install -y git
The important things to note. The header section says that we want to Bootstrap
a docker image, specifically From
ubuntu:latest. No, you don’t actually need Docker installed to run this, because the layers are pulled from their API endpoint.
Once you have your bootstrap file, and you know how to use GitHub, you are really good to go. You can add the file to repository, connect it to Singularity Hub, and it will build automatically and be available via the shub://
endpoint. That’s it!