How to Run the PredictProtein Machine Image on OpenNebula
From Rost Lab Open
Prerequisites
This procedure assumes that:
- you have an OpenNebula account, e.g. at https://www.cloud.mwn.de/
- you have at least 30 Gb temporary storage space to download the machine and data images
Preparing the PredictProtein Machine Image for First Use
- Download PredictProtein Machine Image to your local drive. You will need the Bio-Linux-based KVM QEMU (Generic Raw Format) format version.
- Choose the Debian-based version in case you do not need a graphical user interface. This image is only half the size of the Bio-Linux-based image.
- Download a database image to your local drive.
- The 'raw' format and the tarball are almost the same. You can mount the raw format and unpack it to a formatted drive or you can upload the tarball and then unpack it from there.
- Login into your OpenNebula management (e.g. at LRZ)
- Upload main PP image to OpenNebula cloud
- go to Virtual Resources -> Images
- use (+) to add an image from uploaded data (Type OS)
- Prepare for PP data: use either of these methods:
- Upload PP raw data image to OpenNebula (test) cloud
- go to Virtual Resources -> Images
- use (+) to add an image from uploaded data (Type DATABLOCK)
- Have a larger storage disk in the template you make next and copy the tar ball onto that disk.
- Upload PP raw data image to OpenNebula (test) cloud
- Create a template for the virtual machine
- go to Virtual Resources -> Templates
- Create a new template (give it a name) with these properties:
- on General: Don't use more virtual CPUs than real ones
- on Storage: Choose the main PP image created/uploaded above, specify 'vda' as a 'target'
- on Storage: add a second disk, choose volatile and give it 100 Gb of space -> you will unpack the PP data to that disk
- on Storage: add a third disk, choose the empty data image you created above for storing results
- if you use the raw data image: on Storage: add another disk and choose the PP data image created/uploaded above, no need to specify a target
- on Input/Output: specify keymap 'us'
- Instantiate the virtual machine
- go to Virtual Resources -> Virtual Machines
- Hit (+) to create a new virtual machine
- Choose your template from the list and hit "create"
- Alternatively you can directly instantiate from Virtual Resources -> Templates -> Instantiate
- Configure in the machine that is running
- note the IP of your instance
- log into your instance (in case of biolinux you first have to go in via VNC and set a password for ppuser)
- to check what has been loaded, you can do '
cat /proc/partitions
' and e.g. 'sudo fdisk -l /dev/vdc
' - run the following commands:
sudo mkdir /mnt/resultStore
sudo mount /dev/vdc /mnt/resultStore
sudo mount /dev/vdb /mnt/local-storage/
sudo mkdir /mnt/local-storage/rostlab-data
sudo chown -R ppuser.ppuser /mnt/local-storage/rostlab-data/
cd /mnt/local-storage/rostlab-data
- tarball version:
scp user@host:/path/rostlab-data.txz .
tar -xvJf rostlab-data.txz
- raw image version:
tar -xvJf /dev/vdd
(assuming that the raw data is on /dev/vdd)
- tarball version:
sudo mount --bind /mnt/local-storage/rostlab-data /usr/share/rostlab-data
Using the PredictProtein Machine Image
- Open a terminal and run
man ppmi
to get started. You will find usage examples in the manual pages referenced there, e.g.:man predictprotein
man snapfun