Skip to end of banner
Go to start of banner

Nimbus for Bioinformatics

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 6 Next »

This page:

Summary


This page covers information on how to use the new 'Pawsey Bio - Ubuntu 20.04 - 2021-11' image for Nimbus. Instructions on how to choose this image when creating your instance can be found here. This Bio-image is created to cater to bioinformatics users who prefer to have their instances pre-installed with software commonly used in the bioinformatics domain. Some of the software are part of Pawsey's ongoing effort to improve the experience of bioinformatics users at Pawsey.


Pre-installed software


The list of pre-installed software is as follows:

  • Ansible - an automation platform that Pawsey uses to automate a number of software deployment 
  • CernVM-FS - a read-only file system for accessing reference datasets
  • Docker - a popular container engine
  • Jupyter Notebook (container)
  • Lmod - a modules environment that we use at Pawsey for loading sotware
  • Nextflow - a popular workflow manager
  • Pip - a Python package installer 
  • Python3
  • RStudio (contrainer)
  • Singularity - a popular container engine that can be used on HPC
  • Singularity-HPC - a container modules installer
  • Spack - a package management tool

Instructions


We will cover instructions for how to use CernVM-FS, Jupyter Notebook, RStudio, Singularity-HPC on this page. For instructions for other software listed above, please see the software's original documentation page.

CernVM-FS

CernVM-FS is a file system that was developed by another supercomputing centre (Cern). It allows repositories to store files such as reference datasets that are commonly used by many researchers. At Pawsey, we currently mirror the datasets that are on Galaxy Project's repository. Please note that the datasets may not be comprehensive, and this service is not meant to replace your current methods for accessing public datasets.

On Nimbus instances created with the bio image, you can simple refer to and use the path to the datasets as follows:

ls /cvmfs/data.galaxyproject.org

Note: It may take a minute or two to load the folders. Once you have done it once, it will not take as long to show again.

Jupyter Notebook 


RStudio


Singularity-HPC


Unable to render {children}. Page not found: Nimbus Documentation Landing Page.

  • No labels