Reference datasets
Pawsey hosts a number of life science reference datasets centrally to save users from repeatedly downloading the same common datasets. These are hosted on /scratch/references/
. Additional references can be added if there is sufficient user interest. If there is something you would like to have added, please drop us a line at help@pawsey.org.au. Below is the current list of datasets:
...
10x_singlecell_gene_expression
...
refdata-gex-GRCh38-2020-A
refdata-gex-mm10-2020-A
refdata-gex-GRCh38-and-mm10-2020-A
...
10x_spatial_gene_expression
...
refdata-gex-GRCh38-2020-A
refdata-gex-mm10-2020-A
...
Arabidopsis thalian
...
Broad hg19 bundle
Broad hg38 bundle
GRCh38
...
Interproscan Version v5.56-89.0
...
Includes:
Broad mm10 bundle
GRCm38
mm10
RNA_M25
...
Sarek nf-core pipeline reference
...
Useful Documentation
Applying for compute with the ABLeS Scheme:
https://www.biocommons.org.au/ables
Using the dedicated Workflow Nodes:
How to Run Workflows on the Workflow Nodes
Interactive computing:
How to Run JupyterLab via Conda
How to Run JupyterLab via Container
Conda:
Conda and Reproducible Installations
How to Configure Conda to Avoid Quota Issues
Installing your own modules:
SHPC (Singularity Registry HPC)