Reference datasets
Pawsey hosts a number of life science reference datasets centrally to save users from repeatedly downloading the same common datasets. These are hosted on /scratch/references/
. Below is a list of datasets:
Database/Organism | Additional Information |
---|---|
Alphafold | |
Arabidopsis thalian | TAIR10 |
Blast | Updated ~every monthly maintenance |
Diamond | A faster alternative to Blast |
Human | Includes: Broad hg19 bundle |
Interproscan Version v5.56-89.0 | |
Metagenome Atlas v2.9 | |
Mouse | Includes: Broad mm10 bundle GRCm38 mm10 RNA_M25 |
Qiime | |
Sarek nf-core pipeline reference | iGenome files for GATK GRCh38. By pointing to these references files, users can avoid slowdowns from Sarek downloading and cache-checking if it performs the download itself. Use the `--igenomes_base` flag to point to the local reference files. |