Skip to end of banner
Go to start of banner

Life Science and Bioinformatics

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Reference datasets

Pawsey hosts a number of life science reference datasets centrally to save users from repeatedly downloading the same common datasets. These are hosted on /scratch/references/ . Below is a list of datasets:


Database/OrganismAdditional Information
Alphafold


Arabidopsis thalian

TAIR10
BlastUpdated ~every monthly maintenance
DiamondA faster alternative to Blast
HumanIncludes:

Broad hg19 bundle
Broad hg38 bundle
GRCh38

Interproscan Version v5.56-89.0


Metagenome Atlas v2.9
Mouse

Includes:

Broad mm10 bundle

GRCm38

mm10

RNA_M25

Qiime

Sarek nf-core pipeline reference 

iGenome files for GATK GRCh38. By pointing to these references files, users can avoid slowdowns from Sarek downloading and cache-checking if it performs the download itself. Use the `--igenomes_base` flag to point to the local reference files. 
  • No labels