Excerpt |
---|
There are multiple filesystems mounted to each of Pawsey's supercomputers. Each of these filesystems are designed for particular use cases. This page provides a detailed description of these filesystems. |
...
Column | ||||||
---|---|---|---|---|---|---|
|
Overview
The following filesystems are available from one or more Pawsey supercomputing systems:
...
Apart from /home
, all are Lustre distributed filesystems. Lustre is an open-source, high performance parallel file system optimised for high throughput.
Column | |||||||
---|---|---|---|---|---|---|---|
|
...
Column | ||
---|---|---|
|
Home filesystem
The home filesystem should be used to store software configuration files. It is a Networked FileSystem (NFS). Each user has a default login directory in the /home filesystem with a quota of 1 GB and 10,000 individual files.
...
Due to its small quota limit and low performance, the /home
filesystem is not suitable for launching or storing production work. Files such as software installations and Slurm batch scripts should be stored on the /software
filesystem. Working data, such as job input and output, should use the /scratch
file system.
What to do if you exceeded your quota
First thing to do is to identify those directories that contain a large number of files or those files that are too large and are consuming your quota. Then delete them.
Identifying subdirectories with a large number of files
You can use the following command that finds the subdirectories recursively and list them in descending order of containing files. Execute this command from your $HOME
directory:
...
Then you can check the file $MYSCRATCH/homeSubdirectoriesRanked.out
and decide what subdirectories to remove. Note that the output is written in $MYSCRATCH
because you may have not enough quota to write in $HOME
.
Identifying large files
Column | |||||||||
---|---|---|---|---|---|---|---|---|---|
|
Then you can decide which files to remove. Note that you could have used the last filter (head -n 10
) also in the previous command to avoid a large output of lines, or you could have used here the same final filters as in the previous command in order to save output into a file for a later careful check.
Hidden files
Insert excerpt | ||||||
---|---|---|---|---|---|---|
|
Software filesystem
The /software
filesystem is a Lustre file system with much higher throughput than /home
. It is intended for software installations and Slurm batch script templates. Each project has an associated directory on the filesystem whose path is /software/projects/<project>
. Within a project directory, each project member has his or her own directory whose full path, /software/projects/<project>/<username>
, is contained in the MYSOFTWARE
environment variable.
...
Column | |||||||||
---|---|---|---|---|---|---|---|---|---|
| |||||||||
|
Scratch filesystem
The scratch filesystem should be used for working data, which is input and output files actively used by jobs queued or running on the supercomputer.
...
Anchor | ||||
---|---|---|---|---|
|
Reference datasets
Reference data sets are static data required by software for calibrations or testing or as widely used input data. Reference data sets that are used by several project groups will be provided on /scratch
by Pawsey to avoid multiple copies existing. These data sets will be contained in subdirectories of /scratch/references.
...
If you would like to request addition of a new reference dataset, please email the Pawsey Helpdesk help@pawsey.org.au
File permissions and quota
The effect of file permissions and ownership on storage quotas varies depending on which filesystem the data is located. The default behaviour can be summarised as such:
...
The extra tests for the find
commands in terminal 7 and terminal 8 speed up the process for many files, by only changing files and directories that need to be changed.
Astronomy filesystem
The Astonomy Filesystem /astro
is a Lustre filesystem provided for the scratch space needs of the MWA group who perform computations on the Garrawarla cluster.
...
The expandability of Lustre means that the filesystem can be expanded, without downtime, by adding more OSS's and OST Disk behind them in groups of 2 (for high availability).
Location
The Astronomy filesystem is mounted on all Garrawarla nodes and Setonix data mover nodes as /astro
. The top level directory has directories for all the areas that /astro
has:
...
The pawsey0001 directory is for Pawsey testing of the system and can be set up in different ways as needed. It will not often be used.
Quotas
At the time of writing MWA have requested that mwaeor, mwavcs, mwaops and mwasci are assigned 370 TB, 580 TB, 20 TB and 600 TB respectively.
...
$ lfs quota -g projectcode /astro
Usage
To check usage you can use the df
command to check the entire filesystem. This command gives a breakdown by OST and a summary at the bottom.
Column | |||||||||
---|---|---|---|---|---|---|---|---|---|
| |||||||||
|
Related pages
External links
- Lustre home page