Data management on the disk array¶
Partitions, quotas, tips and commands to manage your job data on Myria
Introduction¶
The data of the Myria computer is stored on a NMS disk array accessible on the whole cluster in GPFS. The performance measured during delivery is 28 GB/s for the /dlocal
and /home
partitions. These partitions are optimized for reading/writing large files (16 MB block size).
If your work involves a multitude of small files, contact support to set up access to a partition more suited to this type of processing (see the rest of the documentation for explanations).
Some useful commands¶
How many files do I have in /dlocal
?
read the gpfs1 dlocal
line, column "files
How much disk space do I consume?
By adding --block-size auto
at the end of the command, you will get a display in "human" format (with the unit of value most suitable for a human)
For a human format display of quotas on home
and dlocal
spaces only:
How to find out the list of temporary job folders of the user name_login?
How can I quickly find out how many files my job has left in /dlocal/run/<jobid>
?
Slurm's prolog counts the number of remaining files in /dlocal/run/<jobid>
and writes it to your .o
file associated with the job:
To search for this line only:
How to know the list of jobs submitted in the partition 2tcourt between 03/01/2019 and 03/15/2019 to do housework?
You can add the -l
option to display more information.
How to know the number of files in folder folder_path
?
I need to reduce my number of files, but I can't delete anything. How can I do it?
Archive some trees with the tar
command: one archive = 1 file
Some tips¶
In the submission scripts...
The retrieval of data is done with an mv
command. Do not replace it with a cp
command, which duplicates the data and can take a long time to execute. The mv
command is immediate between /dlocal
and /home
.
If you develop...
Choose large files with HDF5 type formats rather than a multitude of small files. You will gain in performance on computing clusters with large block sizes.
If you generate a lot of files...
Keep an eye on your quota. Display it automatically at login (with appropriate addition to your ~/.bash_profile
).
Some additional information¶
Partitions and their use
The disk array is separated into 2 parts (file systems) each containing subparts (filesets).
The first part is the largest and most powerful: it hosts /home
and /dlocal
folders.
The second part is smaller and therefore less powerful: it hosts /soft
and /save
folders.
/home
contains the home folders for users./dlocal
contains the temporary job folders (/dlocal/run
) and some permanent job folders (/dlocal/home
) when the need is qualified./soft
contains software made available by CRIANN/save
contains mid-term storage for regional users:- no user or project space is created there by default, it is necessary to make the request by arguing the need: millions of small files on medium term, ...
Warning : no backup is made on user data. Remember to repatriate your codes and data to your laboratories.
We strongly encourage you to use versioning tools, such as GIT, at your institutions. Ask your IT department for more information.
The git
client is installed on the frontend nodes, without module loading.
Disk quotas
In order to guarantee good performance, it is necessary to maintain a reasonable rate of filling the bay (volume and number of files). For this, CRIANN has set two types of quotas:
/home
: default quota of 50 GB / user/dlocal
: 10 million files quota
In both cases, the limits correspond to "soft" values that can be exceeded temporarily (7 days). After this period, the usage must go back down below the soft limit, otherwise, no file creation is possible. A "hard" limit of 10 GB over soft quota is also set: it cannot be exceeded under any circumstances.
If you feel that these limits are too restrictive for you, do not hesitate to open a ticket with support. These limits can be increased upon justified request.
The command mmlsquota
explained at the top of this page allows you to display the quotas and the grace period between the "soft" and "hard" quotas. Once the 7 day time limit is exceeded, any request for additional volume allocation is refused. Only commands allowing to come back under the "soft" quota are allowed (rm
or mv
commands for example).
The problem of the number of files
CRIANN has chosen to keep the temporary files of the jobs (/dlocal/run/<jobid>
) beyond the life of the jobs. This folder can be used as a working folder for the next job.
These folders are automatically deleted by CRIANN, 40 days after the end of the corresponding job. This has the advantage of being able to link several jobs and also to recover data that would not have been recovered at the end of the job.
For most users, in 40 days, this corresponds to a few thousand files. For some users of software such as OpenFoam, this can represent several tens of millions of files. The quota is there to avoid a drift, but the submission of new jobs becomes impossible if the quota is exceeded: it is thus necessary to do some cleaning in addition to the automatic cleaning...
Si vous avez des questions, merci de contacter le support : support@criann.fr