Job Local Storage

Updated: May 19, 2023

Edit this Page via GitHub       Comment by Filing an Issue      Have Questions? Ask them here.

In the gizmo compute cluster you will be provisioned a directory on the allocated node’s local disk. This directory exists only for the duration of the job. Once the job exits this directory will be removed.

Please use this space instead of /tmp for local storage. The /tmp directory has less space available and (more importantly) is not automatically cleaned up after the job is finished

This local storage is on an attached SSD and has approximately 7TB available on the K class nodes. This storage is only available to processes running on the allocated node- it isn’t shared between jobs or between jobs running on multiple nodes (e.g. an MPI job)

Accessing Job Local Storage

In the gizmo compute cluster the environment variable $TMPDIR will be set to the path where this storage has been provisioned. Currently this variable is set using the job ID and uses the form /loc/scratch/<jobid>. This is a directory unique to the job and owned by you.

We recommend that you use the environment variable in your scripts rather than generating this path as the path to the job local storage may change. Using the environment variable will “future proof” your scripts.

Note for Java, GATK, and Picard Users

As Java (and by extension GATK and Picard) does not follow the TMPDIR convention we set the Java property java.io.tmpdir using the environment variable JAVA_TOOL_OPTIONS. This updates the default value- if you are using GATK or Picard specific options to set the temporary directory, that option value will be used.

Updated: May 19, 2023

Edit this Page via GitHub       Comment by Filing an Issue      Have Questions? Ask them here.