Using Motuz to Move Large Data
Updated: October 6, 2023
Edit this Page via GitHub Comment by Filing an Issue Have Questions? Ask them here.This demo provides specific examples of how to use the new tool, Motuz, to transfer data between Fred Hutch data storage locations.
Accessing Motuz
Motuz is available at https://motuz.fredhutch.org. This link is only accessible from the campus network. If you are off campus you will need to use VPN.
Log in with your HutchNet ID and password.
At this time, Motuz allows copying of files/objects between the following file storage systems (with more to come):
- Shared file systems on campus (your home directory,
/fh/fast
,/fh/scratch
) - Amazon S3
- Azure Blob Storage
- Google Cloud Bucket
- SFTP
- WebDAV
- Dropbox (Beta)
- Microsoft OneDrive (Beta)
This demo will focus on moving files between shared file systems on campus and Amazon S3.
Add a new Cloud Connection to Motuz
Amazon S3 Bucket Connection
Before we can use Motuz to move data, we need to add a Cloud Connection which will tell Motuz about our S3 account and bucket.
Every researcher at Fred Hutch has access to
an S3 bucket belonging to their lab. If your PI’s name is Jane Doe, your lab’s bucket would be called fh-pi-doe-j-eco
.
Each researcher also has credentials to access this bucket. See the documentation for how to obtain these credentials.
In Motuz, click the Cloud Connections
link at the top of the browser window. Then click the New Connections
button.
There are two kinds of credentials - Single Sign-On (SSO) and IAM Access Key Pairs. By default (since 2024), all users can use SSO credentials, which last for 12 hours. IAM Access Key Pairs do not expire. If you need longer-lasting credentials, such as for a long-running transfer, please contact the Cloud Team by emailing helpdesk
.
To obtain AWS SSO credentials, navigate to the AWS SSO Portal (documented here). Choose your account and click the triangle to its left:
Click Access Keys
. Scroll to the bottom, where you will see Option 3
:
Go to Motuz Cloud Connections page and click New Connection
(or click an existing connection to update its credentials). Be sure to change S3 Connection Type
in the dropdown list to Temporary Security Credentials (STS)
:
Copy and paste the respective values (access key ID, secret key, and session token) from the AWS SSO page into the appropriate fields in the Motuz Cloud Connection page:
In the Basic
section, set Type
to Amazon Simple Storage Service (s3)
. Give the connection a name such as lab s3 bucket
. Optionally enter the bucket name (for example, fh-pi-doe-j-eco
).
Click on Verify Connection
to make sure the connection works. Then click Create Cloud Connection
, or (if you are modifying an existing connection) Update Connection
.
Copying Files to Cloud Locations
You can copy a single file or the contents of a directory.
The Motuz UI consists of two panes. You can actually copy in either direction, but in this case we will copy from left to right, and therefore use the button that points to the right.
First, select the files you want to copy. In this case I want to copy a directory. I navigate from my home directory by clicking the Path widget in the right-hand pane and navigating to the top level directory (/
):
From there I can navigate to the location
of the files I want to copy, under /fh/fast
.
Note that each user has different permissions and you may not be able to access the files that I can access. Try to copy files in your lab’s subdirectory under /fh/fast
.
Then, in the right-hand pane, choose the cloud connection that you set up earlier:
If you want to copy to a sub-folder (prefix) of the S3 bucket, navigate there (this example uses a bucket you may not have access to; you will be seeing different files/folders in your PI’s bucket).
In the left-hand pane, single-click the folder you want to copy to S3:
Then click the blue arrow that points to the right:
You will see a confirmation screen like this that allows you to enter an optional description of the copy job:
Click the Submit Copy Job
button.
You’ll see the copy job’s progress shown in the bottom pane:
If all goes well, the copy will complete with the status SUCCESS
.
At any point during or after the copy, you can click this line in the status bar to see more information about the job status:
If you change your mind, you can cancel the copy job by clicking the Stop Job
button.
Copying from S3 to a shared directory on campus
This is similar to the previous steps; just find the directory you want to copy in the right hand pane, navigate to the place you want to copy to in the left hand pane, and click the blue arrow that points to the left:
Advanced Usage
Motuz also has an API; consult its documentation. Note that the API endpoint to use with Fred Hutch’s instance of Motuz is https://motuz.fredhutch.org/api/
.
Problems?
Email scicomp
or file an issue in the Motuz repository.
Updated: October 6, 2023
Edit this Page via GitHub Comment by Filing an Issue Have Questions? Ask them here.