Data Science Resource Library

Note: If you would like to request content for this section or are interested in writing content, please email data at fredhutch.org, file an issue in our GitHub repo here, or see our contribution instructions here.

BioRender

BioRender is a web application that creates scientific figures, illustrations, and diagrams. An institutional license for BioRender Premium is available for...

Building Software Containers

This guide outlines the steps needed to create a software container which is automatically built and hosted publicly, which is convenient for use with workfl...

DMPtool.org

DMPtool.org is a free, open-source, online tool that supports researchers in creating, sharing and maintaining their data sharing plans related to many fundi...

Guidance for Making “Tidy” Data

Learning how to organize your data will help reduce errors, make analysis easier, and improve collaboration (even when your collaborator is your future self)...

How you can use cBioPortal

cBioPortal offers countless ways to explore and interpret cancer genomics data—without needing to code. Below, we walk through a series of research questions...

Nextflow Workflow Background

A workflow in Nextflow is (at the minimum) a text file written in a particular format containing all of the details of the analysis that are generally true ...

Nextflow Workflow Run Script

Each workflow which you run with Nextflow should ideally be run from its own directory. The reason we recommend this is that Nextflow uses a small database i...

PROOF Regulated

Analysis that involves data that is subject to an NIH Data Use Certification (DUC) often includes the creation of data and other derivative files that are al...

PROOF Release Notes

PROOF is a collection of projects designed to help researchers run WDL workflows on the Fred Hutch cluster. On this page we document release notes for updat...

PROOF Workbench Troubleshooting

PROOF Workbench aims to make it easier for researchers to get computing and analysis work done on the Fred Hutch cluster, and the PROOF system itself is als...

Running Nextflow Workflows on AWS

In order to execute a workflow on AWS, follow these instructions in order to create a Nextflow config file. We recommend creating this file in a location tha...

Running Nextflow Workflows on Gizmo

In order to execute a workflow using the gizmo SLURM cluster, follow these instructions in order to create a Nextflow config file. We recommend creating this...

Running Your First Nextflow Workflow

If you are working to run a workflow for the first time, you will need to: Decide where you want the workflow to run Set up your Nextflow configuration ...

Specifying Nextflow Workflow Resources

One of the really appealing things about a workflow manager is that you have the ability to specify how much CPU and RAM is used for each individual step in ...

Starting out on the Cluster

Starting out with working on the FH cluster can be a daunting process. In this article, we will point you to the right information you need to get started.