Training, Community and Finding Help
Updated: June 2, 2020
Edit this Page via GitHub Comment by Filing an Issue Have Questions? Ask them here.There are a variety of resources for training on various aspects of bioinformatics, analytics and data skills available at the Fred Hutch, in the Seattle area and on the web.
Resources At Fred Hutch
In Person Courses
fredhutch.io offers frequent, on campus courses on a topics such as R, Python and GitHub; course descriptions can be found here. Fred Hutch and SCCA employees can register for these courses by going to MyHutch and clicking on Hutch Learning. Search for “fredhutch.io” to view and register for all currently available courses. fredhutch.io also helps coordinate additional events and specialized classes in collaboration with The Coop (see below).
Community Groups
The Coop is the Fred Hutch Bioinformatics and Data Science Cooperative, and works to share information and resources about computational work across the Hutch. The Coop maintains a listserv, calendars of data science events, and The Coop Communities Slack, supported by FHBig. FHBig is the Fred Hutch Bioinformatics Interest Group, a community-based group that hosts a blog and facilitates information sharing among the bioinformatics research community at the Hutch. Hutch employees can learn more through the links above or by emailing coophelp
with questions or to be added to our email/newsletter list.
The Coop and FHBig also support community groups that meet regularly to discuss topics ranging from basic data literacy skill building to reproducible computational methods. To learn more about what to expect from these meetings, please visit our Community Groups GitHub repository. Current meeting schedules and locations are available on the Google calendar, and please contact the Coop (coophelp at fredhutch.org) or the relevant channel in The Coop Communities Slack for updates or to ask questions.
-
Coop Meetings: The Coop periodically hosts meetings that rotate among a few different formats, which in the past have included panel discussions, seminars, and community reports on topics related to data-intensive research. See The Coop Communities Slack or The Coop in MS Teams (requires HutchNetID) for more information.
-
Data Visualization Group: Develops visualizations from data released by TidyTuesday and other sources. All programming languages (and even non-coders!) are welcome. Notes about this group can be found here. See the #data-viz channel on The Coop Communities Slack for more information.
-
Nextflow User Group: Discusses issues related to the use of Nextflow (and other workflow managers like Cromwell) at Fred Hutch. The group welcomes Nextflow users of all levels of expertise as well as those just learning about the tool. See the #nextflow channel on The Coop Communities Slack for more information.
-
Python User Group: Discusses programming, data analysis, and troubleshooting in Python and related topics in general software development. Python coders of all levels of expertise are welcome. Notes about this group are recorded here, and check out the #python-user-comm channel on the Coop Communities Slack for more information.
-
R User Group: Discusses programming, data analysis, and troubleshooting in R. R coders of all levels of expertise are welcome. Notes about this group are located here, and check out the #r-user-comm channel on the Coop Communities Slack for more information.
-
Technology Exchange: Informal learning forum for employees in technology-focused jobs to share technical skills. This group is most appropriate for individuals who want to discuss the technical details of tools commonly used in positions focused on administration and infrastructure. However, anyone is welcome to attend this forum, and many topics will be relevant to researchers in positions that require technical skills. Subscribe to the TechExchange mailing list or visit Tech Exchange in MS Teams (requires HutchNetID) for more information.
-
Retired Community Groups: All of our groups were created because of the needs voiced by our community. Sometimes those needs change, and we’re happy to respond by refocusing our efforts. The following groups have been retired, but are noted here for reference purposes:
Software design group:Discusses issues related to software engineering and interface design and invites speakers on specific topics and participants to share their coding projects. This group is not focused on a specific programming language, but rather on issues common across software projects. Notes from this group can be found here. Discussions about this topic are now included periodically in the Python User GroupShiny User Group:The Shiny User Group meets once monthly to discuss this method of using R to create interactive web applications for visualizing data analysis. Notes about this group are located here. Discussions about this topic are now included periodically in the R User Group
Office Hours
Several groups on campus host weekly or monthly office hours to provide assistance on data related tasks. Please visit Centernet or the Google calendar for current scheduling, locations, and contact information.
-
Data Ethics/Compliance/Security: Staff from the Center Information Technology (CIT) Compliance Office with expertise in information security are available to answer questions related to secure data management and resources at the Hutch for security compliance.
-
The Coop/fredhutch.io: Staff are available to assist researchers in getting started with coding and orienting staff to resources for improving their coding, including troubleshooting application of code to research questions.
-
REDcap: REDCap is a secure web application for building and managing online surveys and databases and is managed by Collaborative Data Services (CDS) at Fred Hutch. Staff are available to answer questions related to REDCap functionality and troubleshooting. You can also discuss REDCap issues with staff and other researchers on the #redcap-user-comm channel on The Coop Communities Slack.
-
SciComp General Consulting: Scientific Computing (SciComp) is the group in Hutch Data Core (HDC) that manages basic account access for research computing resources, including data storage and shared computational cluster use. Staff are available to answer questions related to access and usage of Hutch resources for computational research.
-
SciComp Next Gen/HPC Office Hours: In addition to general consulting (see above), Scientific Computing (SciComp) supports researchers interested in emerging technologies like next generation sequencing and cloud computing. Staff are available to answer questions about getting started building analytical pipelines in the cloud, and generally making computation more scalable and reproducible.
Resources In Seattle
- UW Biostatistics Summer Institutes offer yearly intensive courses over the summers on a wide variety of topics.
- Meetup hosts various coding groups that meet regularly to share skills and provide networking opportunities. RLadies Seattle and Seattle UseR both include leaders from Fred Hutch.
Resources On the Web
Classroom-Style Courses
These resources are organized in a lecture type format as slides, screencasts, and video. Most are work-at-your-own-pace, but some may be linked to a course calander.
- MCB517A: Tools for Computational Biology: A graduate-level course taught for UW by Fred Hutch CompBio faculty. This links to a GitHub repository that includes all lectures and homework.
- How to install necessary software for this course
- Ask questions about this course
- Course materials all available for free
- edX: Offers a collection of courses for Data Analysis and Statistics and Bioinformatics
- Rafael Irizarry of Dana Farber has online programs available through edX:
- Generally speaking, edX courses are all free to audit for a limited period of time. Unlimited access and the ability to earn a course Certificate will require payment
- Coursera: Offers a collection of courses for Data Science and Bioinformatics
- R Programming: A beginner-level program has five mini-courses. It takes about 4 months to complete.
- Statistics with R: A beginner-level program with five mini-courses. It takes about 7 months to complete.
- Genomic Data Science: An intermediate-level program for those who are already aquainted with R. It has eight mini-courses. It takes about 6 months to complete.
- Python Programming: An intermediate-level program that takes about 4 months to complete.
- Coursera offers a 7-day free trial, and is a paid subscription service after
- Udacity: Offers a collection of courses for Data Science
- Udacity is a paid subscription service
- Currently offering one month free for their Nanodegree programs.
- Udemy: Offers a collection of courses for Data Science
- Udemy offers courses at various price points.
- Keep an eye out for sales which happen regularly and can drastically reduce the cost.
- CognitiveClass.ai Offers a collection of courses for data science, AI, and cloud computing.
- All courses are free
- The Open Source Data Science Masters: An open-source curriculum for learning data science. This is a mixed media course made up of videos, books, and slides.
- Some content is free, some is paid
- CalTech Learning from Data
- A free YouTube series
Interactive Coding Platforms
These resources offer classes that are work-at-your-own-pace with a major focus on hands-on problem-sets and projects.
- DataQuest: A subscription service that offers programs and courses focused on data anlysis and engineering in Python and R.
- Tiered payment system with basic and premium plans
- CodeAcademy: A subscription service that offers coding programs and courses in many different languages.
- Tiered payment system with limited content available for free
Free ebooks
-
O’Reilly books available through Seattle Public Library
Misc
-
The Carpentries, with lessons from Data Carpentry and Software Carpentry
Updated: June 2, 2020
Edit this Page via GitHub Comment by Filing an Issue Have Questions? Ask them here.