Skip to main content
An official website of the United States government

Training Guide Library

In a hurry and need quick instructions on the cancer data science lifecycle stages? Browse this list of guides and resources.

Data Generation and Collection

Identify and gather the data you need to address a problem.

  • Beginner
  • Advanced
    • [Training Video Recording] WebMeV | Discover how to use this intuitive, web-based, bioinformatics analysis toolkit designed for non-bioinformaticians. This walkthrough includes steps on how to upload data files, run a single-cell analysis (using the tools available within the toolkit), and how to navigate/create public data sets available within WebMeV.

Data Cleaning

Fix discrepancies and handle missing values in your data.

  • Beginner

Data Exploration and Analysis

Study your data, then form a hypothesis.

Predictive Modeling

Use computational tools like machine learning models to make predictions with your data.

Data Visualization

Communicate your data findings using interactive images, plots, and charts.

  • Beginner
  • Advanced
    • [Training Video Recording] Data Visualization with R | Learn how to use the ggplot2 package in the programming language R to graph plots that can form the basis of analysis. Note: This video is one of six videos that make up a course series exclusive to NCI staff and provided by the Bioinformatics Training and Education Program. In this recording, R Studio is accessed via DNAnexus.
    • [Training Video Recording] DNASTAR Lasergene Software | Learn about this software and its applications in molecular biology, including topics such as enzyme labels, primer design, cloning processes, construct analysis, and clone verification using Sanger sequencing.
    • [Training Video Recording] Next-Generation Clustered Heat Maps (NG-CHMs) | Learn how NG-CHMs can help you navigate large omic databases, zoom in on patterns, access external metadata resources, produce high-resolution graphics, and save metadata for later use. NG-CHMs play a valuable role in NIH projects, encompassing phenotypic and genotypic data at DNA, RNA, protein, and metabolite levels in bulk and single-cell studies.

Data Sharing

Accelerate discovery by making your data available to others.

Share Ideas

Help us help you! If you believe content is missing or needs modifying, email NCI CBIIT.

  • Updated:
Email