Computational Genomics Research

The Genomic Data Commons (GDC) was launched at the University of Chicago on June 6, 2016. GDC is a unified data system that promotes sharing of genomic and clinical data between researchers.

Credit: Univ. of Chicago

Computational genomics applies algorithms and statistical models to big datasets. CCG generates large genomic and clinical datasets through the Genome Characterization Pipeline, shares data through the Genomic Data Commons (GDC), and makes data accessible on commercial clouds by partnering with the NCI Cloud Resources. Members of CCG’s Genomics Data Analysis Network and external researchers from around the world develop and apply a range of computational techniques to analyze data in the GDC.

Key Questions

  • Can analyzing cancer genomic datasets compiled from an exceedingly large number of patients increase our power to discover new cancer driver mutations?
  • What are the best ways to display cancer genomic data such that cancer researchers can explore and visualize large, complex datasets?
  • How can investigators effectively integrate data from multiple modes of genomic analysis into a unified view of oncogenic pathways?
  • Which new technologies, such as single-cell DNA and RNA sequencing, provide fresh views of cancer mechanisms?

Programs and Collaborations

Genomic Data Commons

CCG established the NCI Genomic Data Commons (GDC) as a data sharing and analysis platform with the overarching goal of making genomic and clinical data truly accessible to the cancer research community. In addition to serving as one of the most comprehensive genomic data repositories available, the GDC identifies and implements best-in-class bioinformatic pipelines, producing a standardized collection of data that can be cross-analyzed and compared. The GDC also develops web-based, interactive, and clinically relevant tools and offers multiple options for accessing data, including cloud-based, high-performance routes via NCI’s Cloud Resources.

If you would like to reproduce some or all of this content, see Reuse of NCI Information for guidance about copyright and permissions. In the case of permitted digital reproduction, please credit the National Cancer Institute as the source and link to the original NCI product using the original product's title; e.g., “Computational Genomics Research was originally published by the National Cancer Institute.”