Computational Genomics Research
Computational genomics applies algorithms and statistical models to big datasets. CCG generates large genomic and clinical datasets through the Genome Characterization Pipeline, shares data through the Genomic Data Commons (GDC), and makes data accessible on commercial clouds by partnering with the NCI Cloud Resources. Members of CCG’s Genomics Data Analysis Network and external researchers from around the world develop and apply a range of computational techniques to analyze data in the GDC.
Key Questions
- Can analyzing cancer genomic datasets compiled from an exceedingly large number of patients increase our power to discover new cancer driver mutations?
- What are the best ways to display cancer genomic data such that cancer researchers can explore and visualize large, complex datasets?
- How can investigators effectively integrate data from multiple modes of genomic analysis into a unified view of oncogenic pathways?
- What new technologies provide fresh views of cancer mechanisms, such as single cell DNA and RNA sequencing?
Tools and Methods
Computational genomics applies algorithms and statistical models to big datasets. CCG generates large genomic and clinical datasets through the Genome Characterization Pipeline, shares data through the Genomic Data Commons (GDC), and makes data accessible on commercial clouds by partnering with the NCI Cloud Resources. Members of CCG’s Genomics Data Analysis Network and external researchers from around the world develop and apply a range of computational techniques to analyze data in the GDC.
Programs and Collaborations
Genomic Data Commons
CCG established the NCI Genomic Data Commons (GDC) as a data sharing and analysis platform with the overarching to goal to make genomic and clinical data truly accessible to the cancer research community. In addition to serving as one of the most comprehensive genomic data repositories available, the GDC identifies and implements best-in-class bioinformatic pipelines, producing a standardized collection of data that may be cross-analyzed and compared. The GDC also develops web-based, interactive and clinically relevant tools and offers multiple options for accessing data, including cloud-based, high performances routes via NCI’s Cloud Resources.