| Tool | Description | Category | Experience Level | Project |
|---|---|---|---|---|
| ATOM Modeling PipeLine (AMPL) | AMPL is an open-source pipeline for building machine learning models on molecular data for in silico drug discovery. You can use this tool to:
Code Repository: Available | Modeling | Advanced | NCI-DOE |
| FORGEdb | FORGEdb is an online platform with API designed to help researchers interpret genetic variants associated with disease at the tissue and cell type level. You can use this tool to:
Code Repository: Available | Exploration/Analysis | Low/No-code | |
| Multi-omics Pathway Workflow (MOPAW) | MOPAW is a fully automated, multi-omics pathway for exploring specific genes and pathways associated with different types of cancer. The tool is currently available through the Cancer Genomics Cloud (CGC). You can use the tool to: run multi-omics analysis with your own or publicly available data sets on the CGC (e.g., The Cancer Genome Atlas and Clinical Proteomic Tumor Analysis Consortium). identify unique pathways for sample groups using multivariate single sample gene set analysis (MOGSA). Code Repository: Not available | Exploration/Analysis | Low/No-code | |
| Multiscale Machine-Learned Modeling Infrastructure (MuMMI) | MuMMI is a machine learning–driven molecular dynamics simulation framework for studying protein–membrane interactions across scales. You can use this tool to:
Code Repository: Available | Modeling | Advanced | NCI-DOE |
| OmicCircos | OmicCircos is an R based application that helps you manage, analyze, and visualize omics data with high-quality circular plots. You can use the tool to:
Code Repository: Available | Visualization | Intermediate | |
| Rapid 3D Visualization of SNP Mutations (3DVizSNP) | 3DVizSNP is an online platform that visualizes protein structure three-dimensionally from a single variant calling file (VCF). You can use the tool to:
Code Repository: Available | Visualization | Low/No-code | |
| RNA-Seq Latent Featurizer Using Center Loss Cost Function (CLRNA) | CLRNA is a python package providing a deep learning algorithm researchers can use to learn about generalized gene expressions for drug response. You can use this tool to:
Code Repository: Available | Exploration/Analysis | Advanced | NCI-DOE |
| scCorr | scCorr is an R package to improve gene to gene correlation estimates in single-cell RNA sequencing data by reducing the impact of zero values that can bias your analyses. You can use the tool to:
Code Repository: Available | Exploration/Analysis | Advanced | |
| Synthetic Data Generator (SYNDATA) | SYNDATA is a suite of machine learning tools for creating realistic synthetic clinical text data. You can use this tool to:
Code Repository: Available | Modeling | Advanced | NCI-DOE |
Analysis Tools
Working across NCI, our team is developing new approaches and technologies for using data to predict, diagnose, and treat cancer. We’ve supported the following informatics and data science tools available for all researchers.