Skip to main content
An official website of the United States government
Email

Machine Learning Identifies Clonal Hematopoiesis Driver Mutations

NCI-supported researchers leveraged machine learning (ML) models to understand the specific mutations that initiate Clonal Hematopoiesis (CH)—a condition where genetically mutated hematopoietic stem cells multiply faster than usual, potentially leading to blood cancers and other health issues. By following their approach, you can use similar ML techniques and software to conduct your own analyses.

To replicate or extend their findings, you’ll need a large set of high-quality blood somatic mutations to train these ML models. Researchers achieved this by using samples from over 33,000 patients with cancer from three cancer genomics cohorts. Building on methods previously applied to identify cancer driver mutations, they trained their models on features that distinguish the blood somatic mutations observed in CH genes across donors from those expected to arise under neutral mutagenesis. With these tools and data sets, you can gain deeper insights into CH and its implications.

Read the full report, “Identification of Clonal Hematopoiesis Driver Mutations Through In Silico Saturation Mutagenesis,” in Cancer Discovery. You can also access links to data and software used in the study.

As researcher, Dr. Abel Gonzalez-Perez shared, “These models outperform expert-curated rules based on prior knowledge of the function of these genes. This suggests machine learning provides a more systematic and unbiased method for identifying CH driver mutations.”

He added, “We propose that these models support the accurate identification of CH across healthy individuals and imply the potential for early CH detection and intervention in seemingly healthy individuals.”

< Older Post

New Insight on Cancer Development and Treatment Resistance from HTAN Publications

Newer Post >

NCI-Funded Tool Helps to Advance Neoantigen Therapies

If you would like to reproduce some or all of this content, see Reuse of NCI Information for guidance about copyright and permissions. In the case of permitted digital reproduction, please credit the National Cancer Institute as the source and link to the original NCI product using the original product's title; e.g., “Machine Learning Identifies Clonal Hematopoiesis Driver Mutations was originally published by the National Cancer Institute.”

Archive

Email