A new single-cell sequencing to validate new cancer cell types

Significance 

Cancer is a disease that stems from the disruption of cellular state. Through genetic perturbations, tumor cells attain cellular states that give them proliferative advantage over the surrounding normal tissue. The inherent variability of this process has hampered efforts to find highly effective common therapies, thereby ushering the need for precision medicine. The scale of single-cell experiments is poised to revolutionize personalized medicine by effective characterization of the complete heterogeneity within a tumor for each individual patient.

Single cell mRNA analysis of cancer cells is one of the leading edge techniques being used to better understand cancer biology. The data generated can be used to try to disrupt cancers with drugs or work out how cancers arise in the first place. Studying the cancer transcriptomes depends on accurate identification of cancer cells. Therefore, the foundational step of tumour single cell analyses is separating cancer from non-cancer cells. The simplest approach to identifying cancer cells is to use expression of cancer specific marker genes. However, such genes do not always exist and are generally insufficiently precise, especially without corroborating readouts such as cellular morphology. Another approach is to infer the presence of tumour-defining somatic copy-number changes from shifts in average expression. The idea here is that gains or losses of genomic regions will generally increase or decrease the expression level of genes in these regions respectively. Challenges with this approach include smoothing and denoising expression changes, establishing a baseline against which to measure shifts in expression, segmenting the genome, and identifying changes in expression not due to copy-number changes. Despite these challenges, both marker genes and shifts in average expression may accurately identify cancer cells in certain circumstances. However, if there is any novelty or ambiguity in the identity of cancer cells, then these two approaches are inherently fallible as they are both based on expression and not direct evidence that a cell is cancerous, that it carries the somatic cancer genome.

A fundamental step in this process is separating cancer and non-cancer cells, but this isn’t always an easy task. As well as the many types of cancer, there will also be molecular differences between cancer cells of the same type within a single tumor. Currently, the best method of doing this is to measure the average gene expression of cells in the sample, with higher or lower expression used to distinguish cancer cells from healthy cells but this method can lead to false conclusions.

A new method of separating cancer cells from non-cancer cells has been developed by researchers at the Wellcome Sanger Institute, in a boost for those working to better understand cancer biology using single-cell mRNA sequencing. The study, published today in Communications Biology, improves on existing methods to identify which cells in a sample are cancerous and provides crucial data on the microenvironment of tumors. The software is now openly available for researchers around the world to apply to their own data, advancing the effectiveness of single-cell sequencing to understand cancer.

The research team performed whole genome sequencing and single-cell mRNA sequencing on clinical samples. By locating imbalances of alleles in these data, which indicate copy number changes in the genome, the team was able to identify cancer cells more reliably than with previous methods. This approach will primarily be useful for validating new cancer cell types and better understanding the microenvironment of tumor tissue. It is also hopeful the new technique will serve in the near future for the discovery of novel prognostic signatures and of therapeutic strategies that may enhance the implementation of precision medicine and will improve the clinical management of various cancers.

A new single-cell sequencing to validate new cancer cell types - Medicine Innovates

Image Credit: Communications Biology (2022). DOI: 10.1038/s42003-022-03808-9

About the author

Dr Matthew D Young

Principal Staff Scientist
Wellcome Trust Sanger Institute

With a background in physics, bioinformatics, and transcriptomics, my research attempts to better understand the transcriptional properties of normal human development and how these processes are reflected in disease.

Recent technological advances have made it possible to measure the patterns of gene expression (the transcriptome) of thousands of individual cells. Using these single cell transcriptomic technologies, we have been able to create high quality reference maps of human tissues in development and maturity, as part of the Human and Developmental Cell Atlas projects. We have then compared these reference maps with tumour cells of childhood cancers, yielding insights into these tumours normal cell correlates, transcriptional transformation, and developmental origins (e.g. Young et al., 2018). Our ongoing efforts in this area continue to improve our understanding of a broad range of childhood cancers.

These efforts have provided convincing evidence of the developmental origins of childhood tumours, and provide the basis for further efforts to understand the relationship between development and disease. In particular, phylogenetic techniques can demonstrate tumour’s cell of origin, and by precisely defining the differences between normality and disease we can understand the biologically relevant commonalities and differences within and between tumour types.

My current research focuses on integrating measurements of genomic transformations (such as copy number changes) with single cell transcrptomic methods to understand the precise transcriptomic consequences of these changes. I study what effect these changes have in both normal tissues and in disease.

I also lead the development of computational tools for the analysis of single cell transcrptomic data and its integration with other data types. This includes the development of SoupX, a tool to remove ambient RNA contamination from droplet based single cell data.

Reference

Mi K Trinh, Clarissa N Pacyna, Gerda Kildisiute, Christine Thevanesan, Alice Piapi, Kirsty Ambridge, Nathaniel D Anderson, Eleonora Khabirova, Elena Prigmore, Karin Straathof , Sam Behjati, Matthew D Young. Precise identification of cancer cells from allelic imbalances in single cell transcriptomes. Communications Biology (2022). DOI: 10.1038/s42003-022-03808-9.

Go To Communications Biology