Gerstein Lab Highlights in 2020
The Gerstein lab research
During 2020 the Gerstein lab was involved in numerous research projects in the area of human genomics, cancer genomics, cancer evolution, genomic privacy, commentary and book review. These projects resulted in various publications and commentaries in journals like Nature, Science Cell, Nature Communications, Genome Biology and others.
Core Publications (for full citations see publication list)
A large part of the lab’s research is focused on cancer, human genomics and genomic privacy with regards to precision medicine. As part of The Pan-Cancer Analysis of Whole Genomes (PCAWG) consortium our lab led and participated in projects such the analysis of more than 2500 cancer whole genomes estimating the burden of non-coding somatic drivers in these genomes, while also developing a framework that detects the presence and impact of cancer drivers from single individual samples using variant allele frequency. Similarly, as part of the ENCODE consortium, our lab led and participated in a successful series of publications on expanding the encyclopedias of DNA elements for genome interpretation in human and mouse genomes. Moreover, with regards to cancer genomics, we developed networks of both transcription factors and RNA-binding proteins that aim to interpret non-coding mutations and gene regulation. Finally, in relation to genomic privacy and personalized medicine, our lab quantified private information leakage from functional genomics while developing the necessary tools to allow raw functional genomic reads to be shared more safely (data sanitization).
Software tools
In Gerstein lab, we build tool for the analysis of next generation sequencing data. Our newly developed software expands from cancer genomics to the analysis of structural variance, functional regulation and privacy. More' specifically, in 2020 our lab 6 new tools for the analysis of next generation sequences. More specifically, SVFX is a machine learning application that aims to quantify the pathogenicity of structural variants (SVs) in somatic and germline SVs with focus on cancer-related pathways. RADAR is a variant scoring framework that uses conservation, RNA structure, network centrality and motifs to provide an overall impact score for noncoding variants associated with RNA-binding proteins (RPBs) functional dysregulation. FANCY, is an application for fast estimation of privacy risk in functional genomics data. SigLASSO is a software tool that decomposes the counts of cancer mutations into a linear combination of known cancer signatures. DiNeR is a graphical model that highlights key disease regulators by quantifying chances in co-regulatory networks and transcription factor co-binding patterns. Finally, TopicNet is a method that applies latent Dirichlet allocation to pinpoint particular transcription factors (TFs) that change greatly between cellular states.
Book reviews, opinions and commentary
In 2020, we participated in the scientific public discourse through reviews, opinion articles and commentaries. We published an opinion article in Hartford Courant highlighting the dangers of information leakage, during while for covid-19 during the pandemic entitled “As coronavirus testing expands, new personal privacy issues arise”. Moreover, we published a book review on “How innovation works” by M.Ridley and “The innovation delusion” by L.Vinsel and A. Russell. Our commentary discusses the different pathways suggested by the two books while innovation has been reduced to a buzzword.
Return to front page