Gerstein lab research

During 2021 our research highlights included findings in genomic privacy, transcriptional regulation, disease genomics and wearable technology. The lab also wrote a comment on quantum computing for biological sciences. Other core publications focused on cancer genomics and a genome browsing tool. Our works were published in journals including Cell Systems, Nature Methods, and others.

Core Publications Highlights

Last year's most significant accomplishment related to preserving genomic privacy, against the reduction of anonymization that has come with next-generation sequencing (1 & 8, see below). We developed a method for genotype imputation with encrypted inputs and outputs, enabling it to confidentially occur in cloud spaces with suboptimal security (1).

Our work in transcriptional regulation also took an important step by developing a deep-learning method to discover and demarcate regulatory regions of the genome (i.e., enhancers) (2), improving the precision of demarcation from our past work, which applied supervised learning to this task. In addition, we developed a novel method to simulate a single cell assay for studying regulatory regions (scATAC-seq), an important step toward providing a gold standard baseline against which the efficacy of real-world scATAC-seq data analysis approaches can be compared (3). Our newest interest is biosensor signal analysis to improve the precision of phenotyping (4, 5 & 12). Our first contribution is Bayesian modeling of data from multiple sensors worn concurrently to assess changes in lifestyles and public policies (4). This interest immersed us in global concerns about wearable sensor quality assurance, data standardization, and privacy. We convened a panel of academic stakeholders to discuss these concerns (5). Proceedings supported the need for a networking hub connecting researchers and manufacturers to accomplish these goals. The new Sports Tech Research Network has taken our advice on board. Finally, we continued to publish many papers on cancer genomics (9, 10 & 11).

Reviews Highlights

We complemented our accomplishments in genomic privacy research with a perspective article about privacy problems specific to the field of functional genomics, highlighting privacy threats and mitigation techniques at the respective steps of the data generation, sharing, analysis, and summarization (6). A newer interest we have begun exploring is quantum computing for its potential applications to biology. Therefore, we published a comment overviewing quantum computing for biologists and suggesting areas to apply it, including sequence analysis, genetics, functional genomics, and neuroimaging phenotyping (7).

Core Publication List (includes full references of citations above)

1. G. Gürsoy, et al.. Privacy-preserving genotype imputation with fully homomorphic encryption. Cell Syst (2021) https:/doi.org/10.1016/j.cels.2021.10.003

2. Z. Chen, et al. DECODE: a Deep-learning framework for Condensing enhancers and refining boundaries with large-scale functional assays. Bioinformatics (2021) https:/doi.org/10.1093/bioinformatics/btab283

3. Z. Chen, et al. SCAN-ATAC-Sim: a scalable and efficient method for simulating single-cell ATAC-seq data from bulk-tissue experiments. Bioinformatics (2021) https:/doi.org/10.1093/bioinformatics/btaa1039

4. J. Liu, et al. Bayesian structural time series for biomedical sensor data: A flexible modeling framework for evaluating interventions. PLoS Comput Biol (2021) https:/doi.org/10.1371/journal.pcbi.1009303

5. G.I. Ash, et al. Establishing a Global Standard for Wearable Devices in Sport and Exercise Medicine: Perspectives from Academic and Industry Stakeholders. Sports Med (2021) https:/doi.org/10.1007/s40279-021-01543-5

6. G. Gürsoy G, et al. Functional genomics data: privacy risk assessment and technological mitigation. Nat Rev Genet (2021) https:/doi.org/10.1038/s41576-021-00428-7

7. P.S. Emani, et al. Quantum computing at the frontiers of biological sciences. Nat Methods (2021) https:/doi.org/10.1038/s41592-020-01004-3

8. G. Gürsoy, et al. Recovering genotypes and phenotypes using allele-specific genes. Genome Biol (2021) https:/doi.org/10.1186/s13059-021-02477-x

9. A.J. Armstrong, et al. Molecular medicine tumor board: whole-genome sequencing to inform on personalized medicine for a man with advanced prostate cancer. Prostate Cancer Prostatic Dis (2021) https:/doi.org/10.1038/s41391-021-00324-5

10. X. Li, et al. Whole-genome sequencing of phenotypically distinct inflammatory breast cancers reveals similar genomic alterations to non-inflammatory breast cancers. Genome Med (2021) https:/doi.org/10.1186/s13073-021-00879-x

11. H. Mohsen, et al. Network propagation-based prioritization of long tail genes in 17 cancer types. Genome Biol (2021) https:/doi.org/10.1186/s13059-021-02504-x

12. S. Lou, et al. Gene Tracer: A smart, interactive, voice-controlled Alexa skill for gene information retrieval and browsing, mutation annotation, and network visualization. Bioinformatics (2021) https:/doi.org/10.1093/bioinformatics/btab107


Functional genomics data: privacy risk assessment and technological mitigation.
G Gursoy, T Li, S Liu, E Ni, CM Brannon, MB Gerstein (2021). Nat Rev Genet 23: 245-258.

Privacy-preserving genotype imputation with fully homomorphic encryption.
G Gursoy, E Chielle, CM Brannon, M Maniatakos, M Gerstein (2021). Cell Syst 13: 173-182e3.

DECODE: a Deep-learning framework for Condensing enhancers and refining boundaries with large-scale functional assays.
Z Chen, J Zhang, J Liu, Y Dai, D Lee, MR Min, M Xu, M Gerstein (2021). Bioinformatics 37: i280-i288.

Recovering genotypes and phenotypes using allele-specific genes.
G Gursoy, N Lu, S Wagner, M Gerstein (2021). Genome Biol 22: 263.

Network propagation-based prioritization of long tail genes in 17 cancer types.
H Mohsen, V Gunasekharan, T Qing, M Seay, Y Surovtseva, S Negahban, Z Szallasi, L Pusztai, MB Gerstein (2021). Genome Biol 22: 287.

Establishing a Global Standard for Wearable Devices in Sport and Exercise Medicine: Perspectives from Academic and Industry Stakeholders.
GI Ash, M Stults-Kolehmainen, MA Busa, AE Gaffey, K Angeloudis, B Muniz-Pardos, R Gregory, RA Huggins, NS Redeker, SA Weinzimer, LA Grieco, K Lyden, E Megally, I Vogiatzis, L Scher, X Zhu, JS Baker, C Brandt, MS Businelle, LM Fucito, S Griggs, R Jarrin, BJ Mortazavi, T Prioleau, W Roberts, EK Spanakis, LM Nally, A Debruyne, N Bachl, F Pigozzi, F Halabchi, DA Ramagole, DC Janse van Rensburg, B Wolfarth, C Fossati, S Rozenstoka, K Tanisawa, M Borjesson, JA Casajus, A Gonzalez-Aguero, I Zelenkova, J Swart, G Gursoy, W Meyerson, J Liu, D Greenbaum, YP Pitsiladis, MB Gerstein (2021). Sports Med 51: 2237-2250.

Bayesian structural time series for biomedical sensor data: A flexible modeling framework for evaluating interventions.
J Liu, DJ Spakowicz, GI Ash, R Hoyd, R Ahluwalia, A Zhang, S Lou, D Lee, J Zhang, C Presley, A Greene, M Stults-Kolehmainen, LM Nally, JS Baker, LM Fucito, SA Weinzimer, AV Papachristos, M Gerstein (2021). PLoS Comput Biol 17: e1009303.

Whole-genome sequencing of phenotypically distinct inflammatory breast cancers reveals similar genomic alterations to non-inflammatory breast cancers.
X Li, S Kumar, A Harmanci, S Li, RR Kitchen, Y Zhang, VB Wali, SM Reddy, WA Woodward, JM Reuben, J Rozowsky, C Hatzis, NT Ueno, S Krishnamurthy, L Pusztai, M Gerstein (2021). Genome Med 13: 70.

Gene Tracer: a smart, interactive, voice-controlled Alexa skill For gene information retrieval and browsing, mutation annotation and network visualization.
S Lou, T Li, J Liu, M Gerstein (2021). Bioinformatics 37: 2998-3000.

SCAN-ATAC-Sim: a scalable and efficient method for simulating single-cell ATAC-seq data from bulk-tissue experiments.
Z Chen, J Zhang, J Liu, Z Zhang, J Zhu, D Lee, M Xu, M Gerstein (2021). Bioinformatics 37: 1756-1758.

Molecular medicine tumor board: whole-genome sequencing to inform on personalized medicine for a man with advanced prostate cancer.
AJ Armstrong, X Li, M Tucker, S Li, XJ Mu, KW Eng, A Sboner, M Rubin, M Gerstein (2021). Prostate Cancer Prostatic Dis 24: 786-793.

Quantum computing at the frontiers of biological sciences
PS Emani, J Warrell, A Anticevic, S Bekiranov, M Gandal, MJ McConnell, G Sapiro, A Aspuru-Guzik, JT Baker, M Bastiani, JD Murray, SN Sotiropoulos, J Taylor, G Senthil, T Lehner, MB Gerstein, AW Harrow (2021). Nat Methods 18: 701-709.


Return to front page