Summary of Gerstein Lab Research in 2018

In 2018, the Gerstein lab worked on numerous research projects in biomedical data science, particularly in human genomics, next-generation sequencing, genomic privacy, and text mining.

Core Publications – PsychENCODE

A large part of the lab’s research focuses on disease genomics, and more specifically on the differential expression and function of genes in brain diseases. The lab is a member of a large consortium (PsychENCODE), for which it led a major analysis project. Using a deep-learning model, our analysis uncovered genomic elements in the brain associated with psychiatric disorders (Wang et al., 2018). The analysis also produced lists of brain-specific enhancers, eQTLs, and regulatory networks.

Other Core Publications

We also developed PrivaSig, a tool that identifies information leakage from functional genomics signal profiles (Harmanci & Gerstein, 2018). Genomic privacy, leakage detection, and anonymization of RNA-seq profiles are increasingly important in the era of human genomics and next-generation sequencing. Finally, also in the field of human genomics, we developed a catalog of predicted functional upstream open reading frames (uORFs) in humans (McGillivray et al., 2018). Latent uORFs in mRNA transcripts can modify the translation of coding sequences by altering ribosome activity. By building a simple Bayesian classifier using 89 attributes of uORFs, we were able to extrapolate to a comprehensive catalog of likely functional uORFs.
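To illustrate the general idea of a simple Bayesian classifier over uORF attributes, here is a minimal sketch of a Gaussian naive Bayes model in plain Python. It is not the published method: the naive independence assumption, the Gaussian likelihoods, the two hypothetical attributes (the actual work used 89), and the toy training values are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(rows, labels):
    """Estimate a class prior plus per-attribute mean and variance for each class."""
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    total = len(rows)
    model = {}
    for label, members in by_class.items():
        n = len(members)
        columns = list(zip(*members))
        means = [sum(col) / n for col in columns]
        # A small constant keeps the variance strictly positive.
        variances = [
            sum((x - m) ** 2 for x in col) / n + 1e-9
            for col, m in zip(columns, means)
        ]
        model[label] = (n / total, means, variances)
    return model


def posterior(model, row):
    """Return normalized class posteriors for one attribute vector."""
    log_scores = {}
    for label, (prior, means, variances) in model.items():
        score = math.log(prior)
        for x, m, v in zip(row, means, variances):
            score += -0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
        log_scores[label] = score
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_scores.values())
    weights = {label: math.exp(s - top) for label, s in log_scores.items()}
    norm = sum(weights.values())
    return {label: w / norm for label, w in weights.items()}


# Made-up toy data: two hypothetical attributes per uORF (not real measurements).
train_rows = [
    [0.90, 0.80], [0.80, 0.90], [0.85, 0.70],   # labeled functional
    [0.10, 0.20], [0.20, 0.10], [0.15, 0.30],   # labeled non-functional
]
train_labels = ["functional"] * 3 + ["non_functional"] * 3

model = fit_gaussian_nb(train_rows, train_labels)
result = posterior(model, [0.80, 0.75])
print(result)
```

In this sketch, a model fitted on a small labeled set scores unlabeled candidates, and the posterior for the "functional" class serves as the ranking score. That is the sense in which a classifier can extrapolate from known examples to a larger catalog.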

Book reviews, opinions, and commentary

In 2018, the Gerstein lab participated in public scientific discourse through book reviews, opinion articles, and commentaries. In Science, Dov Greenbaum and Mark Gerstein reviewed “21 Lessons for the 21st Century” by Yuval Noah Harari (Greenbaum & Gerstein, 2018). In Cell, they reviewed “Who We Are and How We Got Here: Ancient DNA and the New Science of the Human Past” by David Reich (Greenbaum & Gerstein, 2018). We also published an article on the relationship between text mining and systems biology. For example, examining the frequencies of terms in systems biology publications makes it possible to analyze trends in research focus (Kong & Gerstein, 2018). Finally, we wrote a data-science-oriented newspaper op-ed.
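The term-frequency idea mentioned above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name, the tokenization rule, and the four publication titles are hypothetical and made up for the example, not taken from the published analysis.

```python
import re
from collections import Counter


def term_frequency_by_year(records, term):
    """Return, for each year, the fraction of word tokens equal to `term`."""
    totals = Counter()
    hits = Counter()
    target = term.lower()
    for year, text in records:
        # Lowercase and split into alphanumeric tokens, keeping hyphenated words whole.
        tokens = re.findall(r"[a-z0-9\-]+", text.lower())
        totals[year] += len(tokens)
        hits[year] += tokens.count(target)
    return {year: hits[year] / totals[year] for year in sorted(totals) if totals[year]}


# Made-up titles standing in for a corpus of systems biology publications.
records = [
    (2008, "gene regulatory network inference"),
    (2008, "metabolic model of yeast"),
    (2018, "single-cell network analysis"),
    (2018, "deep network model"),
]

trend = term_frequency_by_year(records, "network")
print(trend)
```

Comparing the per-year fractions for a term gives a rough indication of whether it is becoming more or less prominent. A real analysis would need a much larger corpus and some normalization of term variants.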


Revealing the brain's molecular architecture.
PsychENCODE Consortium (2018). Science 362: 1262-1263.

Comprehensive functional genomic resource and integrative model for the human brain.
D Wang, S Liu, J Warrell, H Won, X Shi, FCP Navarro, D Clarke, M Gu, P Emani, YT Yang, M Xu, MJ Gandal, S Lou, J Zhang, JJ Park, C Yan, SK Rhie, K Manakongtreecheep, H Zhou, A Nathan, M Peters, E Mattei, D Fitzgerald, T Brunetti, J Moore, Y Jiang, K Girdhar, GE Hoffman, S Kalayci, ZH Gumus, GE Crawford, PsychENCODE Consortium, P Roussos, S Akbarian, AE Jaffe, KP White, Z Weng, N Sestan, DH Geschwind, JA Knowles, MB Gerstein (2018). Science 362.

Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder.
MJ Gandal, P Zhang, E Hadjimichael, RL Walker, C Chen, S Liu, H Won, H van Bakel, M Varghese, Y Wang, AW Shieh, J Haney, S Parhami, J Belmont, M Kim, P Moran Losada, Z Khan, J Mleczko, Y Xia, R Dai, D Wang, YT Yang, M Xu, K Fish, PR Hof, J Warrell, D Fitzgerald, K White, AE Jaffe, PsychENCODE Consortium, MA Peters, M Gerstein, C Liu, LM Iakoucheva, D Pinto, DH Geschwind (2018). Science 362.

Integrative functional genomic analysis of human brain development and neuropsychiatric risks.
M Li, G Santpere, Y Imamura Kawasawa, OV Evgrafov, FO Gulden, S Pochareddy, SM Sunkin, Z Li, Y Shin, Y Zhu, AMM Sousa, DM Werling, RR Kitchen, HJ Kang, M Pletikos, J Choi, S Muchnik, X Xu, D Wang, B Lorente-Galdos, S Liu, P Giusti-Rodriguez, H Won, CA de Leeuw, AF Pardinas, BrainSpan Consortium, PsychENCODE Consortium, PsychENCODE Developmental Subgroup, M Hu, F Jin, Y Li, MJ Owen, MC O'Donovan, JTR Walters, D Posthuma, MA Reimers, P Levitt, DR Weinberger, TM Hyde, JE Kleinman, DH Geschwind, MJ Hawrylycz, MW State, SJ Sanders, PF Sullivan, MB Gerstein, ES Lein, JA Knowles, N Sestan (2018). Science 362.

Transcriptome and epigenome landscape of human cortical development modeled in organoids.
A Amiri, G Coppola, S Scuderi, F Wu, T Roychowdhury, F Liu, S Pochareddy, Y Shin, A Safi, L Song, Y Zhu, AMM Sousa, PsychENCODE Consortium, M Gerstein, GE Crawford, N Sestan, A Abyzov, FM Vaccarino (2018). Science 362.

Text mining systems biology: Turning the microscope back on the observer.
X Kong, M Gerstein (2018). Current Opinion in Systems Biology 11:117-122.

What’s next for humanity?
D Greenbaum, M Gerstein (2018). Science 362 (6415):648.

Human History, Human Genomes.
D Greenbaum, M Gerstein (2018). Cell 174:1043-1044.

Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions.
A Harmanci, M Gerstein (2018). Nat Commun 9: 2453.

Network Analysis as a Grand Unifier in Biomedical Data Science.
P McGillivray, D Clarke, W Meyerson, J Zhang, D Lee, M Gu, S Kumar, H Zhou, MB Gerstein (2018). Annual Review of Biomedical Data Science Vol. 1.

A comprehensive catalog of predicted functional upstream open reading frames in humans.
P McGillivray, R Ault, M Pawashe, R Kitchen, S Balasubramanian, M Gerstein (2018). Nucleic Acids Res 46: 3326-3338.

Gene names can confound most-searched listings.
MB Gerstein, FCP Navarro (2018). Nature 553: 405.

