Coretools

The tools listed below are actively maintained by the lab.

LESSeq: Local event-based analysis of alternative splicing using RNA-Seq data.

J Leng, CJF Cameron, S Oh, E Khurana, JP Noonan, MB Gerstein (2019). bioRxiv.

Network propagation-based prioritization of long tail genes in 17 cancer types.

H Mohsen, V Gunasekharan, T Qing, M Seay, Y Surovtseva, S Negahban, Z Szallasi, L Pusztai, MB Gerstein (2021). Genome Biol 22: 287.

website

preprint

medline

STARRPeaker: uniform processing and accurate identification of STARR-seq active regions.

D Lee, M Shi, J Moran, M Wall, J Zhang, J Liu, D Fitzgerald, Y Kyono, L Ma, KP White, M Gerstein (2020). Genome Biol 21: 298.

website

preprint

medline

SVFX: a machine learning framework to quantify the pathogenicity of structural variants.

S Kumar, A Harmanci, J Vytheeswaran, MB Gerstein (2020). Genome Biol 21: 274.

website

preprint

medline

RADAR: annotation and prioritization of variants in the post-transcriptional regulome of RNA-binding proteins.

J Zhang, J Liu, D Lee, JJ Feng, L Lochovsky, S Lou, M Rutenberg-Schoenberg, M Gerstein (2020). Genome Biol 21: 151.

website

preprint

medline

Supervised enhancer prediction with epigenetic pattern recognition and targeted validation.

A Sethi, M Gu, E Gumusgoz, L Chan, KK Yan, J Rozowsky, I Barozzi, V Afzal, JA Akiyama, I Plajzer-Frick, C Yan, CS Novak, M Kato, TH Garvin, Q Pham, A Harrington, BJ Mannion, EA Lee, Y Fukuda-Yuzawa, A Visel, DE Dickel, KY Yip, R Sutton, LA Pennacchio, M Gerstein (2020). Nat Methods 17: 807-814.

website

preprint

medline

Using sigLASSO to optimize cancer mutation signatures jointly with sampling likelihood.

S Li, FW Crawford, MB Gerstein (2020). Nat Commun 11: 3575.

website

preprint

medline

TopicNet: a framework for measuring transcriptional regulatory network change.

S Lou, T Li, X Kong, J Zhang, J Liu, D Lee, M Gerstein (2020). Bioinformatics 36: i474-i481.

website

preprint

medline

Epigenome-based splicing prediction using a recurrent neural network.

D Lee, J Zhang, J Liu, M Gerstein (2020). PLoS Comput Biol 16: e1008006.

website

preprint

medline

GRAM: A GeneRAlized Model to predict the molecular effect of a non-coding variant in a cell-type specific manner.

S Lou, KA Cotter, T Li, J Liang, H Mohsen, J Liu, J Zhang, S Cohen, J Xu, H Yu, MA Rubin, M Gerstein (2019). PLoS Genet 15: e1007860.

website

medline

Building a Hybrid Physical-Statistical Classifier for Predicting the Effect of Variants Related to Protein-Drug Interactions.

B Wang, C Yan, S Lou, P Emani, B Li, M Xu, X Kong, W Meyerson, YT Yang, D Lee, M Gerstein (2019). Structure 27: 1469-1481.e3.

website

preprint

medline

exceRpt: A Comprehensive Analytic Platform for Extracellular RNA Profiling.

J Rozowsky, RR Kitchen, JJ Park, TR Galeev, J Diao, J Warrell, W Thistlethwaite, SL Subramanian, A Milosavljevic, M Gerstein (2019). Cell Syst 8: 352-357.e3.

website

medline

Analysis of sensitive information leakage in functional genomics signal profiles through genomic deletions.

A Harmanci, M Gerstein (2018). Nat Commun 9: 2453.

website

medline

A comprehensive catalog of predicted functional upstream open reading frames in humans.

P McGillivray, R Ault, M Pawashe, R Kitchen, S Balasubramanian, M Gerstein (2018). Nucleic Acids Res 46: 3326-3338.

website

medline

MOAT: efficient detection of highly mutated regions with the Mutations Overburdening Annotations Tool.

L Lochovsky, J Zhang, M Gerstein (2017). Bioinformatics 34: 1031-1033.

website

preprint

medline

Using ALoFT to determine the impact of putative loss-of-function variants in protein-coding genes.

S Balasubramanian, Y Fu, M Pawashe, P McGillivray, M Jin, J Liu, KJ Karczewski, DG MacArthur, M Gerstein (2017). Nat Commun 8: 382.

website

medline

MrTADFinder: A network modularity based approach to identify topologically associating domains in multiple resolutions.

KK Yan, S Lou, M Gerstein (2017). PLoS Comput Biol 13: e1005647.

website

medline

Landscape and variation of novel retroduplications in 26 human populations.

Y Zhang, S Li, A Abyzov, MB Gerstein (2017). PLoS Comput Biol 13: e1005567.

website

medline

HiC-spector: a matrix library for spectral and reproducibility analysis of Hi-C contact maps.

KK Yan, GG Yardimci, C Yan, WS Noble, M Gerstein (2017). Bioinformatics 33: 2199-2201.

website

preprint

medline

Intensification: A Resource for Amplifying Population-Genetic Signals with Protein Repeats.

J Chen, B Wang, L Regan, M Gerstein (2016). J Mol Biol 429: 435-445.

website

medline

DREISS: Using State-Space Models to Infer the Dynamics of Gene Expression Driven by External and Internal Regulatory Networks.

D Wang, F He, S Maslov, M Gerstein (2016). PLoS Comput Biol 12: e1005146.

website

medline

A uniform survey of allele-specific binding and expression over 1000-Genomes-Project individuals.

J Chen, J Rozowsky, TR Galeev, A Harmanci, R Kitchen, J Bedford, A Abyzov, Y Kong, L Regan, M Gerstein (2016). Nat Commun 7: 11101.

website

preprint

medline

Identifying Allosteric Hotspots with Dynamics: Application to Inter- and Intra-species Conservation.

D Clarke, A Sethi, S Li, S Kumar, RWF Chang, J Chen, M Gerstein (2016). Structure 24: 826-837.

website

preprint

medline

Quantification of private information leakage from phenotype-genotype data: linking attacks.

A Harmanci, M Gerstein (2016). Nat Methods 13: 251-6.

website

preprint

medline

LARVA: an integrative framework for large-scale analysis of recurrent variants in noncoding annotations.

L Lochovsky, J Zhang, Y Fu, E Khurana, M Gerstein (2015). Nucleic Acids Res 43: 8123-34.

website

medline

Loregic: a method to characterize the cooperative logic of regulatory factors.

D Wang, KK Yan, C Sisu, C Cheng, J Rozowsky, W Meyerson, MB Gerstein (2015). PLoS Comput Biol 11: e1004132.

website

medline

MUSIC: identification of enriched regions in ChIP-Seq experiments using a mappability-corrected multiscale signal processing framework.

A Harmanci, J Rozowsky, M Gerstein (2014). Genome Biol 15: 474.

website

medline

FunSeq2: a framework for prioritizing noncoding regulatory variants in cancer.

Y Fu, Z Liu, S Lou, J Bedford, XJ Mu, KY Yip, E Khurana, M Gerstein (2014). Genome Biol 15: 480.

website

medline

Integrative annotation of variants from 1092 humans: application to cancer genomics.

E Khurana, Y Fu, V Colonna, XJ Mu, HM Kang, T Lappalainen, A Sboner, L Lochovsky, J Chen, A Harmanci, J Das, A Abyzov, S Balasubramanian, K Beal, D Chakravarty, D Challis, Y Chen, D Clarke, L Clarke, F Cunningham, US Evani, P Flicek, R Fragoza, E Garrison, R Gibbs, ZH Gumus, J Herrero, N Kitabayashi, Y Kong, K Lage, V Liluashvili, SM Lipkin, DG MacArthur, G Marth, D Muzny, TH Pers, GRS Ritchie, JA Rosenfeld, C Sisu, X Wei, M Wilson, Y Xue, F Yu, 1000 Genomes Project Consortium, ET Dermitzakis, H Yu, MA Rubin, C Tyler-Smith, M Gerstein (2013). Science 342: 1235587.

website

preprint

medline

VAT: a computational framework to functionally annotate variants in personal genomes within a cloud-computing environment.

L Habegger, S Balasubramanian, DZ Chen, E Khurana, A Sboner, A Harmanci, J Rozowsky, D Clarke, M Snyder, M Gerstein (2012). Bioinformatics 28: 2267-9.

website

medline

IQSeq: integrated isoform quantification analysis based on next-generation sequencing.

J Du, J Leng, L Habegger, A Sboner, D McDermott, M Gerstein (2012). PLoS One 7: e29175.

website

medline

Integration of protein motions with molecular networks reveals different mechanisms for permanent and transient interactions.

N Bhardwaj, A Abyzov, D Clarke, C Shou, MB Gerstein (2011). Protein Sci 20: 1745-54.

website

medline

AlleleSeq: analysis of allele-specific expression and binding in a network framework.

J Rozowsky, A Abyzov, J Wang, P Alves, D Raha, A Harmanci, J Leng, R Bjornson, Y Kong, N Kitabayashi, N Bhardwaj, M Rubin, M Snyder, M Gerstein (2011). Mol Syst Biol 7: 522.

website

medline

ACT: aggregation and correlation toolbox for analyses of genome tracks.

J Jee, J Rozowsky, KY Yip, L Lochovsky, R Bjornson, G Zhong, Z Zhang, Y Fu, J Wang, Z Weng, M Gerstein (2011). Bioinformatics 27: 1152-4.

website

medline

CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing.

A Abyzov, AE Urban, M Snyder, M Gerstein (2011). Genome Res 21: 974-84.

website

medline

AGE: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision.

A Abyzov, M Gerstein (2011). Bioinformatics 27: 595-603.

website

medline

RSEQtools: a modular framework to analyze RNA-Seq data using compact, anonymized data summaries.

L Habegger, A Sboner, TA Gianoulis, J Rozowsky, A Agarwal, M Snyder, M Gerstein (2011). Bioinformatics 27: 281-3.

website

medline

FusionSeq: a modular framework for finding gene fusions by analyzing paired-end RNA-sequencing data.

A Sboner, L Habegger, D Pflueger, S Terry, DZ Chen, JS Rozowsky, AK Tewari, N Kitabayashi, BJ Moss, MS Chee, F Demichelis, MA Rubin, MB Gerstein (2010). Genome Biol 11: R104.

website

medline

3V: cavity, channel and cleft volume calculator and extractor.

NR Voss, M Gerstein (2010). Nucleic Acids Res 38: W555-62.

website

preprint

medline

MOTIPS: automated motif analysis for predicting targets of modular protein domains.

HY Lam, PM Kim, J Mok, R Tonikian, SS Sidhu, BE Turk, M Snyder, MB Gerstein (2010). BMC Bioinformatics 11: 243.

website

preprint

medline

Relating protein conformational changes to packing efficiency and disorder.

N Bhardwaj, M Gerstein (2009). Protein Sci 18: 1230-40.

website

preprint

medline

PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data.

JO Korbel, A Abyzov, XJ Mu, N Carriero, P Cayting, Z Zhang, M Snyder, MB Gerstein (2009). Genome Biol 10: R23.

website

preprint

medline

PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.

J Rozowsky, G Euskirchen, RK Auerbach, ZD Zhang, T Gibson, R Bjornson, N Carriero, M Snyder, MB Gerstein (2009). Nat Biotechnol 27: 66-75.

website

preprint

medline

An integrated system for studying residue coevolution in proteins.

KY Yip, P Patel, PM Kim, DM Engelman, D McDermott, M Gerstein (2008). Bioinformatics 24: 290-2.

website

preprint

medline

PARE: a tool for comparing protein abundance and mRNA expression data.

EZ Yu, AE Burba, M Gerstein (2007). BMC Bioinformatics 8: 309.

website

preprint

medline

Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation.

JE Karro, Y Yan, D Zheng, Z Zhang, N Carriero, P Cayting, P Harrison, M Gerstein (2007). Nucleic Acids Res 35: D55-60.

website

preprint

medline

Helix Interaction Tool (HIT): a web-based tool for analysis of helix-helix interactions in proteins.

AE Burba, U Lehnert, EZ Yu, M Gerstein (2006). Bioinformatics 22: 2735-8.

website

preprint

medline

The tYNA platform for comparative interactomics: a web tool for managing, comparing and mining multiple networks.

KY Yip, H Yu, PM Kim, M Schultz, M Gerstein (2006). Bioinformatics 22: 2968-70.

website

preprint

medline

The Database of Macromolecular Motions: new features added at the decade mark.

S Flores, N Echols, D Milburn, B Hespenheide, K Keating, J Lu, S Wells, EZ Yu, M Thorpe, M Gerstein (2006). Nucleic Acids Res 34: D296-301.

website

preprint

medline

PubNet: a flexible system for visualizing literature derived networks.

SM Douglas, GT Montelione, M Gerstein (2005). Genome Biol 6: R80.

website

preprint

medline

Calculation of standard atomic volumes for RNA and comparison with proteins: RNA is packed more tightly.

NR Voss, M Gerstein (2005). J Mol Biol 346: 477-92.

website

preprint

medline

TopNet: a tool for comparing biological sub-networks, correlating protein properties with topological statistics.

H Yu, X Zhu, D Greenbaum, J Karro, M Gerstein (2004). Nucleic Acids Res 32: 328-37.

website

preprint

medline

MolMovDB: analysis and visualization of conformational change and structural flexibility.

N Echols, D Milburn, M Gerstein (2003). Nucleic Acids Res 31: 478-82.

website

preprint

medline

Calculations of protein volumes: sensitivity analysis and parameter database.

J Tsai, M Gerstein (2002). Bioinformatics 18: 985-95.

website

preprint

medline

Determining the minimum number of types necessary to represent the sizes of protein atoms.

J Tsai, N Voss, M Gerstein (2001). Bioinformatics 17: 949-56.

website

preprint

medline

Protein Geometry: Distances, Areas, and Volumes.

M Gerstein, FM Richards (2001). International Tables for Crystallography (Volume F, Chapter 22.1.1, pages 531-539; M Rossmann & E Arnold, editors; Dordrecht: Kluwer).

website

preprint

link

The morph server: a standardized system for analyzing and visualizing macromolecular motions in a database framework.

WG Krebs, M Gerstein (2000). Nucleic Acids Res 28: 1665-75.

website

preprint

medline

A database of macromolecular motions.

M Gerstein, W Krebs (1998). Nucleic Acids Res 26: 4280-90.

website

preprint