Structural variants (SV), usually defined as variation from kilobase to megabase including insertions, duplications, and inversions, presumably contribute to more base-pair differences between individuals than SNPs and have considerable effects on human phenotypic variation such as disease. Copy number variation (CNV), for example, is a type of structural variation exhibiting difference in copy number between two or more individuals. When fixed in population, it becomes what is known as Segmental Duplication (SD). Due to the biological significance and importance of SV implicated in recent findings, it has become one of the focuses in genetics lately. To this end, we have developed different experimental and computational approaches to detect and analyze SVs, particularly in the human genome.

Related Material:

2004-early 2009 Report [ html ]
Mid 2009 Report [ html ] [ doc ]


Complex genetic variation in nearly complete human genomes
GA Logsdon, P Ebert, PA Audano, M Loftus, D Porubsky, J Ebler, F Yilmaz, P Hallast, T Prodanov, D Yoo, CA Paisie, WT Harvey, X Zhao, GV Martino, M Henglin, KM Munson, K Rabbani, CS Chin, B Gu, H Ashraf, O Austine-Orimoloye, P Balachandran, MJ Bonder, H Cheng, Z Chong, J Crabtree, M Gerstein, LA Guethlein, P Hasenfeld, G Hickey, K Hoekzema, SE Hunt, M Jensen, Y Jiang, S Koren, Y Kwon, C Li, H Li, J Li, PJ Norman, KK Oshima, B Paten, AM Phillippy, NR Pollock, T Rausch, M Rautiainen, S Scholz, Y Song, A Soylev, A Sulovari, L Surapaneni, V Tsapalou, W Zhou, Y Zhou, Q Zhu, MC Zody, RE Mills, SE Devine, X Shi, ME Talkowski, MJP Chaisson, AT Dilthey, MK Konkel, JO Korbel, C Lee, CR Beck, EE Eichler, T Marschall (Preprint). bioRxiv.

Recurrent repeat expansions in human cancer genomes
GS Erwin, G Gursoy, R Al-Abri, A Suriyaprakash, E Dolzhenko, K Zhu, CR Hoerner, SM White, L Ramirez, A Vadlakonda, A Vadlakonda, K von Kraut, J Park, CM Brannon, DA Sumano, RA Kirtikar, AA Erwin, TJ Metzner, RKC Yuen, AC Fan, JT Leppert, MA Eberle, M Gerstein, MP Snyder (2023). Nature 613: 96-102.

Landscape and variation of novel retroduplications in 26 human populations.
Y Zhang, S Li, A Abyzov, MB Gerstein (2017). PLoS Comput Biol 13: e1005567.

An integrated map of structural variation in 2,504 human genomes.
PH Sudmant, T Rausch, EJ Gardner, RE Handsaker, A Abyzov, J Huddleston, Y Zhang, K Ye, G Jun, MH Fritz, MK Konkel, A Malhotra, AM Stutz, X Shi, FP Casale, J Chen, F Hormozdiari, G Dayama, K Chen, M Malig, MJP Chaisson, K Walter, S Meiers, S Kashin, E Garrison, A Auton, HYK Lam, XJ Mu, C Alkan, D Antaki, T Bae, E Cerveira, P Chines, Z Chong, L Clarke, E Dal, L Ding, S Emery, X Fan, M Gujral, F Kahveci, JM Kidd, Y Kong, EW Lameijer, S McCarthy, P Flicek, RA Gibbs, G Marth, CE Mason, A Menelaou, DM Muzny, BJ Nelson, A Noor, NF Parrish, M Pendleton, A Quitadamo, B Raeder, EE Schadt, M Romanovitch, A Schlattl, R Sebra, AA Shabalin, A Untergasser, JA Walker, M Wang, F Yu, C Zhang, J Zhang, X Zheng-Bradley, W Zhou, T Zichner, J Sebat, MA Batzer, SA McCarroll, 1000 Genomes Project Consortium, RE Mills, MB Gerstein, A Bashir, O Stegle, SE Devine, C Lee, EE Eichler, JO Korbel (2015). Nature 526: 75-81.

Analysis of deletion breakpoints from 1,092 humans reveals details of mutation mechanisms.
A Abyzov, S Li, DR Kim, M Mohiyuddin, AM Stutz, NF Parrish, XJ Mu, W Clark, K Chen, M Hurles, JO Korbel, HY Lam, C Lee, MB Gerstein (2015). Nat Commun 6: 7256.

MetaSV: an accurate and integrative structural-variant caller for next generation sequencing.
M Mohiyuddin, JC Mu, J Li, N Bani Asadi, MB Gerstein, A Abyzov, WH Wong, HY Lam (2015). Bioinformatics 31: 2741-4.

Analysis of variable retroduplications in human populations suggests coupling of retrotransposition to cell division.
A Abyzov, R Iskow, O Gokcumen, DW Radke, S Balasubramanian, B Pei, L Habegger, 1000 Genomes Project Consortium, C Lee, M Gerstein (2013). Genome Res 23: 2042-52.

Child development and structural variation in the human genome.
Y Zhang, R Haraksingh, F Grubert, A Abyzov, M Gerstein, S Weissman, AE Urban (2013). Child Dev 84: 34-48.

Somatic copy number mosaicism in human skin revealed by induced pluripotent stem cells.
A Abyzov, J Mariani, D Palejev, Y Zhang, MS Haney, L Tomasini, AF Ferrandino, LA Rosenberg Belmaker, A Szekely, M Wilson, A Kocabas, NE Calixto, EL Grigorenko, A Huttner, K Chawarska, S Weissman, AE Urban, M Gerstein, FM Vaccarino (2012). Nature 492: 438-42.

Genome-wide mapping of copy number variation in humans: comparative analysis of high resolution array platforms.
RR Haraksingh, A Abyzov, M Gerstein, AE Urban, M Snyder (2011). PLoS One 6: e27859.

AlleleSeq: analysis of allele-specific expression and binding in a network framework.
J Rozowsky, A Abyzov, J Wang, P Alves, D Raha, A Harmanci, J Leng, R Bjornson, Y Kong, N Kitabayashi, N Bhardwaj, M Rubin, M Snyder, M Gerstein (2011). Mol Syst Biol 7: 522.

Identification of genomic indels and structural variations using split reads.
ZD Zhang, J Du, H Lam, A Abyzov, AE Urban, M Snyder, M Gerstein (2011). BMC Genomics 12: 375.

Analysis of genomic variation in non-coding elements using population-scale sequencing data from the 1000 Genomes Project.
XJ Mu, ZJ Lu, Y Kong, HY Lam, MB Gerstein (2011). Nucleic Acids Res 39: 7058-76.

CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing.
A Abyzov, AE Urban, M Snyder, M Gerstein (2011). Genome Res 21: 974-84.

Mapping copy number variation by population-scale genome sequencing.
RE Mills, K Walter, C Stewart, RE Handsaker, K Chen, C Alkan, A Abyzov, SC Yoon, K Ye, RK Cheetham, A Chinwalla, DF Conrad, Y Fu, F Grubert, I Hajirasouliha, F Hormozdiari, LM Iakoucheva, Z Iqbal, S Kang, JM Kidd, MK Konkel, J Korn, E Khurana, D Kural, HY Lam, J Leng, R Li, Y Li, CY Lin, R Luo, XJ Mu, J Nemesh, HE Peckham, T Rausch, A Scally, X Shi, MP Stromberg, AM Stutz, AE Urban, JA Walker, J Wu, Y Zhang, ZD Zhang, MA Batzer, L Ding, GT Marth, G McVean, J Sebat, M Snyder, J Wang, K Ye, EE Eichler, MB Gerstein, ME Hurles, C Lee, SA McCarroll, JO Korbel, 1000 Genomes Project (2011). Nature 470: 59-65.

AGE: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision.
A Abyzov, M Gerstein (2011). Bioinformatics 27: 595-603.

Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian model.
ZD Zhang, MB Gerstein (2010). BMC Bioinformatics 11: 539.

Segmental duplications in the human genome reveal details of pseudogene formation.
E Khurana, HY Lam, C Cheng, N Carriero, P Cayting, MB Gerstein (2010). Nucleic Acids Res 38: 6997-7007.

Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint library.
HY Lam, XJ Mu, AM Stutz, A Tanzer, PD Cayting, M Snyder, PM Kim, JO Korbel, MB Gerstein (2010). Nat Biotechnol 28: 47-55.

PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data.
JO Korbel, A Abyzov, XJ Mu, N Carriero, P Cayting, Z Zhang, M Snyder, MB Gerstein (2009). Genome Biol 10: R23.

MSB: a mean-shift-based approach for the analysis of structural variation in the genome.
LY Wang, A Abyzov, JO Korbel, M Snyder, M Gerstein (2009). Genome Res 19: 106-17.

High-resolution copy-number variation map reflects human olfactory receptor diversity and evolution.
Y Hasin, T Olender, M Khen, C Gonzaga-Jauregui, PM Kim, AE Urban, M Snyder, MB Gerstein, D Lancet, JO Korbel (2008). PLoS Genet 4: e1000249.

Analysis of copy number variants and segmental duplications in the human genome: Evidence for a change in the process of formation in recent evolutionary history.
PM Kim, HY Lam, AE Urban, JO Korbel, J Affourtit, F Grubert, X Chen, S Weissman, M Snyder, MB Gerstein (2008). Genome Res 18: 1865-74.

The current excitement about copy-number variation: how it relates to gene duplications and protein families.
JO Korbel, PM Kim, X Chen, AE Urban, S Weissman, M Snyder, MB Gerstein (2008). Curr Opin Struct Biol 18: 366-74.

Positive selection at the protein network periphery: evaluation in terms of structural constraints and cellular context.
PM Kim, JO Korbel, MB Gerstein (2007). Proc Natl Acad Sci U S A 104: 20274-9.

Paired-end mapping reveals extensive structural variation in the human genome.
JO Korbel, AE Urban, JP Affourtit, B Godwin, F Grubert, JF Simons, PM Kim, D Palejev, NJ Carriero, L Du, BE Taillon, Z Chen, A Tanzer, AC Saunders, J Chi, F Yang, NP Carter, ME Hurles, SM Weissman, TT Harkins, MB Gerstein, M Egholm, M Snyder (2007). Science 318: 420-6.

Systematic prediction and validation of breakpoints associated with copy-number variants in the human genome.
JO Korbel, AE Urban, F Grubert, J Du, TE Royce, P Starr, G Zhong, BS Emanuel, SM Weissman, M Snyder, MB Gerstein (2007). Proc Natl Acad Sci U S A 104: 10110-5.

Assessing the need for sequence-based normalization in tiling microarray experiments.
TE Royce, JS Rozowsky, MB Gerstein (2007). Bioinformatics 23: 988-97.

Positional artifacts in microarrays: experimental verification and construction of COP, an automated detection tool.
H Yu, K Nguyen, T Royce, J Qian, K Nelson, M Snyder, M Gerstein (2007). Nucleic Acids Res 35: e8.

High-resolution mapping of DNA copy alterations in human chromosome 22 using high-density tiling oligonucleotide arrays.
AE Urban, JO Korbel, R Selzer, T Richmond, A Hacker, GV Popescu, JF Cubells, R Green, BS Emanuel, MB Gerstein, SM Weissman, M Snyder (2006). Proc Natl Acad Sci U S A 103: 4534-9.

Design optimization methods for genomic DNA tiling arrays.
P Bertone, V Trifonov, JS Rozowsky, F Schubert, O Emanuelsson, J Karro, MY Kao, M Snyder, M Gerstein (2006). Genome Res 16: 271-81.

Fast optimal genome tiling with applications to microarray design and homology search.
P Berman, P Bertone, B Dasgupta, M Gerstein, MY Kao, M Snyder (2004). J Comput Biol 11: 766-85.


Return to front page