<?xml version="1.0"?>
<!DOCTYPE PubmedArticleSet PUBLIC "-//NLM//DTD PubMedArticle, 1st January 2008//EN" "http://www.ncbi.nlm.nih.gov/entrez/query/DTD/pubmed_080101.dtd">
<PubmedArticleSet>
<PubmedArticle><UniqueData>
<labid/>
<PMID>0008816770</PMID>
<labtitle/>
<labcite/>
<preprint>http://papers.gersteinlab.org/e-print//preprint.pdf</preprint>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>issue-overview-cosb</labid>
<labtitle>Sequences and Topology</labtitle>
<Authors>M Gerstein, B Honig</Authors>
<MedlineTA>Current Opinion in Structural Biology</MedlineTA>
<Year>2001</Year>
<Volume>11</Volume>
<MedlinePgn>327-329</MedlinePgn>
<preprint>http://papers.gersteinlab.org/e-print/issue-overview-cosb/text.pdf</preprint>
<PMID>11406382</PMID>
<sortval>75</sortval>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>epub</labid>
<labtitle>An analysis of the present system of scientific publishing:
what's wrong and where to go from here</labtitle>
<labcite>D Greenbaum, J Lim, M Gerstein. Interdiscip Sci Rev. (in 
press)</labcite>
<preprint>http://papers.gersteinlab.org/e-print/epub/text.pdf</preprint>
<Authors>D Greenbaum, J Lim, M Gerstein</Authors>
<Journal>Interdiscip Sci Rev</Journal>
<Year>2003</Year>
<Volume>28</Volume>
<Pages>293-302</Pages>
<subject>epublishing</subject>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>paperdeath</labid>
<labtitle>The Death of 
the Scientific 
Paper</labtitle>
<labcite>Seringhaus M, Gerstein M (2006). The Scientist. 20(9): 25</labcite>
<preprint>http://papers.gersteinlab.org/e-print/paperdeath/preprint.pdf</preprint>
	<grant>keck</grant>
<Year>2006</Year>
	<subject>2006-summary,epublishing</subject>
<website>http://www.the-scientist.com/2006/9/1/25/1/</website>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>yeast-hub</labid>
<labtitle>An XML-Based Approach to Integrating Heterogeneous Yeast Genome
Data</labtitle>
<labcite>KH Cheung, D Pan, A Smith, M Seringhaus, SM Douglas, M Gerstein.
2004 International Conference on Mathematics and Engineering Techniques in
Medicine and Biological Sciences (METMBS); pp 236-242</labcite>
<preprint>http://papers.gersteinlab.org/e-print/yeast-hub/preprint.pdf</preprint>
<Year>2004</Year>
	<grant>nsfymd</grant>
	<subject>2004-summary,interrelate,student</subject>
<website>http://yeasthub.gersteinlab.org</website>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>funcgrid</labid>
<labtitle>
Towards a systematic definition of protein function that scales to the genome level: Defining function in terms of interactions.
</labtitle>
<labcite>
N Lan, R Jansen, M Gerstein (2002).
Proceedings of the IEEE 90:1848-1858
</labcite>
<grant>nesg</grant>
<volume>90</volume>
<pages>1848-1858</pages>
<Year>2002</Year>
<preprint>http://papers.gersteinlab.org/e-print/funcgrid/reprint.pdf</preprint>
<website>http://partslist.org/func</website>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>

<labid>geom-inttab</labid>

<labtitle>Protein Geometry: Distances, Areas, and Volumes</labtitle>

<website>http://bioinfo.mbb.yale.edu/geometry</website>

<website2>http://www.iucr.org/iucr-top/it/itf/itf.html</website2>

<preprint>http://papers.gersteinlab.org/e-print/geom-inttab/preprint.pdf</preprint>

<bookdata>(Volume F, Chapter 22.1.1, pages 531-539; M Rossmann &amp; E Arnold, editors; 

Dordrecht: Kluwer)</bookdata>

<MedlineTA>International Tables for Crystallography</MedlineTA>

<Year>2001</Year>

<Authors>M Gerstein, F M Richards</Authors>

<subject>volumes</subject>
<sortval>81</sortval>
<grant>mem</grant>
<labcite/>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>semweb-dbinteg</labid>
<labtitle>Semantic Web Approach to Database Integration in the Life Sciences</labtitle>
<labcite>KH Cheung, AK Smith, KYL Yip, CJO Baker, MB Gerstein (2007) in Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences (eds. C Baker and K Cheung, Springer, NY), pp. 11-30</labcite>
<preprint>http://papers.gersteinlab.org/e-print/semweb-dbinteg/preprint.pdf</preprint>
	<grant>keck</grant>
	<subject>2007-summary,epublishing</subject>
<website>http://www.cs.concordia.ca/~baker/SW-RKDLS-May-2006.htm</website>
<Year>2007</Year>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labtitle> Motions in a Database Framework: from Structure to Sequence</labtitle>
<Authors>M Gerstein, R Jansen, T Johnson, J Tsai, W Krebs</Authors>
<Year>1999</Year>
<MedlineTA>Rigidity Theory and Applications</MedlineTA>
<bookdata>401-442 (ed. M F Thorpe and P M Duxbury, Kluwer Academic/Plenum Publishers)</bookdata>
<preprint>http://papers.gersteinlab.org/e-print/rigidity-conf.pdf</preprint>
<labid>rigidity-conf</labid>
<website>http://bioinfo.mbb.yale.edu/MolMovDB</website>
<sortval>51</sortval>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>

<labid>lesk-review-tibs</labid>

<preprint>http://papers.gersteinlab.org/e-print/lesk-review-tibs/index.txt</preprint>

<labtitle> A Bauhaus for Biologists: An Introduction to Protein Architecture by A. M. Lesk</labtitle>

<Authors>P Harrison, M Gerstein</Authors>

<MedlineTA>Trends Biochem Sci</MedlineTA>

<Volume>26</Volume>

<MedlinePgn>204-205</MedlinePgn>

<Year>2001</Year>


<sortval>80</sortval>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>CoarseSurf</labid>
<labtitle>A Resolution-Sensitive Procedure for Comparing Protein Surfaces and its Application to the Comparison of Antigen-Combining Sites.</labtitle>
<labcite>M Gerstein (1992) Acta Crystallographica A48: 271-276. </labcite>
<Year>1992</Year>
<grant/>
<subject/>
<website>http://bioinfo.mbb.yale.edu/hyper/mbg/CoarseSurf/</website>
<preprint>http://papers.gersteinlab.org/e-print/CoarseSurf/preprint.pdf</preprint>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>sim-h2o-prot-helix</labid>
<labtitle>Simulation of Water around a Model Protein Helix. 1. Two-dimensional Projections of Solvent Structure. </labtitle>
<labcite>M Gerstein, R Lynden-Bell (1993) Journal of Physical Chemistry 97: 2982-2991. </labcite>
<Year>1993</Year>
<grant/>
<subject/>
<website/>
<preprint>http://papers.gersteinlab.org/e-print/sim-h2o-prot-helix/preprint.pdf</preprint>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>qloc-ijqc</labid>
<preprint>http://papers.gersteinlab.org/e-print/qloc-ijqc/text.pdf</preprint>
<website>http://bioinfo.mbb.yale.edu/genome/localize</website>
<website2>http://www3.interscience.wiley.com/cgi-bin/fulltext/85008594/FILE?TPL=ftx_start</website2>
<Authors>V Alexandrov, M Gerstein</Authors>
<Journal>International Journal of Quantum Chemistry</Journal>
<Pages>693-696</Pages>
<Volume>85</Volume>
<Year>2001</Year> 
<labtitle>Calculating populations of subcellular compartments using density matrix formalism</labtitle>
	<subject/>
<sortval>85</sortval>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>ijcnnn</labid>
<labtitle>Inferring Protein-Protein Interactions Using Interaction Network Topologies</labtitle>
<labcite>A Paccanaro, V Trifonov, H Yu, M Gerstein (2005). International Joint Conference on Neural Networks (IJCNN, Jul. 31-Aug. 4, Montreal, Canada), pages 161 - 166, vol. 1</labcite>
<preprint>http://papers.gersteinlab.org/e-print/ijcnnn/preprint.pdf</preprint>
<grant/>
<subject>interactions</subject>
<Year>2005</Year>
<website/>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>webantitrust</labid>
<labtitle>Semantic Web Standards: Legal and Social Issues and Implications</labtitle>
<labcite>D Greenbaum, M Gerstein (2007) in Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences (eds. C Baker and K Cheung, Springer, NY), pp. 413-433</labcite>
<preprint>http://papers.gersteinlab.org/e-print/webantitrust/preprint.pdf</preprint>
	<grant>keck</grant>
	<subject>2007-summary,epublishing</subject>
<website>http://www.cs.concordia.ca/~baker/SW-RKDLS-May-2006.htm</website>
<Year>2007</Year>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>powerlaw-analytic</labid>
<labtitle>Analytical Evolutionary Model for Protein Fold Occurrence in Genomes, Accounting for the Effects of Gene Duplication, Deletion, Acquisition and Selective Pressure</labtitle>
<labcite>M Kamal, N Luscombe, J Qian, M Gerstein (2006) in Power Laws, Scale-Free Networks and Genome Biology (edited by EV Koonin, YI Wolf, GP Karev; Springer, New York), pages 165-193</labcite>
<preprint>http://papers.gersteinlab.org/e-print/powerlaw-analytic/reprint.pdf</preprint>
	<grant/>
<Year>2006</Year>
	<subject>2006-summary</subject>
<website/>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>transmemb-oxy-content</labid>
<labtitle>Transmembrane protein oxygen content and compartmentalization of cells.</labtitle>
<labcite>R Sasidharan, A Smith and MB Gerstein (in press) PLoS One</labcite>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>whatis-imia</labid>
<labtitle>What is Bioinformatics? A Proposed Definition and Overview of the Field</labtitle>
<MedlineTA>Intl. Medical Informatics Association</MedlineTA>
<Year>2001</Year>
<Authors>N Luscombe, D Greenbaum, M Gerstein</Authors>
<website>http://bioinfo.mbb.yale.edu/what-is-it</website>
<preprint>http://papers.gersteinlab.org/e-print/whatis-imia</preprint>
<bookdata>(Yearbook, Pages 83-99)</bookdata>
<sortval>74</sortval>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>epub-debate-nature</labid>
<labtitle>Blurring the boundaries between scientific 'papers' and biological databases</labtitle>
<Journal>Nature Yearbook of Science and Technology</Journal>
<Authors>M Gerstein, J Junker</Authors>
<bookdata>210-212 (ed. D Butler, Palgrave Macmillan Publishers)</bookdata>
<Year>2002</Year>
<preprint>http://papers.gersteinlab.org/e-print/epub-debate-nature/text.html</preprint>
<website>http://www.nature.com/nature/debates/e-access</website>
<website2>http://www.palgrave-usa.com/catalogue/index.asp?isbn=0333971477</website2>
<footnote>ISBN 0-333-971-97147-7</footnote>
<subject>epublishing</subject>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>xml-bibe</labid>
<labtitle>An XML Application for Genomic Data Interoperation</labtitle>
<labcite>
Cheung KH, Liu Y, Kumar K, Snyder M, Gerstein M, Miller P. IEEE International Symposium on Bio-Informatics and Biomedical Engineering (BIBE) 2001, pp. 97-103 
</labcite>
<Year>2001</Year>
<preprint>http://papers.gersteinlab.org/e-print/xml-bibe/reprint.pdf</preprint>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>purcell-nmr</labid>
<labtitle>Purcell's early work on NMR: Contingency versus Inevitability  </labtitle>
<labcite>M Gerstein (1994) American Journal of Physics 62: 596-601</labcite>
<Year>1994</Year>
<grant/>
<subject>epublishing</subject>
<website>http://bioinfo.mbb.yale.edu/hyper/mbg/Purcell/</website>
<preprint>http://papers.gersteinlab.org/e-print/purcell-nmr/preprint.pdf</preprint>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>intint-chapter</labid>
<labtitle>Protein Interaction Prediction by Integrating Genomic Features and Protein Interaction Network Analysis</labtitle>
<labcite>LJ Lu, Y Xia, H Yu, A Rives, H Lu, F Schubert, M Gerstein (2005). Data Analysis and Visualization in Genomics and Proteomics (Wiley, NY)</labcite>
<preprint>http://papers.gersteinlab.org/e-print/intint-chapter/preprint.pdf</preprint>
	<grant/>
<Year>2005</Year>
	<subject>2005-Summary,interactions,student</subject>
<website>http://networks.gersteinlab.org/intint/</website>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>inv-prob-synch-rad</labid>
<labtitle>Inverse Problem for Synchrotron Radiation in the Presence of Noise</labtitle>
<labcite>N Fisch, A Kritz, M Gerstein (1987) Proceedings of the Sixth Joint Workshop on Electron Cyclotron Emission and Electron Cyclotron Resonance Heating. (eds. A Riviere, A Costley), 23-30 (Oxford, 16-17 September).</labcite>
<preprint>http://papers.gersteinlab.org/e-print/inv-prob-synch-rad/preprint.pdf</preprint>
<Year>1987</Year>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>cyberlaw</labid>
<labtitle>Practicing Cyberlaw in the Year 2000.</labtitle>
<labcite>R Becker, M Gerstein (1996) New Jersey Lawyer Magazine 179: 12-27 (September).</labcite>
<preprint>http://papers.gersteinlab.org/e-print/cyberlaw/preprint.pdf</preprint>
<Year>1996</Year>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
	<labid>tiling-wabi</labid>
	<labtitle>Fast optimal genome tiling with applications to microarray design and homology search.</labtitle>
	<Authors>P Berman, P Bertone, B DasGupta, M Gerstein, M-Y Kao, M Snyder</Authors>
	<labcite>
	P Berman, P Bertone, B DasGupta, M Gerstein, M-Y Kao, M Snyder.  (2002) Proceedings of the 2nd International Workshop on Algorithms in Bioinformatics. Springer-Verlag LNCS 2452: 419-433
	</labcite>
	<Year>2002</Year>
	<preprint>http://papers.gersteinlab.org/e-print/tiling-wabi/all.pdf</preprint>
	<website>http://www.dis.uniroma1.it/~algo02</website>
	<grant>cegs</grant>
	<subject>microarray</subject>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>changing-charge</labid>
<labtitle>Keeping the Shape but Changing the Charges: A Simulation Study of Urea and its Isosteric Analogues</labtitle>
<labcite>J Tsai, M Gerstein, M Levitt (1996) Journal of Chemical Physics 104: 9417-9430</labcite>
<Year>1996</Year>
<grant/>
<subject/>
<website/>
<preprint>http://papers.gersteinlab.org/e-print/changing-charge/preprint.pdf</preprint>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>sim-h2o-prot-helix-2</labid>
<labtitle>Simulation of Water around a Model Protein Helix. 2. The Relative Contributions of Packing, Hydrophobicity, and Hydrogen Bonding.</labtitle>
<labcite>M Gerstein, R Lynden-Bell (1993) Journal of Physical Chemistry 97: 2991-2999.</labcite>
<Year>1993</Year>
<grant/>
<subject/>
<website/>
<preprint>http://papers.gersteinlab.org/e-print/sim-h2o-prot-helix-2/preprint.pdf</preprint>
</UniqueData></PubmedArticle><PubmedArticle><UniqueData>
<labid>chip-seq-simu</labid>
<labtitle>Modeling ChIP sequencing in silico with applications</labtitle>
<labcite>ZD Zhang, J Rozowsky, M Snyder, J Chang, M Gerstein (In press) PLoS Computational Biology</labcite>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle><PubmedArticle>
    <MedlineCitation Owner="NLM" Status="In-Process">
        <PMID>18528383</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>06</Month>
            <Day>05</Day>
        </DateCreated>
        <Article PubModel="Print">
            <Journal>
                <ISSN IssnType="Electronic">1476-4687</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>453</Volume>
                    <Issue>7196</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Jun</Month>
                        <Day>5</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Nature</Title>
                <ISOAbbreviation>Nature</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Genomics: protein fossils live on as RNA.</ArticleTitle>
            <Pagination>
                <MedlinePgn>729-31</MedlinePgn>
            </Pagination>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Sasidharan</LastName>
                    <ForeName>Rajkumar</ForeName>
                    <Initials>R</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <PublicationTypeList>
                <PublicationType>Comment</PublicationType>
                <PublicationType>News</PublicationType>
            </PublicationTypeList>
        </Article>
        <MedlineJournalInfo>
            <Country>England</Country>
            <MedlineTA>Nature</MedlineTA>
            <NlmUniqueID>0410462</NlmUniqueID>
        </MedlineJournalInfo>
        <CitationSubset>IM</CitationSubset>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>6</Month>
                <Day>6</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>6</Month>
                <Day>6</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">453729a</ArticleId>
            <ArticleId IdType="doi">10.1038/453729a</ArticleId>
            <ArticleId IdType="pubmed">18528383</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>sirnapgene</labid>
<PMID>18528383</PMID>
<labtitle>Genomics: protein fossils live on as RNA.</labtitle>
<labcite>Sasidharan R, Gerstein M (2008) Nature</labcite>
<preprint>http://papers.gersteinlab.org/e-print/sirnapgene/preprint.pdf</preprint>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="In-Data-Review">
        <PMID>18511261</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>06</Month>
            <Day>16</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Print">0959-440X</ISSN>
                <JournalIssue CitedMedium="Print">
                    <Volume>18</Volume>
                    <Issue>3</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Jun</Month>
                    </PubDate>
                </JournalIssue>
                <Title>Current opinion in structural biology</Title>
                <ISOAbbreviation>Curr. Opin. Struct. Biol.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>The current excitement about copy-number variation: how it relates to gene duplications and protein families.</ArticleTitle>
            <Pagination>
                <MedlinePgn>366-74</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>Following recent technological advances there has been an increasing interest in genome structural variants (SVs), in particular copy-number variants (CNVs) - large-scale duplications and deletions. Although not immediately evident, CNV surveys make a conceptual connection between the fields of population genetics and protein families, in particular with regard to the stability and expandability of families. The mechanisms giving rise to CNVs can be considered as fundamental processes underlying gene duplication and loss; duplicated genes being the results of 'successful' copies, fixed and maintained in the population. Conversely, many 'unsuccessful' duplicates remain in the genome as pseudogenes. Here, we survey studies on CNVs, highlighting issues related to protein families. In particular, CNVs tend to affect specific gene functional categories, such as those associated with environmental response, and are depleted in genes related to basic cellular processes. Furthermore, CNVs occur more often at the periphery of the protein interaction network. In comparison, protein families associated with successful and unsuccessful duplicates are associated with similar functional categories but are differentially placed in the interaction network. These trends are likely reflective of CNV formation biases and natural selection, both of which differentially influence distinct protein families.</AbstractText>
            </Abstract>
            <Affiliation>Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA; European Molecular Biology Laboratory, D-69117 Heidelberg, Germany.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Korbel</LastName>
                    <ForeName>Jan O</ForeName>
                    <Initials>JO</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Kim</LastName>
                    <ForeName>Philip M</ForeName>
                    <Initials>PM</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Chen</LastName>
                    <ForeName>Xueying</ForeName>
                    <Initials>X</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Urban</LastName>
                    <ForeName>Alexander Eckehart</ForeName>
                    <Initials>AE</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Weissman</LastName>
                    <ForeName>Sherman</ForeName>
                    <Initials>S</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Snyder</LastName>
                    <ForeName>Michael</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark B</ForeName>
                    <Initials>MB</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <PublicationTypeList>
                <PublicationType>Journal Article</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>05</Month>
                <Day>27</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>England</Country>
            <MedlineTA>Curr Opin Struct Biol</MedlineTA>
            <NlmUniqueID>9107784</NlmUniqueID>
        </MedlineJournalInfo>
        <CitationSubset>IM</CitationSubset>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="received">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>6</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="accepted">
                <Year>2008</Year>
                <Month>2</Month>
                <Day>13</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>27</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>31</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>31</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">S0959-440X(08)00039-0</ArticleId>
            <ArticleId IdType="doi">10.1016/j.sbi.2008.02.005</ArticleId>
            <ArticleId IdType="pubmed">18511261</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>cosbcnv</labid>
<PMID>18511261</PMID>
<labtitle>The current excitement about copy-number variation: how it relates to gene duplications and protein families.</labtitle>
<labcite>JO Korbel, PM Kim, X Chen, AE Urban, S Weissman, M Snyder, MB Gerstein (2008) Curr Opin Struct Biol</labcite>
<preprint>http://papers.gersteinlab.org/e-print/cosbcnv/preprint.pdf</preprint>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="Publisher">
        <PMID>18487680</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>5</Month>
            <Day>19</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1535-9484</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <PubDate>
                        <Year>2008</Year>
                        <Month>May</Month>
                        <Day>18</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Molecular &amp; cellular proteomics : MCP</Title>
                <ISOAbbreviation>Mol. Cell Proteomics</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Targeting the human cancer pathway protein interaction network by structural genomics.</ArticleTitle>
            <Pagination>
                <MedlinePgn/>
            </Pagination>
            <Abstract>
                <AbstractText>Structural genomics provides an important approach for characterizing and understanding systems biology. As a step towards better integrating protein three-dimensional (3D) structural information in cancer systems biology, we have constructed a Human Cancer Pathway Protein Interaction Network (HCPIN) by analysis of several classical cancer-associated signaling pathways and their physical protein-protein interactions. Many well-known cancer-associated proteins play central roles as &quot;hubs&quot; or &quot;bottlenecks&quot; in the HCPIN. At least half of HCPIN proteins are either directly associated with or interact with multiple signaling pathways. While some 45% of residues in these proteins are in sequence segments that meet criteria sufficient for approximate homology modeling (Blast E-val &lt; 10-6), only ~ 20% of residues in these proteins are structurally covered using high-accuracy homology modeling criteria (i.e. Blast E_val &lt; 10-6 and at least 80% sequence identity) or by actual experimental structures. The HCPIN website (http://nmr.cabm.rutgers.edu/hcpin) provides a comprehensive description of this biomedical important multi-pathway network, together with experimental and homology models of HCPIN proteins useful for cancer biology research. In order to complement and enrich cancer systems biology, the Northeast Structural Genomics Consortium (NESG) (www.nesg.org) is targeting &gt; 1,000 human proteins and protein domains from the HCPIN for sample production and 3D structure determination. The long-range goal of this effort is to provide a comprehensive 3D structure-function database for human cancer-associated proteins and protein complexes, in the context of their interaction networks. The network-based target selection (BioNet) approach described here is an example of a general strategy for targeting co-functioning proteins by structural genomics projects.</AbstractText>
            </Abstract>
            <Affiliation>Center for advanced biotechnology and medicine, Rutgers University, Piscataway, NJ 08854.</Affiliation>
            <AuthorList>
                <Author>
                    <LastName>Huang</LastName>
                    <FirstName>Yuanpeng Janet</FirstName>
                    <Initials>YJ</Initials>
                </Author>
                <Author>
                    <LastName>Hang</LastName>
                    <FirstName>Dehua</FirstName>
                    <Initials>D</Initials>
                </Author>
                <Author>
                    <LastName>Lu</LastName>
                    <FirstName>Long Jason</FirstName>
                    <Initials>LJ</Initials>
                </Author>
                <Author>
                    <LastName>Tong</LastName>
                    <FirstName>Liang</FirstName>
                    <Initials>L</Initials>
                </Author>
                <Author>
                    <LastName>Gerstein</LastName>
                    <FirstName>Mark B</FirstName>
                    <Initials>MB</Initials>
                </Author>
                <Author>
                    <LastName>Montelione</LastName>
                    <FirstName>Gaetano T</FirstName>
                    <Initials>GT</Initials>
                </Author>
            </AuthorList>
            <Language>ENG</Language>
            <PublicationTypeList>
                <PublicationType>JOURNAL ARTICLE</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>18</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <MedlineTA>Mol Cell Proteomics</MedlineTA>
            <NlmUniqueID>101125647</NlmUniqueID>
        </MedlineJournalInfo>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>20</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>20</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>aheadofprint</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">M700550-MCP200</ArticleId>
            <ArticleId IdType="doi">10.1074/mcp.M700550-MCP200</ArticleId>
            <ArticleId IdType="pubmed">18487680</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>hcpin</labid>
<PMID>18487680</PMID>
<labtitle>Targeting the human cancer pathway protein interaction network by structural genomics.</labtitle>
<labcite>Huang YJ, Hang D, Lu LJ, Tong L, Gerstein MB, Montelione GT (2008) Mol Cell Proteomics</labcite>
<preprint>http://papers.gersteinlab.org/e-print/hcpin/preprint.pdf</preprint>
	<grant>nesg</grant>
	<subject/>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="Publisher">
        <PMID>18487515</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>5</Month>
            <Day>19</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Print">1088-9051</ISSN>
                <JournalIssue CitedMedium="Print">
                    <PubDate>
                        <Year>2008</Year>
                        <Month>May</Month>
                        <Day>16</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Genome research</Title>
                <ISOAbbreviation>Genome Res.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>A Genomics Analysis of RNA polymerase II modification and chromatin architecture related to 3' end RNA polyadenylation.</ArticleTitle>
            <Pagination>
                <MedlinePgn/>
            </Pagination>
            <Abstract>
                <AbstractText>Genomic analyses have been applied extensively to analyze the process of transcription initiation in mammalian cells, but much less to the events associated with transcript 3'end formation and transcription termination. We have used a novel approach to prepare 3' end fragment libraries from polyadenylated RNA of several cell types, and globally mapped the position of the poly(A) addition site using oligonucleotide arrays tiling one percent of the human genome. This approach revealed more 3' ends than had been previously annotated. The distribution of these ends relative to DNA sites bound by RNA polymerase II and the distributions with those for di- and trimethylated lysine 4 and lysine 36 of histone 3, was compared by ChIP-chip analysis. We found that a substantial fraction of unannotated 3' ends of RNA are intronic and reside antisense to the embedding gene. Poly(A) ends of annotated messages lie at a variable distance averaging approximately two kb upstream of the end of RNA polymerase binding (termination). Near the sites of RNA polymerase termination, as well as in some internal sites, there is an accumulation of both unphosphorylated and carboxy-terminal domain (CTD) serine 2 phosphorylated large subunit of polymerase II, suggesting pausing of the polymerase and perhaps dephosphorylation prior to release. Lysine 36 trimethylation occurs across the body of many transcribed genes, sometimes alternating with stretches of DNA in which lysine36 dimethylation is relatively more prominent. Lysine 36 methylation often decreases beginning at or near the site of polyadenylation, sometimes disappearing before disappearance of phosphorylated RNA polymerase II and release of RNA polymerase from DNA. Our results suggest that transcription termination may involve the separable events of loss of histone3 lysine 36 methylation and later release of RNA polymerase. The latter is often associated with polymerase pausing before release. Thus, overall our study reveals extensive sites of poly(A) addition across the human genome and provides insights into the events that may occur during 3' end formation.</AbstractText>
            </Abstract>
            <Affiliation>Yale University School of Medicine;</Affiliation>
            <AuthorList>
                <Author>
                    <LastName>Lian</LastName>
                    <FirstName>Zheng</FirstName>
                    <Initials>Z</Initials>
                </Author>
                <Author>
                    <LastName>Karpikov</LastName>
                    <FirstName>Alexander</FirstName>
                    <Initials>A</Initials>
                </Author>
                <Author>
                    <LastName>Lian</LastName>
                    <FirstName>Jin</FirstName>
                    <Initials>J</Initials>
                </Author>
                <Author>
                    <LastName>Mahajan</LastName>
                    <FirstName>Milind C</FirstName>
                    <Initials>MC</Initials>
                </Author>
                <Author>
                    <LastName>Hartman</LastName>
                    <FirstName>Stephen</FirstName>
                    <Initials>S</Initials>
                </Author>
                <Author>
                    <LastName>Gerstein</LastName>
                    <FirstName>Mark</FirstName>
                    <Initials>M</Initials>
                </Author>
                <Author>
                    <LastName>Snyder</LastName>
                    <FirstName>Michael</FirstName>
                    <Initials>M</Initials>
                </Author>
                <Author>
                    <LastName>Weissman</LastName>
                    <FirstName>Sherman M</FirstName>
                    <Initials>SM</Initials>
                </Author>
            </AuthorList>
            <Language>ENG</Language>
            <PublicationTypeList>
                <PublicationType>JOURNAL ARTICLE</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>16</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <MedlineTA>Genome Res</MedlineTA>
            <NlmUniqueID>9518021</NlmUniqueID>
        </MedlineJournalInfo>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>20</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>20</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>aheadofprint</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">gr.075804.107</ArticleId>
            <ArticleId IdType="doi">10.1101/gr.075804.107</ArticleId>
            <ArticleId IdType="pubmed">18487515</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>3prime</labid>
<PMID>18487515</PMID>
<labtitle>A Genomics Analysis of RNA polymerase II modification and chromatin architecture related to 3' end RNA polyadenylation.</labtitle>
<labcite>Z Lian, A Karpikov, J Lian, MC Mahajan, S Hartman, M Gerstein, M Snyder, SM Weissman (2008) Genome Res</labcite>
<preprint>http://papers.gersteinlab.org/e-print/3prime/preprint.pdf</preprint>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="In-Process">
        <PMID>18478571</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>06</Month>
            <Day>03</Day>
        </DateCreated>
        <Article PubModel="Print">
            <Journal>
                <ISSN IssnType="Electronic">1097-0215</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>123</Volume>
                    <Issue>3</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Aug</Month>
                        <Day>1</Day>
                    </PubDate>
                </JournalIssue>
                <Title>International journal of cancer. Journal international du cancer</Title>
                <ISOAbbreviation>Int. J. Cancer</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Association of cytokeratin 7 and 19 expression with genomic stability and favorable prognosis in clear cell renal cell cancer.</ArticleTitle>
            <Pagination>
                <MedlinePgn>569-76</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>The purpose of our study was to demonstrate that distinct cytogenetic alterations in the most common subtype of renal cell cancer, clear cell renal cell carcinoma (ccRCC), are reflected in protein expression profiles. We performed conventional cytogenetics and immunohistochemical analysis for cytokeratins (CKs) on 126 ccRCCs. Protein expression was evaluated in situ using a semiautomated quantitative system. The results were validated using an independent cohort of 209 ccRCCs with long-term follow-up. Cytogenetic alterations were identified in 96 of 126 ccRCCs, most of them involving chromosome 3 through loss, deletion or translocation. Expression of CKs and E-cadherin in ccRCC was associated with lack of cytogenetic alterations and low nuclear grade. In the validation set, CK7 and CK19 protein expression was associated with better clinical outcome. At the multivariate level, the best model included metastatic status and CK19 expression. Expression microarray analysis on 21 primary ccRCCs and 14 ccRCC metastases identified genes significantly associated with CK7 and CK19 expressing ccRCCs. Two novel ccRCC biomarkers associated with the CK7 positive ccRCC phenotype, PMS2 and MT1-MMP (MMP14), were further validated. We conclude that the variability observed for CK expression in ccRCC can be explained by genetic heterogeneity. Distinct molecular subtypes of ccRCC with prognostic relevance were identified, and the CK7/CK19 expressing subtype is associated with better outcome.</AbstractText>
            </Abstract>
            <Affiliation>Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Mertz</LastName>
                    <ForeName>Kirsten D</ForeName>
                    <Initials>KD</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Demichelis</LastName>
                    <ForeName>Francesca</ForeName>
                    <Initials>F</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Sboner</LastName>
                    <ForeName>Andrea</ForeName>
                    <Initials>A</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Hirsch</LastName>
                    <ForeName>Michelle S</ForeName>
                    <Initials>MS</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Cin</LastName>
                    <ForeName>Paola Dal</ForeName>
                    <Initials>PD</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Struckmann</LastName>
                    <ForeName>Kirsten</ForeName>
                    <Initials>K</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Storz</LastName>
                    <ForeName>Martina</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Scherrer</LastName>
                    <ForeName>Simone</ForeName>
                    <Initials>S</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Schmid</LastName>
                    <ForeName>Daniel M</ForeName>
                    <Initials>DM</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Strebel</LastName>
                    <ForeName>Räto T</ForeName>
                    <Initials>RT</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Probst-Hensch</LastName>
                    <ForeName>Nicole M</ForeName>
                    <Initials>NM</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Moch</LastName>
                    <ForeName>Holger</ForeName>
                    <Initials>H</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Rubin</LastName>
                    <ForeName>Mark A</ForeName>
                    <Initials>MA</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <PublicationTypeList>
                <PublicationType>Journal Article</PublicationType>
                <PublicationType>Research Support, Non-U.S. Gov't</PublicationType>
            </PublicationTypeList>
        </Article>
        <MedlineJournalInfo>
            <Country>United States</Country>
            <MedlineTA>Int J Cancer</MedlineTA>
            <NlmUniqueID>0042124</NlmUniqueID>
        </MedlineJournalInfo>
        <CitationSubset>IM</CitationSubset>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>15</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>15</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="doi">10.1002/ijc.23565</ArticleId>
            <ArticleId IdType="pubmed">18478571</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>rubincancer</labid>
<PMID>18478571</PMID>
<labtitle>Association of cytokeratin 7 and 19 expression with genomic stability and favorable prognosis in clear cell renal cell cancer.</labtitle>
<labcite>KD Mertz, F Demichelis, A Sboner, MS Hirsch, PD Cin, K Struckmann, M Storz, S Scherrer, DM Schmid, RT Strebel, NM Probst-Hensch, M Gerstein, H Moch, MA Rubin (2008) Int J Cancer 123: 569-76.</labcite>
<grant/>
<subject/>
<website/>
<preprint>http://papers.gersteinlab.org/e-print/rubincancer/preprint.pdf</preprint>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="MEDLINE">
        <PMID>18451266</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>06</Month>
            <Day>06</Day>
        </DateCreated>
        <DateCompleted>
            <Year>2008</Year>
            <Month>06</Month>
            <Day>24</Day>
        </DateCompleted>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1095-9203</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>320</Volume>
                    <Issue>5881</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Jun</Month>
                        <Day>6</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Science (New York, N.Y.)</Title>
                <ISOAbbreviation>Science</ISOAbbreviation>
            </Journal>
            <ArticleTitle>The transcriptional landscape of the yeast genome defined by RNA sequencing.</ArticleTitle>
            <Pagination>
                <MedlinePgn>1344-9</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>The identification of untranslated regions, introns, and coding regions within an organism remains challenging. We developed a quantitative sequencing-based method called RNA-Seq for mapping transcribed regions, in which complementary DNA fragments are subjected to high-throughput sequencing and mapped to the genome. We applied RNA-Seq to generate a high-resolution transcriptome map of the yeast genome and demonstrated that most (74.5%) of the nonrepetitive sequence of the yeast genome is transcribed. We confirmed many known and predicted introns and demonstrated that others are not actively used. Alternative initiation codons and upstream open reading frames also were identified for many yeast genes. We also found unexpected 3'-end heterogeneity and the presence of many overlapping genes. These results indicate that the yeast transcriptome is more complex than previously appreciated.</AbstractText>
            </Abstract>
            <Affiliation>Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520, USA.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Nagalakshmi</LastName>
                    <ForeName>Ugrappa</ForeName>
                    <Initials>U</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Wang</LastName>
                    <ForeName>Zhong</ForeName>
                    <Initials>Z</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Waern</LastName>
                    <ForeName>Karl</ForeName>
                    <Initials>K</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Shou</LastName>
                    <ForeName>Chong</ForeName>
                    <Initials>C</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Raha</LastName>
                    <ForeName>Debasish</ForeName>
                    <Initials>D</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Snyder</LastName>
                    <ForeName>Michael</ForeName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <DataBankList CompleteYN="Y">
                <DataBank>
                    <DataBankName>GEO</DataBankName>
                    <AccessionNumberList>
                        <AccessionNumber>GSE11209</AccessionNumber>
                    </AccessionNumberList>
                </DataBank>
            </DataBankList>
            <PublicationTypeList>
                <PublicationType>Journal Article</PublicationType>
                <PublicationType>Research Support, N.I.H., Extramural</PublicationType>
                <PublicationType>Research Support, Non-U.S. Gov't</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>05</Month>
                <Day>01</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>United States</Country>
            <MedlineTA>Science</MedlineTA>
            <NlmUniqueID>0404511</NlmUniqueID>
        </MedlineJournalInfo>
        <ChemicalList>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>Codon, Initiator</NameOfSubstance>
            </Chemical>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>DNA, Complementary</NameOfSubstance>
            </Chemical>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>DNA, Intergenic</NameOfSubstance>
            </Chemical>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>RNA, Fungal</NameOfSubstance>
            </Chemical>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>Untranslated Regions</NameOfSubstance>
            </Chemical>
        </ChemicalList>
        <CitationSubset>IM</CitationSubset>
        <MeshHeadingList>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Algorithms</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Codon, Initiator</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Computational Biology</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">DNA, Complementary</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">DNA, Intergenic</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Gene Expression Profiling</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Genes, Fungal</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Genome, Fungal</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Genomics</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Introns</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Open Reading Frames</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">RNA, Fungal</DescriptorName>
                <QualifierName MajorTopicYN="N">genetics</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Saccharomyces cerevisiae</DescriptorName>
                <QualifierName MajorTopicYN="Y">genetics</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Sequence Analysis, RNA</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Transcription, Genetic</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Untranslated Regions</DescriptorName>
            </MeshHeading>
        </MeshHeadingList>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>1</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>5</Month>
                <Day>3</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>6</Month>
                <Day>25</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">1158441</ArticleId>
            <ArticleId IdType="doi">10.1126/science.1158441</ArticleId>
            <ArticleId IdType="pubmed">18451266</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>rnaseq</labid>
<PMID>18451266</PMID>
<labtitle>The transcriptional landscape of the yeast genome defined by RNA sequencing.</labtitle>
<labcite>U Nagalakshmi, Z Wang, K Waern, C Shou, D Raha, M Gerstein, M Snyder (2008) Science 320: 1344-9.</labcite>
<preprint>http://papers.gersteinlab.org/e-print/rnaseq/preprint.pdf</preprint>
<grant/>
<subject/>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="Publisher">
        <PMID>18433058</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>4</Month>
            <Day>24</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1097-0134</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Apr</Month>
                        <Day>23</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Proteins</Title>
                <ISOAbbreviation>Proteins</ISOAbbreviation>
            </Journal>
            <ArticleTitle>HingeMaster: Normal mode hinge prediction approach and integration of complementary predictors.</ArticleTitle>
            <Pagination>
                <MedlinePgn/>
            </Pagination>
            <Abstract>
                <AbstractText>Protein motion is often the link between structure and function and a substantial fraction of proteins move through a domain hinge bending mechanism. Predicting the location of the hinge from a single structure is thus a logical first step towards predicting motion. Here, we describe ways to predict the hinge location by grouping residues with correlated normal-mode motions. We benchmarked our normal-mode based predictor against a gold standard set of carefully annotated hinge locations taken from the Database of Macromolecular Motions. We then compared it with three existing structure-based hinge predictors (TLSMD, StoneHinge, and FlexOracle), plus HingeSeq, a sequence-based hinge predictor. Each of these methods predicts hinges using very different sources of information-normal modes, experimental thermal factors, bond constraint networks, energetics, and sequence, respectively. Thus it is logical that using these algorithms together would improve predictions. We integrated all the methods into a combined predictor using a weighted voting scheme. Finally, we encapsulated all our results in a web tool which can be used to run all the predictors on submitted proteins and visualize the results. (c) 2008 Wiley-Liss, Inc.</AbstractText>
            </Abstract>
            <Affiliation>Department of Physics, Yale University, Bass 432, New Haven, Connecticut 06520.</Affiliation>
            <AuthorList>
                <Author>
                    <LastName>Flores</LastName>
                    <FirstName>Samuel C</FirstName>
                    <Initials>SC</Initials>
                </Author>
                <Author>
                    <LastName>Keating</LastName>
                    <FirstName>Kevin S</FirstName>
                    <Initials>KS</Initials>
                </Author>
                <Author>
                    <LastName>Painter</LastName>
                    <FirstName>Jay</FirstName>
                    <Initials>J</Initials>
                </Author>
                <Author>
                    <LastName>Morcos</LastName>
                    <FirstName>Faruck</FirstName>
                    <Initials>F</Initials>
                </Author>
                <Author>
                    <LastName>Nguyen</LastName>
                    <FirstName>Khang</FirstName>
                    <Initials>K</Initials>
                </Author>
                <Author>
                    <LastName>Merritt</LastName>
                    <FirstName>Ethan A</FirstName>
                    <Initials>EA</Initials>
                </Author>
                <Author>
                    <LastName>Kuhn</LastName>
                    <FirstName>Leslie A</FirstName>
                    <Initials>LA</Initials>
                </Author>
                <Author>
                    <LastName>Gerstein</LastName>
                    <FirstName>Mark B</FirstName>
                    <Initials>MB</Initials>
                </Author>
            </AuthorList>
            <Language>ENG</Language>
            <PublicationTypeList>
                <PublicationType>JOURNAL ARTICLE</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>23</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <MedlineTA>Proteins</MedlineTA>
            <NlmUniqueID>8700181</NlmUniqueID>
        </MedlineJournalInfo>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>25</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>25</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>aheadofprint</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="doi">10.1002/prot.22060</ArticleId>
            <ArticleId IdType="pubmed">18433058</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>hingemaster</labid>
<PMID>18433058</PMID>
<labtitle>HingeMaster: Normal mode hinge prediction approach and integration of complementary predictors.</labtitle>
<labcite>SC Flores, KS Keating, J Painter, F Morcos, K Nguyen, EA Merritt, LA Kuhn, MB Gerstein (2008) Proteins</labcite>
<grant/>
<subject/>
<website>http://molmovdb.org/cgi-bin/submit-flexoracle.cgi</website>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="In-Data-Review">
        <PMID>18414925</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>05</Month>
            <Day>23</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Print">0022-2844</ISSN>
                <JournalIssue CitedMedium="Print">
                    <Volume>66</Volume>
                    <Issue>5</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>May</Month>
                    </PubDate>
                </JournalIssue>
                <Title>Journal of molecular evolution</Title>
                <ISOAbbreviation>J. Mol. Evol.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Rapid evolution by positive darwinian selection in T-cell antigen CD4 in primates.</ArticleTitle>
            <Pagination>
                <MedlinePgn>446-56</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>CD4, an integral membrane glycoprotein, plays a critical role in the immune response and in the life cycle of simian and human immunodeficiency virus (SIV and HIV). Pairwise comparisons of orthologous human and mouse genes show that CD4 is evolving much faster than the majority of mammalian genes. The acceleration is too great to be attributed to a simple relaxation of the action of purifying selection alone. Here we show that the selective pressure acting on CD4 is highly variable between regions in the protein and identify codon sites under strong positive selection. We reconstruct the coding sequences for ancestral primate CD4s and model tertiary structures of all ancestral and extant sequences. Structural mapping of positively selected sites shows they distribute on the surface of the D1 domain of CD4, where the exogenous SIV gp120 protein binds. Moreover, structural models of the ancestral sequences show substantially larger variation in the interfacial electrostatic charge on CD4 and in the surface complementary between CD4 and gp120 in CD4 lineages from primates with natural SIV infections than those without. Thus, positive selection on CD4 among primates may reflect forces driven by SIV infection and could provide a link between changes in sequence and structure of CD4 during evolution and the interaction with the immunodeficiency virus.</AbstractText>
            </Abstract>
            <Affiliation>Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Zhang</LastName>
                    <ForeName>Zhengdong D</ForeName>
                    <Initials>ZD</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Weinstock</LastName>
                    <ForeName>George</ForeName>
                    <Initials>G</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <PublicationTypeList>
                <PublicationType>Journal Article</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>04</Month>
                <Day>15</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>Germany</Country>
            <MedlineTA>J Mol Evol</MedlineTA>
            <NlmUniqueID>0360051</NlmUniqueID>
        </MedlineJournalInfo>
        <CitationSubset>IM</CitationSubset>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="received">
                <Year>2007</Year>
                <Month>9</Month>
                <Day>9</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="accepted">
                <Year>2008</Year>
                <Month>2</Month>
                <Day>28</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="revised">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>16</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>15</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>17</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>17</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="doi">10.1007/s00239-008-9097-1</ArticleId>
            <ArticleId IdType="pubmed">18414925</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>CD4</labid>
<labtitle>Rapid Evolution by Positive Darwinian Selection in T-cell Antigen CD4 in Primates</labtitle>
<labcite>ZD Zhang, G Weinstock, M Gerstein (In press) Journal of Molecular Evolution </labcite>
<preprint>http://papers.gersteinlab.org/e-print/CD4/preprint.pdf</preprint>
<grant>cegs</grant>
<subject/>
<website/>
<PMID>18414925</PMID>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="MEDLINE">
        <PMID>18369428</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>03</Month>
            <Day>28</Day>
        </DateCreated>
        <DateCompleted>
            <Year>2008</Year>
            <Month>04</Month>
            <Day>23</Day>
        </DateCompleted>
        <Article PubModel="Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1553-7358</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>4</Volume>
                    <Issue>3</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Mar</Month>
                    </PubDate>
                </JournalIssue>
                <Title>PLoS computational biology</Title>
                <ISOAbbreviation>PLoS Comput. Biol.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Open access: taking full advantage of the content.</ArticleTitle>
            <Pagination>
                <MedlinePgn>e1000037</MedlinePgn>
            </Pagination>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Bourne</LastName>
                    <ForeName>Philip E</ForeName>
                    <Initials>PE</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Fink</LastName>
                    <ForeName>J Lynn</ForeName>
                    <Initials>JL</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <PublicationTypeList>
                <PublicationType>Editorial</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>03</Month>
                <Day>28</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>United States</Country>
            <MedlineTA>PLoS Comput Biol</MedlineTA>
            <NlmUniqueID>101238922</NlmUniqueID>
        </MedlineJournalInfo>
        <CitationSubset>IM</CitationSubset>
        <MeshHeadingList>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Abstracting and Indexing as Topic</DescriptorName>
                <QualifierName MajorTopicYN="Y">methods</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Authorship</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Information Dissemination</DescriptorName>
                <QualifierName MajorTopicYN="Y">methods</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Internet</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Natural Language Processing</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Periodicals as Topic</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">PubMed</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Publishing</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">United States</DescriptorName>
            </MeshHeading>
        </MeshHeadingList>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>3</Month>
                <Day>29</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>24</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>epublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="doi">10.1371/journal.pcbi.1000037</ArticleId>
            <ArticleId IdType="pubmed">18369428</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>killapp</labid>
<PMID>18369428</PMID>
<labtitle>Open access: taking full advantage of the content.</labtitle>
<labcite>PE Bourne, JL Fink, M Gerstein (2008) PLoS Comput Biol 4: e1000037.</labcite>
<preprint>http://papers.gersteinlab.org/e-print/killapp/preprint.pdf</preprint>
<grant>keck</grant>
<subject>epublishing</subject>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="MEDLINE">
        <PMID>18364713</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>03</Month>
            <Day>26</Day>
        </DateCreated>
        <DateCompleted>
            <Year>2008</Year>
            <Month>04</Month>
            <Day>21</Day>
        </DateCompleted>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1744-4292</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>4</Volume>
                    <PubDate>
                        <Year>2008</Year>
                    </PubDate>
                </JournalIssue>
                <Title>Molecular systems biology</Title>
                <ISOAbbreviation>Mol. Syst. Biol.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>The role of disorder in interaction networks: a structural analysis.</ArticleTitle>
            <Pagination>
                <MedlinePgn>179</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>Recent studies have emphasized the value of including structural information into the topological analysis of protein networks. Here, we utilized structural information to investigate the role of intrinsic disorder in these networks. Hub proteins tend to be more disordered than other proteins (i.e. the proteome average); however, we find this only true for those with one or two binding interfaces ('single'-interface hubs). In contrast, the distribution of disordered residues in multi-interface hubs is indistinguishable from the overall proteome. Surprisingly, we find that the binding interfaces in single-interface hubs are highly structured, as is the case for multi-interface hubs. However, the binding partners of single-interface hubs tend to have a higher level of disorder than the proteome average, suggesting that their binding promiscuity is related to the disorder of their binding partners. In turn, the higher level of disorder of single-interface hubs can be partly explained by their tendency to bind to each other in a cascade. A good illustration of this trend can be found in signaling pathways and, more specifically, in kinase cascades. Finally, our findings have implications for the current controversy related to party and date-hubs.</AbstractText>
            </Abstract>
            <Affiliation>Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Kim</LastName>
                    <ForeName>Philip M</ForeName>
                    <Initials>PM</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Sboner</LastName>
                    <ForeName>Andrea</ForeName>
                    <Initials>A</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Xia</LastName>
                    <ForeName>Yu</ForeName>
                    <Initials>Y</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <GrantList CompleteYN="Y">
                <Grant>
                    <GrantID>RR19895</GrantID>
                    <Acronym>RR</Acronym>
                    <Agency>United States NCRR</Agency>
                </Grant>
            </GrantList>
            <PublicationTypeList>
                <PublicationType>Journal Article</PublicationType>
                <PublicationType>Research Support, N.I.H., Extramural</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>03</Month>
                <Day>25</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>England</Country>
            <MedlineTA>Mol Syst Biol</MedlineTA>
            <NlmUniqueID>101235389</NlmUniqueID>
        </MedlineJournalInfo>
        <ChemicalList>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>Saccharomyces cerevisiae Proteins</NameOfSubstance>
            </Chemical>
        </ChemicalList>
        <CitationSubset>IM</CitationSubset>
        <MeshHeadingList>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Evolution, Molecular</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Protein Binding</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Saccharomyces cerevisiae</DescriptorName>
                <QualifierName MajorTopicYN="Y">metabolism</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Saccharomyces cerevisiae Proteins</DescriptorName>
                <QualifierName MajorTopicYN="Y">chemistry</QualifierName>
                <QualifierName MajorTopicYN="Y">metabolism</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Signal Transduction</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Stochastic Processes</DescriptorName>
            </MeshHeading>
        </MeshHeadingList>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="received">
                <Year>2007</Year>
                <Month>10</Month>
                <Day>15</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="accepted">
                <Year>2008</Year>
                <Month>2</Month>
                <Day>08</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="epublish">
                <Year>2008</Year>
                <Month>3</Month>
                <Day>25</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>3</Month>
                <Day>28</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>22</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">msb200816</ArticleId>
            <ArticleId IdType="doi">10.1038/msb.2008.16</ArticleId>
            <ArticleId IdType="pubmed">18364713</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>netdisorder</labid>
<PMID>18364713</PMID>
<labtitle>The role of disorder in interaction networks: a structural analysis.</labtitle>
<labcite>PM Kim, A Sboner, Y Xia, M Gerstein (2008) Mol Syst Biol 4: 179.</labcite>
<preprint>http://papers.gersteinlab.org/e-print/netdisorder/preprint.pdf</preprint>
	<grant>proteomicsctr</grant>
	<subject>interactions</subject>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="In-Process">
        <PMID>18328823</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>04</Month>
            <Day>03</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Print">0014-5793</ISSN>
                <JournalIssue CitedMedium="Print">
                    <Volume>582</Volume>
                    <Issue>8</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Apr</Month>
                        <Day>9</Day>
                    </PubDate>
                </JournalIssue>
                <Title>FEBS letters</Title>
                <ISOAbbreviation>FEBS Lett.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Manually structured digital abstracts: a scaffold for automatic text mining.</ArticleTitle>
            <Pagination>
                <MedlinePgn>1170</MedlinePgn>
            </Pagination>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Seringhaus</LastName>
                    <ForeName>Michael</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <PublicationTypeList>
                <PublicationType>Editorial</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>03</Month>
                <Day>06</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>Netherlands</Country>
            <MedlineTA>FEBS Lett</MedlineTA>
            <NlmUniqueID>0155157</NlmUniqueID>
        </MedlineJournalInfo>
        <CitationSubset>IM</CitationSubset>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2008</Year>
                <Month>3</Month>
                <Day>6</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>3</Month>
                <Day>11</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>3</Month>
                <Day>11</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">S0014-5793(08)00192-0</ArticleId>
            <ArticleId IdType="doi">10.1016/j.febslet.2008.02.073</ArticleId>
            <ArticleId IdType="pubmed">18328823</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>sdafollowup</labid>
<PMID>18328823</PMID>
<labtitle>Manually structured digital abstracts: A scaffold for automatic text mining.</labtitle>
<labcite>M Seringhaus, M Gerstein (2008) FEBS Lett 582: 1170. </labcite>
<preprint>http://papers.gersteinlab.org/e-print/sdafollowup/preprint.pdf</preprint>
<grant>keck</grant>
<subject>epublishing</subject>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="MEDLINE">
        <PMID>18258921</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>03</Month>
            <Day>03</Day>
        </DateCreated>
        <DateCompleted>
            <Year>2008</Year>
            <Month>04</Month>
            <Day>03</Day>
        </DateCompleted>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Print">1088-9051</ISSN>
                <JournalIssue CitedMedium="Print">
                    <Volume>18</Volume>
                    <Issue>3</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Mar</Month>
                    </PubDate>
                </JournalIssue>
                <Title>Genome research</Title>
                <ISOAbbreviation>Genome Res.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Systematic evaluation of variability in ChIP-chip experiments using predefined DNA targets.</ArticleTitle>
            <Pagination>
                <MedlinePgn>393-403</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>The most widely used method for detecting genome-wide protein-DNA interactions is chromatin immunoprecipitation on tiling microarrays, commonly known as ChIP-chip. Here, we conducted the first objective analysis of tiling array platforms, amplification procedures, and signal detection algorithms in a simulated ChIP-chip experiment. Mixtures of human genomic DNA and &quot;spike-ins&quot; comprised of nearly 100 human sequences at various concentrations were hybridized to four tiling array platforms by eight independent groups. Blind to the number of spike-ins, their locations, and the range of concentrations, each group made predictions of the spike-in locations. We found that microarray platform choice is not the primary determinant of overall performance. In fact, variation in performance between labs, protocols, and algorithms within the same array platform was greater than the variation in performance between array platforms. However, each array platform had unique performance characteristics that varied with tiling resolution and the number of replicates, which have implications for cost versus detection power. Long oligonucleotide arrays were slightly more sensitive at detecting very low enrichment. On all platforms, simple sequence repeats and genome redundancy tended to result in false positives. LM-PCR and WGA, the most popular sample amplification techniques, reproduced relative enrichment levels with high fidelity. Performance among signal detection algorithms was heavily dependent on array platform. The spike-in DNA samples and the data presented here provide a stable benchmark against which future ChIP platforms, protocol improvements, and analysis methods can be evaluated.</AbstractText>
            </Abstract>
            <Affiliation>Department of Genetics, Stanford University Medical Center, Stanford, California 94305, USA.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Johnson</LastName>
                    <ForeName>David S</ForeName>
                    <Initials>DS</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Li</LastName>
                    <ForeName>Wei</ForeName>
                    <Initials>W</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gordon</LastName>
                    <ForeName>D Benjamin</ForeName>
                    <Initials>DB</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Bhattacharjee</LastName>
                    <ForeName>Arindam</ForeName>
                    <Initials>A</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Curry</LastName>
                    <ForeName>Bo</ForeName>
                    <Initials>B</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Ghosh</LastName>
                    <ForeName>Jayati</ForeName>
                    <Initials>J</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Brizuela</LastName>
                    <ForeName>Leonardo</ForeName>
                    <Initials>L</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Carroll</LastName>
                    <ForeName>Jason S</ForeName>
                    <Initials>JS</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Brown</LastName>
                    <ForeName>Myles</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Flicek</LastName>
                    <ForeName>Paul</ForeName>
                    <Initials>P</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Koch</LastName>
                    <ForeName>Christoph M</ForeName>
                    <Initials>CM</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Dunham</LastName>
                    <ForeName>Ian</ForeName>
                    <Initials>I</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Bieda</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Xu</LastName>
                    <ForeName>Xiaoqin</ForeName>
                    <Initials>X</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Farnham</LastName>
                    <ForeName>Peggy J</ForeName>
                    <Initials>PJ</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Kapranov</LastName>
                    <ForeName>Philipp</ForeName>
                    <Initials>P</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Nix</LastName>
                    <ForeName>David A</ForeName>
                    <Initials>DA</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gingeras</LastName>
                    <ForeName>Thomas R</ForeName>
                    <Initials>TR</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Zhang</LastName>
                    <ForeName>Xinmin</ForeName>
                    <Initials>X</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Holster</LastName>
                    <ForeName>Heather</ForeName>
                    <Initials>H</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Jiang</LastName>
                    <ForeName>Nan</ForeName>
                    <Initials>N</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Green</LastName>
                    <ForeName>Roland D</ForeName>
                    <Initials>RD</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Song</LastName>
                    <ForeName>Jun S</ForeName>
                    <Initials>JS</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>McCuine</LastName>
                    <ForeName>Scott A</ForeName>
                    <Initials>SA</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Anton</LastName>
                    <ForeName>Elizabeth</ForeName>
                    <Initials>E</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Nguyen</LastName>
                    <ForeName>Loan</ForeName>
                    <Initials>L</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Trinklein</LastName>
                    <ForeName>Nathan D</ForeName>
                    <Initials>ND</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Ye</LastName>
                    <ForeName>Zhen</ForeName>
                    <Initials>Z</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Ching</LastName>
                    <ForeName>Keith</ForeName>
                    <Initials>K</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Hawkins</LastName>
                    <ForeName>David</ForeName>
                    <Initials>D</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Ren</LastName>
                    <ForeName>Bing</ForeName>
                    <Initials>B</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Scacheri</LastName>
                    <ForeName>Peter C</ForeName>
                    <Initials>PC</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Rozowsky</LastName>
                    <ForeName>Joel</ForeName>
                    <Initials>J</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Karpikov</LastName>
                    <ForeName>Alexander</ForeName>
                    <Initials>A</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Euskirchen</LastName>
                    <ForeName>Ghia</ForeName>
                    <Initials>G</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Weissman</LastName>
                    <ForeName>Sherman</ForeName>
                    <Initials>S</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Snyder</LastName>
                    <ForeName>Michael</ForeName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Yang</LastName>
                    <ForeName>Annie</ForeName>
                    <Initials>A</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Moqtaderi</LastName>
                    <ForeName>Zarmik</ForeName>
                    <Initials>Z</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Hirsch</LastName>
                    <ForeName>Heather</ForeName>
                    <Initials>H</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Shulha</LastName>
                    <ForeName>Hennady P</ForeName>
                    <Initials>HP</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Fu</LastName>
                    <ForeName>Yutao</ForeName>
                    <Initials>Y</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Weng</LastName>
                    <ForeName>Zhiping</ForeName>
                    <Initials>Z</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Struhl</LastName>
                    <ForeName>Kevin</ForeName>
                    <Initials>K</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Myers</LastName>
                    <ForeName>Richard M</ForeName>
                    <Initials>RM</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Lieb</LastName>
                    <ForeName>Jason D</ForeName>
                    <Initials>JD</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Liu</LastName>
                    <ForeName>X Shirley</ForeName>
                    <Initials>XS</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <DataBankList CompleteYN="Y">
                <DataBank>
                    <DataBankName>GEO</DataBankName>
                    <AccessionNumberList>
                        <AccessionNumber>GSE10114</AccessionNumber>
                    </AccessionNumberList>
                </DataBank>
            </DataBankList>
            <GrantList CompleteYN="Y">
                <Grant>
                    <GrantID>1R01 HG004069-01</GrantID>
                    <Acronym>HG</Acronym>
                    <Agency>United States NHGRI</Agency>
                </Grant>
            </GrantList>
            <PublicationTypeList>
                <PublicationType>Evaluation Studies</PublicationType>
                <PublicationType>Journal Article</PublicationType>
                <PublicationType>Research Support, N.I.H., Extramural</PublicationType>
                <PublicationType>Research Support, Non-U.S. Gov't</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>02</Month>
                <Day>07</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>United States</Country>
            <MedlineTA>Genome Res</MedlineTA>
            <NlmUniqueID>9518021</NlmUniqueID>
        </MedlineJournalInfo>
        <ChemicalList>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>Oligonucleotide Probes</NameOfSubstance>
            </Chemical>
            <Chemical>
                <RegistryNumber>9007-49-2</RegistryNumber>
                <NameOfSubstance>DNA</NameOfSubstance>
            </Chemical>
        </ChemicalList>
        <CitationSubset>IM</CitationSubset>
        <MeshHeadingList>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Algorithms</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Chromatin Immunoprecipitation</DescriptorName>
                <QualifierName MajorTopicYN="Y">methods</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Chromosome Aberrations</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">DNA</DescriptorName>
                <QualifierName MajorTopicYN="N">chemistry</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Genome, Human</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Humans</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Oligonucleotide Array Sequence Analysis</DescriptorName>
                <QualifierName MajorTopicYN="Y">methods</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Oligonucleotide Probes</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Polymerase Chain Reaction</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">ROC Curve</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Reproducibility of Results</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Tandem Repeat Sequences</DescriptorName>
            </MeshHeading>
        </MeshHeadingList>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2008</Year>
                <Month>2</Month>
                <Day>7</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>2</Month>
                <Day>9</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>4</Month>
                <Day>4</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">gr.7080508</ArticleId>
            <ArticleId IdType="doi">10.1101/gr.7080508</ArticleId>
            <ArticleId IdType="pubmed">18258921</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>encodespikein</labid>
<PMID>18258921</PMID>
<labtitle>Systematic evaluation of variability in ChIP-chip experiments using predefined DNA targets.</labtitle>
<labcite>DS Johnson, W Li, DB Gordon, A Bhattacharjee, B Curry, J Ghosh, L Brizuela, JS Carroll, M Brown, P Flicek, CM Koch, I Dunham, M Bieda, X Xu, PJ Farnham, P Kapranov, DA Nix, TR Gingeras, X Zhang, H Holster, N Jiang, R Green, JS Song, SA McCuine, E Anton, L Nguyen, ND Trinklein, Z Ye, K Ching, D Hawkins, B Ren, PC Scacheri, J Rozowsky, A Karpikov, G Euskirchen, S Weissman, M Gerstein, M Snyder, A Yang, Z Moqtaderi, H Hirsch, HP Shulha, Y Fu, Z Weng, K Struhl, RM Myers, JD Lieb, XS Liu (2008) Genome Res</labcite>
<preprint>http://papers.gersteinlab.org/e-print/encodespikein/preprint.pdf</preprint>
<grant>encode</grant>
<subject/>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="Publisher">
        <PMID>18254929</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>2</Month>
            <Day>7</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1465-6914</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>9</Volume>
                    <Issue>1</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Jan</Month>
                        <Day>31</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Genome biology</Title>
                <ISOAbbreviation>Genome Biol.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Uncovering trends in gene naming.</ArticleTitle>
            <Pagination>
                <MedlinePgn>401</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>ABSTRACT: We take stock of current genetic nomenclature and attempt to organize strange and notable gene names. We categorize, for instance, those that involve a naming system transferred from another context (for example, Pavlov's dogs). We hope this analysis provides clues to better steer gene naming in the future.</AbstractText>
            </Abstract>
            <Affiliation>Program in Computational Biology and Bioinformatics, Yale University, Whitney Avenue, New Haven, CT 06520, USA. mark.gerstein@yale.edu.</Affiliation>
            <AuthorList>
                <Author>
                    <LastName>Seringhaus</LastName>
                    <FirstName>Michael</FirstName>
                    <MiddleName>R</MiddleName>
                    <Initials>MR</Initials>
                </Author>
                <Author>
                    <LastName>Cayting</LastName>
                    <FirstName>Philip</FirstName>
                    <MiddleName>D</MiddleName>
                    <Initials>PD</Initials>
                </Author>
                <Author>
                    <LastName>Gerstein</LastName>
                    <FirstName>Mark</FirstName>
                    <MiddleName>B</MiddleName>
                    <Initials>MB</Initials>
                </Author>
            </AuthorList>
            <Language>ENG</Language>
            <PublicationTypeList>
                <PublicationType>JOURNAL ARTICLE</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>31</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <MedlineTA>Genome Biol</MedlineTA>
            <NlmUniqueID>100960660</NlmUniqueID>
        </MedlineJournalInfo>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>31</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>2</Month>
                <Day>8</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>2</Month>
                <Day>8</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>aheadofprint</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">gb-2008-9-1-401</ArticleId>
            <ArticleId IdType="doi">10.1186/gb-2008-9-1-401</ArticleId>
            <ArticleId IdType="pubmed">18254929</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>funnygene</labid>
<PMID>18254929</PMID>
<labtitle>Uncovering trends in gene naming.</labtitle>
<labcite>MR Seringhaus, PD Cayting, MB Gerstein (2008) Genome Biol 9: 401.</labcite>
<preprint>http://papers.gersteinlab.org/e-print/funnygene/preprint.pdf</preprint>
<grant>keck</grant>
<subject>epublishing</subject>
<website>http://www.gersteinlab.org/proj/funnygene/</website>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="Publisher">
        <PMID>18173853</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>1</Month>
            <Day>28</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1465-6914</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>9</Volume>
                    <Issue>1</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Jan</Month>
                        <Day>3</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Genome biology</Title>
                <ISOAbbreviation>Genome Biol.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome.</ArticleTitle>
            <Pagination>
                <MedlinePgn>R3</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>ABSTRACT: BACKGROUND: Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced. RESULTS: We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins. CONCLUSION: We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.</AbstractText>
            </Abstract>
            <Affiliation>Molecular, Cellular and Developmental Biology Department, KBT918, Yale University, 266 Whitney Avenue, New Haven, Connecticut 06511, USA. Michael.Snyder@yale.edu.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Wu</LastName>
                    <FirstName>Jia</FirstName>
                    <MiddleName>Qian</MiddleName>
                    <Initials>JQ</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Du</LastName>
                    <FirstName>Jiang</FirstName>
                    <Initials>J</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Rozowsky</LastName>
                    <FirstName>Joel</FirstName>
                    <Initials>J</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Zhang</LastName>
                    <FirstName>Zhengdong</FirstName>
                    <Initials>Z</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Urban</LastName>
                    <FirstName>Alexander</FirstName>
                    <MiddleName>E</MiddleName>
                    <Initials>AE</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Euskirchen</LastName>
                    <FirstName>Ghia</FirstName>
                    <Initials>G</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Weissman</LastName>
                    <FirstName>Sherman</FirstName>
                    <Initials>S</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <FirstName>Mark</FirstName>
                    <Initials>M</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Snyder</LastName>
                    <FirstName>Michael</FirstName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>ENG</Language>
            <PublicationTypeList>
                <PublicationType>JOURNAL ARTICLE</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>03</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <MedlineTA>Genome Biol</MedlineTA>
            <NlmUniqueID>100960660</NlmUniqueID>
        </MedlineJournalInfo>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="received">
                <Year>2007</Year>
                <Month>11</Month>
                <Day>7</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="revised">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>6</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="accepted">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>3</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>03</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>5</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>5</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>aheadofprint</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">gb-2008-9-1-r3</ArticleId>
            <ArticleId IdType="doi">10.1186/gb-2008-9-1-r3</ArticleId>
            <ArticleId IdType="pubmed">18173853</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>encoderaceseq</labid>
<PMID>18173853</PMID>
<labtitle>Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome.</labtitle>
<labcite>JQ Wu, J Du, J Rozowsky, Z Zhang, AE Urban, G Euskirchen, S Weissman, M Gerstein, M Snyder (2008) Genome Biol 9: R3</labcite>
<preprint>http://papers.gersteinlab.org/e-print/encoderaceseq/preprint.pdf</preprint>
<grant>encode</grant>
<subject/>
<website/>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="MEDLINE">
        <PMID>18077332</PMID>
        <DateCreated>
            <Year>2007</Year>
            <Month>12</Month>
            <Day>20</Day>
        </DateCreated>
        <DateCompleted>
            <Year>2008</Year>
            <Month>01</Month>
            <Day>28</Day>
        </DateCompleted>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1091-6490</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>104</Volume>
                    <Issue>51</Issue>
                    <PubDate>
                        <Year>2007</Year>
                        <Month>Dec</Month>
                        <Day>18</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Proceedings of the National Academy of Sciences of the United States of America</Title>
                <ISOAbbreviation>Proc. Natl. Acad. Sci. U.S.A.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Positive selection at the protein network periphery: evaluation in terms of structural constraints and cellular context.</ArticleTitle>
            <Pagination>
                <MedlinePgn>20274-9</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>Because of recent advances in genotyping and sequencing, human genetic variation and adaptive evolution in the primate lineage have become major research foci. Here, we examine the relationship between genetic signatures of adaptive evolution and network topology. We find a striking tendency of proteins that have been under positive selection (as compared with the chimpanzee) to be located at the periphery of the interaction network. Our results are based on the analysis of two types of genome evolution, both in terms of intra- and interspecies variation. First, we looked at single-nucleotide polymorphisms and their fixed variants, single-nucleotide differences in the human genome relative to the chimpanzee. Second, we examine fixed structural variants, specifically large segmental duplications and their polymorphic precursors known as copy number variants. We propose two complementary mechanisms that lead to the observed trends. First, we can rationalize them in terms of constraints imposed by protein structure: We find that positively selected sites are preferentially located on the exposed surface of proteins. Because central network proteins (hubs) are likely to have a larger fraction of their surface involved in interactions, they tend to be constrained and under negative selection. Conversely, we show that the interaction network roughly maps to cellular organization, with the periphery of the network corresponding to the cellular periphery (i.e., extracellular space or cell membrane). This suggests that the observed positive selection at the network periphery may be due to an increase of adaptive events on the cellular periphery responding to changing environments.</AbstractText>
            </Abstract>
            <Affiliation>Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA. pmkim@alum.mit.edu</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Kim</LastName>
                    <ForeName>Philip M</ForeName>
                    <Initials>PM</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Korbel</LastName>
                    <ForeName>Jan O</ForeName>
                    <Initials>JO</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark B</ForeName>
                    <Initials>MB</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <PublicationTypeList>
                <PublicationType>Journal Article</PublicationType>
                <PublicationType>Research Support, N.I.H., Extramural</PublicationType>
                <PublicationType>Research Support, Non-U.S. Gov't</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>12</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>United States</Country>
            <MedlineTA>Proc Natl Acad Sci U S A</MedlineTA>
            <NlmUniqueID>7505876</NlmUniqueID>
        </MedlineJournalInfo>
        <ChemicalList>
            <Chemical>
                <RegistryNumber>0</RegistryNumber>
                <NameOfSubstance>Proteins</NameOfSubstance>
            </Chemical>
        </ChemicalList>
        <CitationSubset>IM</CitationSubset>
        <MeshHeadingList>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Adaptation, Biological</DescriptorName>
                <QualifierName MajorTopicYN="Y">genetics</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Animals</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Evolution, Molecular</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Gene Dosage</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Genome</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Genome, Human</DescriptorName>
                <QualifierName MajorTopicYN="Y">genetics</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Humans</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Mutation</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Polymorphism, Single Nucleotide</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Protein Conformation</DescriptorName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="N">Proteins</DescriptorName>
                <QualifierName MajorTopicYN="Y">genetics</QualifierName>
                <QualifierName MajorTopicYN="N">metabolism</QualifierName>
            </MeshHeading>
            <MeshHeading>
                <DescriptorName MajorTopicYN="Y">Selection (Genetics)</DescriptorName>
            </MeshHeading>
        </MeshHeadingList>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>12</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>14</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2008</Year>
                <Month>1</Month>
                <Day>29</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">0710183104</ArticleId>
            <ArticleId IdType="doi">10.1073/pnas.0710183104</ArticleId>
            <ArticleId IdType="pubmed">18077332</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>netpossel</labid>
<PMID>18077332</PMID>
<labtitle>Positive selection at the protein network periphery: Evaluation in terms of structural constraints and cellular context.</labtitle>
<labcite>PM Kim, JO Korbel, MB Gerstein (2007) Proc Natl Acad Sci U S A.</labcite>
	<grant>cegs</grant>
	<subject>2007-summary,interactions</subject>
<website>http://www.gersteinlab.org/proj/netpossel/</website>
<preprint>http://papers.gersteinlab.org/e-print/netpossel/preprint.pdf</preprint>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="In-Process">
        <PMID>18065488</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>01</Month>
            <Day>21</Day>
        </DateCreated>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1537-1719</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>25</Volume>
                    <Issue>1</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Jan</Month>
                    </PubDate>
                </JournalIssue>
                <Title>Molecular biology and evolution</Title>
                <ISOAbbreviation>Mol. Biol. Evol.</ISOAbbreviation>
            </Journal>
            <ArticleTitle>Analysis of nuclear receptor pseudogenes in vertebrates: how the silent tell their stories.</ArticleTitle>
            <Pagination>
                <MedlinePgn>131-43</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>Transcription factor pseudogenes have not been systematically studied before. Nuclear receptors (NRs) constitute one of the largest groups of transcription factors in animals (e.g., 48 NRs in human). The availability of whole-genome sequences enables a global inventory of the NR pseudogenes in a number of vertebrate model organisms. Here we identify the NR pseudogenes in 8 vertebrate organisms and make our results available online at http://www.pseudogene.org/nr. The assignments reveal that NR pseudogenes as a group have characteristics related to generation and distribution contrary to expectations derived from previous large-scale pseudogene studies. In particular, 1) despite its large size, the NR gene family has only a very small number of pseudogenes in each of the vertebrate genomes examined; 2) despite the low transcription levels of NR genes, except for one, all other NR pseudogenes identified in this study are retropseudogenes; and 3) no duplicated NR pseudogenes are found, contrary to the fact that the NR gene family was expanded through several waves of gene duplication events. Our analyses further reveal a number of interesting aspects of NR pseudogenes. Specifically, through careful sequence analysis, we identify remnant introns in 2 mouse retropseudogenes, psiRev-erbbeta and psiLRH1. Generated from partially processed pre-mRNAs, they appear to be rare examples of highly unusual &quot;semiprocessed&quot; pseudogenes. Second, by comparing the genomic sequences, we uncover a pseudogene that is unique to the human lineage relative to chimpanzee. Generated by a recent duplication of a segment in the human genome, this pseudogene is a &quot;duplicated-processed&quot; pseudogene, belonging to a new pseudogene species. Finally, FXRbeta was nonfunctionalized in the human lineage and thus appears to be an example of a rare unitary pseudogene. By comparing orthologous sequences, we dated the FXR-FXRbeta duplication and the nonfunctionalization of FXRbeta in primates.</AbstractText>
            </Abstract>
            <Affiliation>Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, USA.</Affiliation>
            <AuthorList CompleteYN="Y">
                <Author ValidYN="Y">
                    <LastName>Zhang</LastName>
                    <ForeName>Zhengdong D</ForeName>
                    <Initials>ZD</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Cayting</LastName>
                    <ForeName>Philip</ForeName>
                    <Initials>P</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Weinstock</LastName>
                    <ForeName>George</ForeName>
                    <Initials>G</Initials>
                </Author>
                <Author ValidYN="Y">
                    <LastName>Gerstein</LastName>
                    <ForeName>Mark</ForeName>
                    <Initials>M</Initials>
                </Author>
            </AuthorList>
            <Language>eng</Language>
            <GrantList CompleteYN="Y">
                <Grant>
                    <GrantID>T15 LM07056</GrantID>
                    <Acronym>LM</Acronym>
                    <Agency>United States NLM</Agency>
                </Grant>
            </GrantList>
            <PublicationTypeList>
                <PublicationType>Journal Article</PublicationType>
                <PublicationType>Research Support, N.I.H., Extramural</PublicationType>
            </PublicationTypeList>
            <ArticleDate DateType="Electronic">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>07</Day>
            </ArticleDate>
        </Article>
        <MedlineJournalInfo>
            <Country>United States</Country>
            <MedlineTA>Mol Biol Evol</MedlineTA>
            <NlmUniqueID>8501455</NlmUniqueID>
        </MedlineJournalInfo>
        <CitationSubset>IM</CitationSubset>
    </MedlineCitation>
    <PubmedData>
        <History>
            <PubMedPubDate PubStatus="aheadofprint">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>7</Day>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="pubmed">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>11</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
            <PubMedPubDate PubStatus="medline">
                <Year>2007</Year>
                <Month>12</Month>
                <Day>11</Day>
                <Hour>9</Hour>
                <Minute>0</Minute>
            </PubMedPubDate>
        </History>
        <PublicationStatus>ppublish</PublicationStatus>
        <ArticleIdList>
            <ArticleId IdType="pii">msm251</ArticleId>
            <ArticleId IdType="doi">10.1093/molbev/msm251</ArticleId>
            <ArticleId IdType="pubmed">18065488</ArticleId>
        </ArticleIdList>
    </PubmedData>
<UniqueData>
<labid>nr-pseudogenes</labid>
<labtitle>Analysis of nuclear receptor pseudogenes in vertebrates: How the silent tell their stories</labtitle>
<labcite>ZD Zhang, P Cayting, G Weinstock, M Gerstein (In press) Molecular Biology and Evolution</labcite>
<preprint>http://papers.gersteinlab.org/e-print/nr-pseudogenes/preprint.pdf</preprint>
<grant/>
<subject>pseudogenes</subject>
<website>http://pseudogene.org/nr/</website>
<PMID>18065488</PMID>
</UniqueData></PubmedArticle>


<PubmedArticle>
    <MedlineCitation Owner="NLM" Status="MEDLINE">
        <PMID>18056067</PMID>
        <DateCreated>
            <Year>2008</Year>
            <Month>01</Month>
            <Day>15</Day>
        </DateCreated>
        <DateCompleted>
            <Year>2008</Year>
            <Month>02</Month>
            <Day>05</Day>
        </DateCompleted>
        <Article PubModel="Print-Electronic">
            <Journal>
                <ISSN IssnType="Electronic">1460-2059</ISSN>
                <JournalIssue CitedMedium="Internet">
                    <Volume>24</Volume>
                    <Issue>2</Issue>
                    <PubDate>
                        <Year>2008</Year>
                        <Month>Jan</Month>
                        <Day>15</Day>
                    </PubDate>
                </JournalIssue>
                <Title>Bioinformatics (Oxford, England)</Title>
                <ISOAbbreviation>Bioinformatics</ISOAbbreviation>
            </Journal>
            <ArticleTitle>An integrated system for studying residue coevolution in proteins.</ArticleTitle>
            <Pagination>
                <MedlinePgn>290-2</MedlinePgn>
            </Pagination>
            <Abstract>
                <AbstractText>Residue coevolution has recently emerged as an important concept, especially in the context of protein structures. While a multitude of different functions for quantifying it have been proposed, not much is known about their relative strengths and weaknesses. Also, subtle algorithmic details have discouraged implementing and comparing them. We addressed this issue by developing an integrated online system that enables comparative analyses with a comprehensive set of commonly used scoring functions, including Statistical Coupling Analysis (SCA), Explicit Likelihood of Subset Variation (ELSC), mutual information and correlation-based methods. A set of data preprocessing options are provided for improving the sensitivity and specificity of coevolution signal detection, including sequence weighting, residue grouping and the filtering of sequences, sites and site pairs. A total of more than 100 scoring variations are available. The system also provides facilities for studying the relationship between coevolution scores and inter-residue distances from a crystal structure if provided, which may help in understanding protein structures. AVAILABILITY: The system is available at http://coevolution.gersteinlab.org. The source code and JavaDoc API can also be downloaded from t