<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gm77</ui>
   <ji>1756-994X</ji>
   <fm>
      <dochead>Review</dochead>
      <bibl>
         <title>
            <p>Linking genes to diseases: it's all in the data</p>
         </title>
         <aug>
            <au ca="yes" id="A1">
               <snm>Tiffin</snm>
               <fnm>Nicki</fnm>
               <insr iid="I1"/>
               <email>nickitiffin@imaginet.co.za</email>
            </au>
            <au id="A2">
               <snm>Andrade-Navarro</snm>
               <mi>A</mi>
               <fnm>Miguel</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Perez-Iratxeta</snm>
               <fnm>Carolina</fnm>
               <insr iid="I3"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>MRC/UWC/SANBI Bioinformatics Capacity Development Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville 7535, South Africa</p>
            </ins>
            <ins id="I2">
               <p>Max-Delbr&#252;ck Center for Molecular Medicine, Robert R&#246;ssle Strasse 10, 13125 Berlin, Germany</p>
            </ins>
            <ins id="I3">
               <p>Ottawa Hospital Research Institute, 501 Smyth Road, Ottawa, Ontario K1H 8L6, Canada</p>
            </ins>
         </insg>
         <source>Genome Medicine</source>
         <issn>1756-994X</issn>
         <pubdate>2009</pubdate>
         <volume>1</volume>
         <issue>8</issue>
         <fpage>77</fpage>
         <url>http://genomemedicine.com/content/1/8/77</url>
         <xrefbib>
            
         <pubidlist><pubid idtype="pmpid">19678910</pubid><pubid idtype="doi">10.1186/gm77</pubid></pubidlist></xrefbib>
      </bibl>
      <history>
         <pub>
            <date>
               <day>07</day>
               <month>08</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>BioMed Central Ltd</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Genome-wide association analyses on large patient cohorts are generating large sets of candidate disease genes. This is coupled with the availability of ever-increasing genomic databases and a rapidly expanding repository of biomedical literature. Computational approaches to disease-gene association attempt to harness these data sources to identify the most likely disease gene candidates for further empirical analysis by translational researchers, resulting in efficient identification of genes of diagnostic, prognostic and therapeutic value. Existing computational methods analyze gene structure and sequence, functional annotation of candidate genes, characteristics of known disease genes, gene regulatory networks, protein-protein interactions, data from animal models and disease phenotype. To date, a few studies have successfully applied computational analysis of clinical phenotype data for specific diseases and shown genetic associations. In the near future, computational strategies will be facilitated by improved integration of clinical and computational research, and by increased availability of clinical phenotype data in a format accessible to computational approaches.</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p/>
         </st>
         <p>Historically, disease phenotype has informed the selection of candidate disease genes through observations of the effects of perturbations in these candidates <it>in vitro</it>, in tissue cultures and in animal models. This hypothesis-driven approach is increasingly being superseded by genome-wide analyses that assume no prior knowledge of the underlying genotype, and hypotheses about the associated genes are inferred from large-scale genetic studies of samples with the disease phenotype. These studies include genome-wide linkage and association studies in affected and healthy patient populations to identify chromosomal regions most likely to contain etiological genes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>, and the detailed analysis of genome-wide changes in the disease state by high-throughput techniques, such as single nucleotide polymorphism (SNP) <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and microarray expression analysis <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, serial analysis of gene expression (SAGE) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> and cap analysis of gene expression (CAGE) <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Current approaches include next generation sequencing of linked regions, high-density SNP analysis and the study of copy number variation <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>Typically, genome-wide approaches generate large sets of potential genetic associations for further analysis; for example, multifactorial disease loci identified by linkage analysis can be approximately 30 Mb in size and contain several hundred genes <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. This synergizes with ongoing research on many complex diseases, in which multiple gene variations, rather than single dysfunctional genes, are believed to underlie the disease phenotype <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Genome-wide analyses have therefore massively increased the number of candidate genes to be investigated for a given phenotype.</p>
         <p>Concurrently, available genetic information has increased as a result of more sophisticated experimental methods and centralization of genetic information in public genome databases (such as Ensembl <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, NCBI <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and UCSC <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>), gene expression databases (such as GEO <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>) and human variation databases (such as HapMap <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). Additional data on gene regulatory networks and pathways are becoming increasingly accessible (for example, KEGG <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and Reactome <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>). In addition, biomedical literature has become too massive a resource to be assimilated by individuals (for example, 17.8 million abstracts are listed by PubMed in May 2009, of which 10 million deal with human data).</p>
         <p>The subsequent challenge is to use this wide variety of data sources to identify relevant disease gene candidates within the lists of genes generated from genome-wide analyses for further empirical research, an overwhelming task to undertake manually. Computational analysis can facilitate efficient and accurate utilization of all such sources of information, and the resulting early prioritization allows streamlined empirical research and quicker and cheaper identification of disease-causing genes.</p>
         <p>To date, many computational methods have focused on the prediction of candidates by analysis of inherent sequence characteristics of genes, sequence similarity to known disease genes, and functional annotation of candidate genes <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. These approaches are briefly reviewed here. The computational analysis of phenotypes for the prioritization of disease candidates is less utilized, and is explored later in this article (Figure <figr fid="F1">1</figr>).</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Miners in Germany (1952)</p>
            </caption>
            <text>
               <p>Miners in Germany (1952). As with mining of minerals, data mining of associations between genes and diseases can be dirty and disheartening, but the potential for reward is great. Photo: G&#252;nther Paalzow. Reproduced from Bundesarchiv, Bild 183-13175-0020 under the Creative Commons Attribution ShareAlike 3.0 license (Germany).</p>
            </text>
            <graphic file="gm77-1"/>
         </fig>
      </sec>
      <sec>
         <st>
            <p>Approaches for the identification of candidate disease genes</p>
         </st>
         <sec>
            <st>
               <p>Using intrinsic gene properties</p>
            </st>
            <p>By analyzing the intrinsic properties of genes already associated with an inherited disease regardless of its phenotype, differences can be found between disease genes and all human genes. The pattern of differences can be used to predict novel disease genes. Properties associated with disease genes include gene structure, such as longer size of the gene and its associated proteins, and longer regulatory regions such as the mRNA 3' un-translated region (UTR); phylogenetic information, including lower mutation rates, broader phylogenetic breadth and fewer paralogs (that is, fewer highly similar genes giving less opportunity for functional redundancy); and genomic properties such as a higher proportion of CpG islands in promoters and longer intergenic distances.</p>
            <p>The first method to apply this type of approach was DGP <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, followed by PROSPECTR <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, which included additional gene properties (for an extensive review of these approaches see <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>). Such analyses rely on the definition of genes as 'disease genes' and 'non-disease genes' and, although suited to analysis of monogenic (Mendelian) diseases, such approaches may preclude the selection of genes that do not produce an obvious phenotype but rather contribute to disease susceptibility or the severity of the effect of a simultaneous mutation in another gene. The efficacy of such approaches thus becomes limited in the study of complex phenotypes, in which the association between the gene and disease may not be one of direct or exclusive causation.</p>
         </sec>
         <sec>
            <st>
               <p>Similarity to genes previously associated with disease</p>
            </st>
            <p>Several methods of associating genes with diseases rely on the functional annotation of the gene and, under the hypothesis that similar diseases may have associated genes with similar functions, propose associations on the basis of genes already known to be associated with a disease. This approach is supported by multiple lines of evidence and is a logical way to initiate a search for candidate genes. For example, genes related to the detection or synthesis of neurotransmitters are likely to be good candidates for association with neural disorders, or immune-related genes with asthma and allergy phenotypes.</p>
            <p>This is a logical inference, but when there is a growth in the number of gene candidates it becomes difficult to get all the information on known diseases and related literature manually, and computational approaches are helpful. Computational analyses take advantage of both controlled vocabularies describing disease features (such as MeSH terms - an ontology developed at the National Library of Medicine covering different subject categories, including disease phenotype <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>) and similarity between gene functions measured by using their annotations with controlled vocabularies, such as the Gene Ontology <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Methods such as G2D <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, POCUS <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, ENDEAVOUR <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and TOM <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> use this approach.</p>
            <p>A limitation of methods relying on the functional annotations of genes is that just a small percentage of genes in the databases have an experimentally verified function (6% have links to non-genomic literature <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>). Most annotation (for approximately 71% of genes <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>) is based on functions assumed to associate with predicted protein domains from manually curated databases (such as the Gene Ontology Annotation (GOA) project <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>).</p>
         </sec>
         <sec>
            <st>
               <p>Implication of genes in regulatory networks or in protein-protein interaction networks</p>
            </st>
            <p>Information from interactions between genes can be used to find disease-related genes. These data are available from multiple public resources and may describe protein-protein interactions (such as STRING <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and UniHI <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>), proteins regulating gene expression (such as TRANSFAC <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>), and metabolic pathways (such as KEGG <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>). Some of these categories can overlap to some extent with functional annotations (for example, several genes encoding proteins from the same pathway or protein complex may be described by the same functional annotation comprising a common Gene Ontology term).</p>
            <p>The assumption made is that if two genes work together, the known association of one with a disease suggests that the other may also be associated with the same disease. For example, mutations in different subunits of the sarcoglycan complex can result in muscular dystrophy <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. For genes in a regulatory cascade, if the mutation of a gene produces a given phenotype, then mutations in genes further upstream, such as a transcription factor for the downstream gene or a protein kinase that phosphorylates it, could result in the same phenotype. Methods such as ENDEAVOUR <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and recent versions of G2D <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> exploit this hypothesis.</p>
         </sec>
         <sec>
            <st>
               <p>Gene expression information</p>
            </st>
            <p>The methods described earlier can be complemented using gene expression data. This can be done in relation to the particular disease under analysis (for example selecting genes that are expressed in an affected tissue, such as neural tissue in the case of a neurodegenerative disease); or gene co-expression can be used as another measure of gene similarity to find associations between genes. The second approach is based on the premise that genes acting together will be expressed together, as seen for subunits of protein complexes (such as is described in <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B25">25</abbr><abbr bid="B34">34</abbr></abbrgrp>).</p>
         </sec>
         <sec>
            <st>
               <p>Disease phenotypes</p>
            </st>
            <p>Clinical knowledge is fundamental to defining disease phenotype, and some existing methods aim to make use of this knowledge directly. For example, GeneSeeker <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> is a web tool that uses phenotype search terms input directly by the researcher and filters positional candidate disease genes based on expression and phenotypic data. The Database of Chromosomal Imbalance and Phenotype in Humans using Ensembl Resources (DECIPHER) centralizes clinician-sourced phenotype data and their relationship to copy number variation <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> and is ground-breaking in its aim to make accessible a global wealth of phenotype descriptions submitted directly by clinicians.</p>
            <p>Animal models allow in-depth studies of phenotypic variation associated with genes, which are impossible in human subjects (for example, genetic manipulations such as gene knockouts). The data thus generated can be used to associate human genes with phenotypes according to the properties of orthologous genes in such model organisms. In some cases, phenotype association may be in the form of quantitative trait loci (QTLs) involving a number of genes. These methods have to overcome the challenge of identifying the appropriate orthology relations between human and animal genes, which becomes harder with increasing evolutionary distance between the species under study and humans. This approach is used by methods such as GeneSeeker <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and ToppGene <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> with mouse phenotype data, and Fraser and Plotkin have used a similar approach with yeast data <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>.</p>
            <p>Some available methods define the disease phenotype in a formalized way that involves the use of existing or customized ontologies. As ontologies are formal representations of a set of concepts within a domain of knowledge and the relationships between those concepts, they are preferable to a definition of the problem phenotype by means of a mere set of keywords provided by the user. Ontologies can facilitate optimal use of available knowledge because many pieces of information can be linked through queries in the databases in which they are used. For example, most of the articles in MEDLINE are annotated with MeSH terms. In this way, they can directly link the phenotype described by a MeSH term to the information contained in the article annotated with it. Phenotype ontologies are used to mine textual databases, such as MEDLINE abstracts in PubMed and/or Online Mendelian Inheritance in Man (OMIM) records, and relate them to gene features and lists of candidate genes. eVOC, a controlled vocabulary for unifying gene expression data, is a purposely developed anatomical ontology that can integrate text mining of biomedical literature and data mining of available human gene expression data <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. The GFINDer method uses an ontology developed from OMIM entries <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. G2D uses disease MeSH terms linked in the OMIM record associated with the phenotype of interest to link phenotypes and Gene Ontology terms <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. PhenoGO, another ontology that assigns phenotypic context to Gene Ontology annotations, also mines the literature to associate phenotypes to Gene Ontology terms <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. A Human Phenotype Ontology has been developed and used to annotate OMIM entries <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, and the broader Mammalian Phenotype Ontology <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> is used in both the Mouse and Rat Genome Databases <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>.</p>
            <p>Despite the obvious limitations of transferring information between animal models and humans, the broad range of phenotypic measures that can be obtained from animals is impossible to collect from humans. In the context of complex phenotypes, mice are being predominantly used to study the (usually small) quantitative phenotypic differences associated with a genetic variation (QTLs) <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Large-scale projects are underway to induce knockouts in mice and analyze the corresponding genotypes by high-throughput techniques <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. These projects will need extensive and more sophisticated annotation systems such as Phenotype and Trait Ontology (PATO <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>), which combines existing phenotype ontologies with phenotype qualities; for example 'insect eye', from the fly anatomy ontology, can bear the quality 'red', giving the combined 'red eye' phenotype.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Application of computational approaches to specific diseases</p>
         </st>
         <p>Only a few studies so far have used phenotype-based computational approaches to identify disease genes and followed this through in patient samples. The GeneSeeker tool was used to prioritize candidates for skeletal dysplasia, and the contribution of a selected candidate to the disease phenotype was demonstrated <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. In this study, linkage analysis identified 77 candidate genes in a 17.1 cM interval. GeneSeeker identified the disease-causing gene as <it>RMRP </it>(an untranslated RNA gene), and its etiological role was confirmed in patients with the disease. Mutations in this gene have been identified previously as disease-causing in milder types of autosomal recessive skeletal dysplasias with differing phenotypes; identification of this additional disease phenotype associated with the gene, however, has furthered understanding of gene function and suggested its involvement in other related disease phenotypes.</p>
         <p>The G2D method was used to prioritize candidate genes for asthma and atopy (a type of allergic hypersensitivity) at two previously linked loci in a French Canadian population <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. Ten genes were selected by G2D for a subsequent association study, and SNPs within the candidate genes were genotyped and analyzed using a family-based association test. The results suggested a protective association with allergic asthma for the protein tyrosine phosphatase gene <it>PTPRE </it>in this French Canadian population, although the association could not be replicated in a different cohort <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
         <p>Other than these translational studies, computational analyses have generally been applied to specific diseases and where there are known etiological genes for the disease in question, the accuracy of the results has been assessed by the ranking of these known genes. The candidates that have been flagged as most likely and warranting further empirical research are published for use by the research community, as in our previous studies on candidate genes for metabolic syndrome <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> and type 2 diabetes (T2D) <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. In these, we used multiple computational techniques, including those based on phenotype <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B39">39</abbr></abbrgrp>, for disease gene prioritization, using sets of genes defined by multiple linkage analysis data available through the biomedical literature as starting point.</p>
         <p>In the case of metabolic syndrome <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, we initially selected candidates for discrete phenotypes that are associated with the disorder from a starting set of 13,882 genes, and identified candidate genes showing commonality across multiple phenotypes. The phenotype-specific candidates were then weighted according to the prevalence of each phenotype in patient populations, with 19 candidates prioritized as the most likely etiological genes. For T2D, we used multiple computational approaches to identify obesity- and T2D-specific candidates from a starting set of 9,556 positional candidates. This allowed us to generate a final list of nine primary T2D candidates, two of which were also primary candidates for obesity. A SNP in the lipoprotein lipase gene <it>LPL</it>, which was one of the proposed two top candidates, has since been associated with T2D in Korean patients <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>.</p>
         <p>Similar approaches have been used for other diseases, such as the use of multiple existing computational methods for prediction of genes associated with osteoporosis <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, and the use of extensive phenotype data to select candidate etiological genes for fetal alcohol syndrome<abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. ENDEAVOUR has also been used to prioritize candidate genes for a variety of phenotypes, reviewed in <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Future directions for computational prioritization of candidate disease genes</p>
         </st>
         <p>With increased understanding and availability of human genome and transcriptome data, additional resources can refine the computational prioritization of candidate disease genes. These include data on copy number variation, which has already been used to identify candidate genes for autism <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, and on RNA editing in candidate genes <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Phenotype can be affected by perturbations in additional elements such as long non-coding genes <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>; long range non-coding RNAs, as identified for short stature phenotypes <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> and cancer <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>; natural antisense transcripts <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>; promoter elements, such as those associated with degenerative heart disease <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>; and microRNAs <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. Epidemiological data for disease occurrence used in conjunction with genome-wide data on population variation <abbrgrp><abbr bid="B65">65</abbr></abbrgrp> can facilitate associations between disease phenotypes prevalent in particular populations and their underlying genotypes. Finally, collation and standardization of phenotype data (as undertaken in the DECIPHER project <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>) and the further development of phenotype ontologies that have an appropriate degree of granularity and are accessible to scientists are essential for the compilation of clinical phenotype data in a format that allows the computational analysis of associations between disease phenotype and genotype.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Understanding underlying disease genetics is crucial for the development of appropriate disease-specific diagnostic, prognostic and therapeutic approaches, and increasing the efficiency of this process can result in substantial progress in the clinical management of disease. Computational approaches for the identification of disease genes have contributed significantly to our understanding of gene and protein characteristics. These include the tendency of enzymes and transporters to underlie recessive diseases, while transcription regulators and structural molecules often underlie dominant inheritance <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. More generally, they have shown evidence that disease gene function and expression patterns correlate with the type of disease they cause <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. The computational analysis of disease phenotypes has revealed the tendency for similar disease phenotypes to be caused by functionally related genes <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. Such analyses have also shown that the phenotypic similarity between syndromes correlates with the sequence similarity of their associated genes <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B68">68</abbr></abbrgrp>. We believe, however, that there is great scope to better harness clinical phenotypic data to improve computational disease gene prioritization. In an ideal scenario, extensive clinical phenotypic data would be available to computational scientists to use in conjunction with genome-wide empirical data, allowing for effective prediction of most likely disease gene candidates and leading to rapid and economical empirical identification of etiological genes.</p>
         <p>For this to become a reality, several objectives need first to be realized. Most importantly, clinical phenotype data need to be routinely standardized in an accessible, patient-anonymous format for computational use. To this end, prospective studies on patient populations could include a computational component at the design stage, so that standardized clinical/phenotypic data can be collected throughout the study. To ensure that this becomes standard practice, however, clinicians need to be convinced of the utility of computational approaches in determining candidate disease genes, and computational studies for specific diseases need to reach the appropriate clinical audience. Collaborative studies between clinical researchers and computational scientists are invaluable in bridging this gap and promoting recognition of computational applications in clinical research, but these are not yet standard practice.</p>
         <p>In addition, many computational scientists focus on the development of novel generic approaches for candidate gene prediction for diseases in general. This should not, however, preclude the application of existing methods to specific diseases. Such disease-specific studies, presented in a format accessible to non-computational researchers, would facilitate translation of computational research into the clinical environment and promote recognition of the role of computational studies in disease gene identification.</p>
         <p>The ability to investigate the genetics underlying disease phenotypes at a genome-wide level will result in more rapid disease gene identification, as genome-wide analyses can return multiple potential candidate genes simultaneously, rather than verifying or refuting the implication of individual genes in a sequential way. This transition is serendipitous, given that the field of disease genetics is moving on from Mendelian diseases and focusing on complex diseases in which multiple etiological genes are believed to act in concert <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. As increasingly sophisticated techniques uncover the stronger and more frequent gene-disease associations, research techniques will shift towards defining our understanding of more subtle or indirect effects of genes on disease phenotype, in parallel with our increased understanding of the subtleties and complexities of the biological mechanisms of the human cell.</p>
         <p>The challenge now lies in finding relevant candidates within the lists of potential disease genes generated from genome-wide approaches. Computational methods are well suited to the systematic analysis of these large gene lists to generate encompassing hypotheses about disease genotype, predict the most likely disease gene candidates from large datasets, and rapidly disseminate the results to clinical researchers performing translational research. Computational disease gene prediction can thus contribute substantially to faster and more cost-efficient empirical identification of disease-causing genes.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>MeSH: Medical Subject Heading; OMIM: Online Mendelian Inheritance in Man; QTL: quantitative trait locus; SNP: single nucleotide polymorphism; T2D: type 2 diabetes.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>NT drafted the initial manuscript and coordinated further work on it. All authors wrote the manuscript and have read and approved the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>MAA-N acknowledges funding support from the Helmholtz Alliance on Systems Biology. NT is supported by the South African Medical Research Council.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Meta-analysis in genome-wide association studies.</p>
            </title>
            <aug>
               <au>
                  <snm>Zeggini</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ioannidis</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Pharmacogenomics</source>
            <pubdate>2009</pubdate>
            <volume>10</volume>
            <fpage>191</fpage>
            <lpage>201</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2217/14622416.10.2.191</pubid>
                  <pubid idtype="pmcid">2695132</pubid>
                  <pubid idtype="pmpid" link="fulltext">19207020</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>What can genome-wide association studies tell us about the genetics of common disease?</p>
            </title>
            <aug>
               <au>
                  <snm>Iles</snm>
                  <fnm>MM</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>e33</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pgen.0040033</pubid>
                  <pubid idtype="pmcid">2323402,2323402</pubid>
                  <pubid idtype="pmpid" link="fulltext">18454206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Guilt beyond a reasonable doubt.</p>
            </title>
            <aug>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Daly</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>813</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0707-813</pubid>
                  <pubid idtype="pmpid" link="fulltext">17597768</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A new multipoint method for genome-wide association studies by imputation of genotypes.</p>
            </title>
            <aug>
               <au>
                  <snm>Marchini</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Howie</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McVean</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Donnelly</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2007</pubdate>
            <volume>39</volume>
            <fpage>906</fpage>
            <lpage>913</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng2088</pubid>
                  <pubid idtype="pmpid" link="fulltext">17572673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Integrating global gene expression analysis and genetics.</p>
            </title>
            <aug>
               <au>
                  <snm>Farber</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Lusis</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Adv Genet</source>
            <pubdate>2008</pubdate>
            <volume>60</volume>
            <fpage>571</fpage>
            <lpage>601</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">full_text</pubid>
                  <pubid idtype="pmpid" link="fulltext">18358333</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Deep analysis of cellular transcriptomes - LongSAGE versus classic MPSS.</p>
            </title>
            <aug>
               <au>
                  <snm>Hene</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sreenu</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Vuong</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Abidi</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Rowland-Jones</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>EJ</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>333</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-8-333</pubid>
                  <pubid idtype="pmcid">2104538</pubid>
                  <pubid idtype="pmpid" link="fulltext">17892551</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Deep cap analysis gene expression (CAGE): genome-wide identification of promoters, quantification of their expression, and network inference.</p>
            </title>
            <aug>
               <au>
                  <snm>de Hoon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Biotechniques</source>
            <pubdate>2008</pubdate>
            <volume>44</volume>
            <fpage>627</fpage>
            <lpage>628</lpage>
            <note>630, 632.</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2144/000112802</pubid>
                  <pubid idtype="pmpid" link="fulltext">18474037</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The impact of next-generation sequencing technology on genetics.</p>
            </title>
            <aug>
               <au>
                  <snm>Mardis</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <fpage>133</fpage>
            <lpage>141</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18262675</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Association of genes to genetically inherited diseases using data mining.</p>
            </title>
            <aug>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Andrade</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>316</fpage>
            <lpage>319</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12006977</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Searching for genetic determinants in the new millennium.</p>
            </title>
            <aug>
               <au>
                  <snm>Risch</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>405</volume>
            <fpage>847</fpage>
            <lpage>856</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35015718</pubid>
                  <pubid idtype="pmpid" link="fulltext">10866211</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Ensembl 2008.</p>
            </title>
            <aug>
               <au>
                  <snm>Flicek</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Beal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ballester</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Eyre</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fitzgerald</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fernandez-Banet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gr&#228;f</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haider</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Jenkinson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>K&#228;h&#228;ri</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kokocinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kulesha</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Megy</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D707</fpage>
            <lpage>D714</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm988</pubid>
                  <pubid idtype="pmcid">2238821</pubid>
                  <pubid idtype="pmpid" link="fulltext">18000006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Database resources of the National Center for Biotechnology Information.</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Canese</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chetvernin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Dicuccio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Feolo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Geer</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Helmberg</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kapustin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Khovayko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Landsman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Shumway</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sequeira</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Sirotkin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Souvorov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Starchenko</snm>
                  <fnm>G</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D13</fpage>
            <lpage>D21</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm1000</pubid>
                  <pubid idtype="pmcid">2238880</pubid>
                  <pubid idtype="pmpid" link="fulltext">18045790</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The UCSC Genome Browser Database: update 2009.</p>
            </title>
            <aug>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Karolchik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zweig</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Rhead</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Raney</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pheasant</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Giardine</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fujita</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dreszer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>D755</fpage>
            <lpage>D761</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn875</pubid>
                  <pubid idtype="pmcid">2686463</pubid>
                  <pubid idtype="pmpid" link="fulltext">18996895</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>NCBI GEO: archive for high-throughput functional genomic data.</p>
            </title>
            <aug>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Troup</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Wilhite</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Ledoux</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rudnev</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Evangelista</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>IF</fnm>
               </au>
               <au>
                  <snm>Soboleva</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tomashevsky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Phillippy</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Sherman</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Muertter</snm>
                  <fnm>RN</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>D885</fpage>
            <lpage>D890</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn764</pubid>
                  <pubid idtype="pmcid">2686538</pubid>
                  <pubid idtype="pmpid" link="fulltext">18940857</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A second generation human haplotype map of over 3.1 million SNPs.</p>
            </title>
            <aug>
               <au>
                  <cnm>International HapMap Consortium</cnm>
               </au>
               <au>
                  <snm>Frazer</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Ballinger</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Hinds</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Stuve</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Belmont</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Boudreau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hardenbol</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Leal</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Pasternak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Willis</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>449</volume>
            <fpage>851</fpage>
            <lpage>861</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06258</pubid>
                  <pubid idtype="pmcid">2689609</pubid>
                  <pubid idtype="pmpid" link="fulltext">17943122</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>KEGG for linking genomes to life and the environment.</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Araki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirakawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Katayama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Okuda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tokimatsu</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamanishi</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D480</fpage>
            <lpage>D484</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm882</pubid>
                  <pubid idtype="pmcid">2238879</pubid>
                  <pubid idtype="pmpid" link="fulltext">18077471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Reactome knowledgebase of human biological pathways and processes.</p>
            </title>
            <aug>
               <au>
                  <snm>Matthews</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gopinath</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gillespie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Caudy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Croft</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>de Bono</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Garapati</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hemish</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hermjakob</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jassal</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kanapin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mahajan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Vastrik</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>D'Eustachio</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>D619</fpage>
            <lpage>D622</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn863</pubid>
                  <pubid idtype="pmcid">2686536</pubid>
                  <pubid idtype="pmpid" link="fulltext">18981052</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The modular nature of genetic diseases.</p>
            </title>
            <aug>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>Clin Genet</source>
            <pubdate>2007</pubdate>
            <volume>71</volume>
            <fpage>1</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1399-0004.2006.00708.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17204041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Genome-wide identification of genes likely to be involved in human genetic disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Bigas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>3108</fpage>
            <lpage>3114</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh605</pubid>
                  <pubid idtype="pmcid">434425</pubid>
                  <pubid idtype="pmpid" link="fulltext">15181176</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Speeding disease gene discovery by sequence based candidate prioritization.</p>
            </title>
            <aug>
               <au>
                  <snm>Adie</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Evans</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Porteous</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Pickard</snm>
                  <fnm>BS</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>55</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-6-55</pubid>
                  <pubid idtype="pmcid">1274252</pubid>
                  <pubid idtype="pmpid" link="fulltext">15766383</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Predicting disease genes using protein-protein interactions.</p>
            </title>
            <aug>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>J Med Genet</source>
            <pubdate>2006</pubdate>
            <volume>43</volume>
            <fpage>691</fpage>
            <lpage>698</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1136/jmg.2006.041376</pubid>
                  <pubid idtype="pmcid">2564594</pubid>
                  <pubid idtype="pmpid" link="fulltext">16611749</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>United States National Library of Medicine, National Institutes of Health, Medical Subject Headings</p>
            </title>
            <url>http://www.nlm.nih.gov/mesh/meshhome.html</url>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>POCUS: mining genomic sequence annotation to predict disease genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Clutterbuck</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Semple</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>R75</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-4-11-r75</pubid>
                  <pubid idtype="pmcid">329128</pubid>
                  <pubid idtype="pmpid" link="fulltext">14611661</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>ENDEAVOUR update: a web resource for gene prioritization in multiple species.</p>
            </title>
            <aug>
               <au>
                  <snm>Tranchevent</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Barriot</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Van Vooren</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Van Loo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Coessens</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Aerts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>W377</fpage>
            <lpage>W384</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn325</pubid>
                  <pubid idtype="pmcid">2447805</pubid>
                  <pubid idtype="pmpid" link="fulltext">18508807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>TOM: enhancement and extension of a tool suite for in silico approaches to multigenic hereditary disorders.</p>
            </title>
            <aug>
               <au>
                  <snm>Masotti</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nardini</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rossi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bonora</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Romeo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Volinia</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Benini</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <fpage>428</fpage>
            <lpage>429</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm588</pubid>
                  <pubid idtype="pmpid" link="fulltext">18048394</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Towards completion of the Earth's proteome.</p>
            </title>
            <aug>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Palidwor</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Andrade-Navarro</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>1135</fpage>
            <lpage>1141</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.embor.7401117</pubid>
                  <pubid idtype="pmcid">2267224</pubid>
                  <pubid idtype="pmpid" link="fulltext">18059312</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The Gene Ontology Annotation (GOA) Database: sharing knowledge in Uniprot with Gene Ontology.</p>
            </title>
            <aug>
               <au>
                  <snm>Camon</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Magrane</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dimmer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Maslen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Binns</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D262</fpage>
            <lpage>D266</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh021</pubid>
                  <pubid idtype="pmcid">308756</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681408</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>STRING 8 - a global view on proteins and their functional interactions in 630 organisms.</p>
            </title>
            <aug>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chaffron</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Creevey</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Julien</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Simonovic</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>D412</fpage>
            <lpage>D416</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn760</pubid>
                  <pubid idtype="pmcid">2686466</pubid>
                  <pubid idtype="pmpid" link="fulltext">18940858</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>UniHI 4: new tools for query, analysis and visualization of the human protein-protein interactome.</p>
            </title>
            <aug>
               <au>
                  <snm>Chaurasia</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Malhotra</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Russ</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schnoegl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hanig</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wanker</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Futschik</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>D657</fpage>
            <lpage>D660</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkn841</pubid>
                  <pubid idtype="pmcid">2686569</pubid>
                  <pubid idtype="pmpid" link="fulltext">18984619</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The TRANSFAC project as an example of framework technology that supports the analysis of genomic regulation.</p>
            </title>
            <aug>
               <au>
                  <snm>Wingender</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>326</fpage>
            <lpage>332</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/bbn016</pubid>
                  <pubid idtype="pmpid" link="fulltext">18436575</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Autosomal recessive muscular dystrophy and mutations of the sarcoglycan complex.</p>
            </title>
            <aug>
               <au>
                  <snm>Duggan</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Hoffman</snm>
                  <fnm>EP</fnm>
               </au>
            </aug>
            <source>Neuromuscul Disord</source>
            <pubdate>1996</pubdate>
            <volume>6</volume>
            <fpage>475</fpage>
            <lpage>482</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-8966(96)00388-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9027858</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Update of the G2D tool for prioritization of gene candidates to inherited diseases.</p>
            </title>
            <aug>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Andrade-Navarro</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>W212</fpage>
            <lpage>W216</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm223</pubid>
                  <pubid idtype="pmcid">1933178</pubid>
                  <pubid idtype="pmpid" link="fulltext">17478516</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A human phenome-interactome network of protein complexes implicated in genetic disorders.</p>
            </title>
            <aug>
               <au>
                  <snm>Lage</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Karlberg</snm>
                  <fnm>EO</fnm>
               </au>
               <au>
                  <snm>Storling</snm>
                  <fnm>ZM</fnm>
               </au>
               <au>
                  <snm>Olason</snm>
                  <fnm>PI</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Rigina</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Hinsby</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Tumer</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Pociot</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tommerup</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2007</pubdate>
            <volume>25</volume>
            <fpage>309</fpage>
            <lpage>316</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1295</pubid>
                  <pubid idtype="pmpid" link="fulltext">17344885</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>GeneSeeker: extraction and integration of human disease-related information from web-based genetic databases.</p>
            </title>
            <aug>
               <au>
                  <snm>van Driel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Cuelenaere</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kemmeren</snm>
                  <fnm>PP</fnm>
               </au>
               <au>
                  <snm>Leunissen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Vriend</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>W758</fpage>
            <lpage>W761</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki435</pubid>
                  <pubid idtype="pmcid">1160196</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980578</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources.</p>
            </title>
            <aug>
               <au>
                  <snm>Firth</snm>
                  <fnm>HV</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Bevan</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Corpas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rajan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Van Vooren</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Moreau</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Pettett</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>NP</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2009</pubdate>
            <volume>84</volume>
            <fpage>524</fpage>
            <lpage>533</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ajhg.2009.03.010</pubid>
                  <pubid idtype="pmpid" link="fulltext">19344873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Improved human disease candidate gene prioritization using mouse phenotype.</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Aronow</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Jegga</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>392</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-8-392</pubid>
                  <pubid idtype="pmcid">2194797</pubid>
                  <pubid idtype="pmpid" link="fulltext">17939863</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Using protein complexes to predict phenotypic effects of gene mutation.</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Plotkin</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R252</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2007-8-11-r252</pubid>
                  <pubid idtype="pmcid">2258176</pubid>
                  <pubid idtype="pmpid" link="fulltext">18042286</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Integration of text- and data-mining using ontologies successfully selects disease gene candidates.</p>
            </title>
            <aug>
               <au>
                  <snm>Tiffin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kelso</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Powell</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bajic</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Hide</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>1544</fpage>
            <lpage>1552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki296</pubid>
                  <pubid idtype="pmcid">1065256</pubid>
                  <pubid idtype="pmpid" link="fulltext">15767279</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>GFINDer: genetic disease and phenotype location statistical analysis and mining of dynamically annotated gene lists.</p>
            </title>
            <aug>
               <au>
                  <snm>Masseroli</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Galati</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pinciroli</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>W717</fpage>
            <lpage>W723</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gki454</pubid>
                  <pubid idtype="pmcid">1160215</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>PhenoGO: an integrated resource for the multiscale mining of clinical and biological data.</p>
            </title>
            <aug>
               <au>
                  <snm>Sam</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>Mendonca</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lussier</snm>
                  <fnm>YA</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2009</pubdate>
            <volume>10</volume>
            <issue>Suppl 2</issue>
            <fpage>S8</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-10-S2-S8</pubid>
                  <pubid idtype="pmcid">2646241</pubid>
                  <pubid idtype="pmpid" link="fulltext">19208196</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Robinson</snm>
                  <fnm>PN</fnm>
               </au>
               <au>
                  <snm>Kohler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Seelow</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Horn</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mundlos</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2008</pubdate>
            <volume>83</volume>
            <fpage>610</fpage>
            <lpage>615</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ajhg.2008.09.017</pubid>
                  <pubid idtype="pmcid">2668030</pubid>
                  <pubid idtype="pmpid" link="fulltext">18950739</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>The Mammalian Phenotype Ontology as a tool for annotating, analyzing and comparing phenotypic information.</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Goldsmith</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>R7</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2004-6-1-r7</pubid>
                  <pubid idtype="pmcid">549068</pubid>
                  <pubid idtype="pmpid" link="fulltext">15642099</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>The Mouse Genome Database (MGD): mouse biology and model systems.</p>
            </title>
            <aug>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Kadin</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D724</fpage>
            <lpage>D728</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkm961</pubid>
                  <pubid idtype="pmcid">2238849</pubid>
                  <pubid idtype="pmpid" link="fulltext">18158299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The Rat Genome Database, update 2007 - easing the path from disease to data and back again.</p>
            </title>
            <aug>
               <au>
                  <snm>Twigger</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Shimoyama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bromberg</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kwitek</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Jacob</snm>
                  <fnm>HJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D658</fpage>
            <lpage>D662</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl988</pubid>
                  <pubid idtype="pmcid">1761441</pubid>
                  <pubid idtype="pmpid" link="fulltext">17151068</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Genome-wide genetic association of complex traits in heterogeneous stock mice.</p>
            </title>
            <aug>
               <au>
                  <snm>Valdar</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Solberg</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Gauguier</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Burnett</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Klenerman</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cookson</snm>
                  <fnm>WO</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Rawlins</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Mott</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Flint</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>879</fpage>
            <lpage>887</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1840</pubid>
                  <pubid idtype="pmpid" link="fulltext">16832355</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>The European dimension for the mouse genome mutagenesis program.</p>
            </title>
            <aug>
               <au>
                  <snm>Auwerx</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Avner</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Baldock</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ballabio</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Balling</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Barbacid</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Berns</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bradley</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Carmeliet</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chambon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Davidson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Duboule</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Forejt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Granucci</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hastie</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>de Angelis</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kioussis</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kollias</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lathrop</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lendahl</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Malumbres</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>von Melchner</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Partanen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ricciardi-Castagnoli</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rigby</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2004</pubdate>
            <volume>36</volume>
            <fpage>925</fpage>
            <lpage>927</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng0904-925</pubid>
                  <pubid idtype="pmcid">2716028</pubid>
                  <pubid idtype="pmpid" link="fulltext">15340424</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>A mouse for all reasons.</p>
            </title>
            <aug>
               <au>
                  <snm>Collins</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Rossant</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wurst</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>128</volume>
            <fpage>9</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.12.018</pubid>
                  <pubid idtype="pmpid" link="fulltext">17218247</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Phenotypic Quality Ontology, PATO</p>
            </title>
            <url>http://obofoundry.org/wiki/index.php/PATO:Main_Page</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Severely incapacitating mutations in patients with extreme short stature identify RNA-processing endoribonuclease RMRP as an essential cell growth regulator.</p>
            </title>
            <aug>
               <au>
                  <snm>Thiel</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Horn</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zabel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ekici</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Salinas</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gebhart</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>R&#252;schendorf</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Sticht</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Spranger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zweier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schmitt</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Reis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rauch</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2005</pubdate>
            <volume>77</volume>
            <fpage>795</fpage>
            <lpage>806</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/497708</pubid>
                  <pubid idtype="pmcid">1271388</pubid>
                  <pubid idtype="pmpid" link="fulltext">16252239</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Genes to diseases (G2D) computational method to identify asthma candidate genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Tremblay</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lemire</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Potvin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tremblay</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hunninghake</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Raby</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Hudson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Andrade-Navarro</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Laprise</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>PLoS One</source>
            <pubdate>2008</pubdate>
            <volume>3</volume>
            <fpage>e2907</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pone.0002907</pubid>
                  <pubid idtype="pmcid">2488373</pubid>
                  <pubid idtype="pmpid" link="fulltext">18682798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Prioritization of candidate disease genes for metabolic syndrome by computational analysis of its defining phenotypes.</p>
            </title>
            <aug>
               <au>
                  <snm>Tiffin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Okpechi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Andrade-Navarro</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Ramesar</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Physiol Genomics</source>
            <pubdate>2008</pubdate>
            <volume>35</volume>
            <fpage>55</fpage>
            <lpage>64</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1152/physiolgenomics.90247.2008</pubid>
                  <pubid idtype="pmpid" link="fulltext">18612082</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Computational disease gene identification: a concert of methods prioritizes type 2 diabetes and obesity candidate genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Tiffin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Adie</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>van Driel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Oti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lopez-Bigas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Perez-Iratxeta</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Andrade-Navarro</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Adeyemo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Patti</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Semple</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Hide</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>3067</fpage>
            <lpage>3081</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkl381</pubid>
                  <pubid idtype="pmcid">1475747</pubid>
                  <pubid idtype="pmpid" link="fulltext">16757574</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Association of lipoprotein lipase (LPL) single nucleotide polymorphisms with type 2 diabetes mellitus.</p>
            </title>
            <aug>
               <au>
                  <snm>Cho</snm>
                  <fnm>YS</fnm>
               </au>
               <au>
                  <snm>Go</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Cha</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>HT</fnm>
               </au>
               <au>
                  <snm>Min</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Cho</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kimm</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Exp Mol Med</source>
            <pubdate>2008</pubdate>
            <volume>40</volume>
            <fpage>523</fpage>
            <lpage>532</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.3858/emm.2008.40.5.523</pubid>
                  <pubid idtype="pmcid">2679352</pubid>
                  <pubid idtype="pmpid" link="fulltext">18985010</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Prediction of osteoporosis candidate genes by computational disease-gene identification strategy.</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>QY</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>YQ</fnm>
               </au>
               <au>
                  <snm>Kung</snm>
                  <fnm>AW</fnm>
               </au>
            </aug>
            <source>J Hum Genet</source>
            <pubdate>2008</pubdate>
            <volume>53</volume>
            <fpage>644</fpage>
            <lpage>655</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s10038-008-0295-x</pubid>
                  <pubid idtype="pmpid" link="fulltext">18463784</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Computational selection and prioritization of candidate genes for fetal alcohol syndrome.</p>
            </title>
            <aug>
               <au>
                  <snm>Lombard</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Tiffin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hofmann</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Bajic</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Hide</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Ramsay</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>389</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-8-389</pubid>
                  <pubid idtype="pmcid">2194724</pubid>
                  <pubid idtype="pmpid" link="fulltext">17961254</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Gene-network analysis identifies susceptibility genes related to glycobiology in autism.</p>
            </title>
            <aug>
               <au>
                  <snm>Zwaag</snm>
                  <mnm>van der</mnm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Franke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Poot</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hochstenbach</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Spierenburg</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Vorstman</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>van Daalen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>de Jonge</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Verbeek</snm>
                  <fnm>NE</fnm>
               </au>
               <au>
                  <snm>Brilstra</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>van 't Slot</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ophoff</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>van Es</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Blauw</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Veldink</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Buizer-Voskamp</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Beemer</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Berg</snm>
                  <mnm>van den</mnm>
                  <fnm>LH</fnm>
               </au>
               <au>
                  <snm>Wijmenga</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>van Amstel</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>van Engeland</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Burbach</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Staal</snm>
                  <fnm>WG</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2009</pubdate>
            <volume>4</volume>
            <fpage>e5324</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pone.0005324</pubid>
                  <pubid idtype="pmcid">2683930</pubid>
                  <pubid idtype="pmpid" link="fulltext">19492091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing.</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Levanon</snm>
                  <fnm>EY</fnm>
               </au>
               <au>
                  <snm>Yoon</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Aach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Leproust</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2009</pubdate>
            <volume>324</volume>
            <fpage>1210</fpage>
            <lpage>1213</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1170995</pubid>
                  <pubid idtype="pmpid" link="fulltext">19478186</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Long non-coding RNAs: insights into functions.</p>
            </title>
            <aug>
               <au>
                  <snm>Mercer</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Dinger</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2009</pubdate>
            <volume>10</volume>
            <fpage>155</fpage>
            <lpage>159</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg2521</pubid>
                  <pubid idtype="pmpid" link="fulltext">19188922</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Long-range conserved non-coding SHOX sequences regulate expression in developing chicken limb and are associated with short stature phenotypes in human patients.</p>
            </title>
            <aug>
               <au>
                  <snm>Sabherwal</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bangs</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Weiss</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jantz</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tiecke</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hinkel</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Spaich</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hauffa</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Kamp</snm>
                  <mnm>van der</mnm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kapeller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tickle</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rappold</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2007</pubdate>
            <volume>16</volume>
            <fpage>210</fpage>
            <lpage>222</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/ddl470</pubid>
                  <pubid idtype="pmpid" link="fulltext">17200153</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Action at a distance: epigenetic silencing of large chromosomal regions in carcinogenesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2007</pubdate>
            <volume>16</volume>
            <issue>Spec 1</issue>
            <fpage>R88</fpage>
            <lpage>R95</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/ddm051</pubid>
                  <pubid idtype="pmpid" link="fulltext">17613553</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>In search of antisense.</p>
            </title>
            <aug>
               <au>
                  <snm>Lavorgna</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dahary</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lehner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sorek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sanderson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Casari</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2004</pubdate>
            <volume>29</volume>
            <fpage>88</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tibs.2003.12.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15102435</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Identification of gene co-regulatory modules and associated cis-elements involved in degenerative heart disease.</p>
            </title>
            <aug>
               <au>
                  <snm>Danko</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Pertsov</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>BMC Med Genomics</source>
            <pubdate>2009</pubdate>
            <volume>2</volume>
            <fpage>31</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1755-8794-2-31</pubid>
                  <pubid idtype="pmcid">2700136</pubid>
                  <pubid idtype="pmpid" link="fulltext">19476647</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>MicroRNA and mRNA integrated analysis (MMIA): a web tool for examining biological functions of microRNA expression.</p>
            </title>
            <aug>
               <au>
                  <snm>Nam</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Balch</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nephew</snm>
                  <fnm>KP</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>W356</fpage>
            <lpage>W362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkp294</pubid>
                  <pubid idtype="pmcid">2703907</pubid>
                  <pubid idtype="pmpid" link="fulltext">19420067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Genetic variants associated with complex human diseases show wide variation across multiple populations.</p>
            </title>
            <aug>
               <au>
                  <snm>Adeyemo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rotimi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Public Health Genomics</source>
            <pubdate>2009</pubdate>
            <note>doi:10.1159/000218711.</note>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">19439916</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Highly consistent patterns for inherited human diseases at the molecular level.</p>
            </title>
            <aug>
               <au>
                  <snm>Lopez-Bigas</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Blencowe</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>269</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti781</pubid>
                  <pubid idtype="pmpid" link="fulltext">16287936</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>A similarity-based method for genome-wide prediction of disease-relevant human genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Freudenberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Propping</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <issue>Suppl 2</issue>
            <fpage>S110</fpage>
            <lpage>S115</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12385992</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>A text-mining analysis of the human phenome.</p>
            </title>
            <aug>
               <au>
                  <snm>van Driel</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Bruggeman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vriend</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brunner</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Leunissen</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Eur J Hum Genet</source>
            <pubdate>2006</pubdate>
            <volume>14</volume>
            <fpage>535</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.ejhg.5201585</pubid>
                  <pubid idtype="pmpid" link="fulltext">16493445</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Improving the prediction of complex diseases by testing for multiple disease-susceptibility genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Khoury</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Botto</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Flanders</snm>
                  <fnm>WD</fnm>
               </au>
            </aug>
            <source>Am J Hum Genet</source>
            <pubdate>2003</pubdate>
            <volume>72</volume>
            <fpage>636</fpage>
            <lpage>649</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/367923</pubid>
                  <pubid idtype="pmcid">1180239</pubid>
                  <pubid idtype="pmpid" link="fulltext">12592605</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>