<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gm39</ui>
   <ji>1756-994X</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>A kernel-based integration of genome-wide data for clinical decision support</p>
         </title>
         <aug>
            <au ca="yes" id="A1"><snm>Daemen</snm><fnm>Anneleen</fnm><insr iid="I1"/><email>anneleen.daemen@esat.kuleuven.be</email></au>
            <au id="A2"><snm>Gevaert</snm><fnm>Olivier</fnm><insr iid="I1"/><email>olivier.gevaert@esat.kuleuven.be</email></au>
            <au id="A3"><snm>Ojeda</snm><fnm>Fabian</fnm><insr iid="I1"/><email>fabian.ojeda@esat.kuleuven.be</email></au>
            <au id="A4"><snm>Debucquoy</snm><fnm>Annelies</fnm><insr iid="I2"/><email>annelies.debucquoy@med.kuleuven.be</email></au>
            <au id="A5"><snm>Suykens</snm><mi>AK</mi><fnm>Johan</fnm><insr iid="I1"/><email>johan.suykens@esat.kuleuven.be</email></au>
            <au id="A6"><snm>Sempoux</snm><fnm>Christine</fnm><insr iid="I3"/><email>christine.sempoux@clin.ucl.ac.be</email></au>
            <au id="A7"><snm>Machiels</snm><fnm>Jean-Pascal</fnm><insr iid="I4"/><email>jean-pascal.machiels@uclouvain.be</email></au>
            <au id="A8"><snm>Haustermans</snm><fnm>Karin</fnm><insr iid="I2"/><email>karin.haustermans@uz.kuleuven.ac.be</email></au>
            <au id="A9"><snm>De Moor</snm><fnm>Bart</fnm><insr iid="I1"/><email>bart.demoor@esat.kuleuven.be</email></au>
         </aug>
         <insg>
            <ins id="I1"><p>Department of Electrical Engineering (ESAT-SCD), Katholieke Universiteit Leuven, Kasteelpark Arenberg, 3001 Leuven, Belgium</p></ins>
            <ins id="I2"><p>Department of Experimental Radiotherapy, Katholieke Universiteit Leuven, UZ Herestraat, 3000 Leuven, Belgium</p></ins>
            <ins id="I3"><p>Department of Pathology, Universit&#233; Catholique de Louvain, St Luc University Hospital, Avenue Hippocrate, 1200 Brussels, Belgium</p></ins>
            <ins id="I4"><p>Department of Medical Oncology, Universit&#233; Catholique de Louvain, St Luc University Hospital, Avenue Hippocrate, 1200 Brussels, Belgium</p></ins>
         </insg>
         <source>Genome Medicine</source>
         <issn>1756-994X</issn>
         <pubdate>2009</pubdate>
         <volume>1</volume>
         <issue>4</issue>
         <fpage>39</fpage>
         <url>http://www.genomemedicine.com/content/1/4/39</url>
         <xrefbib><pubidlist><pubid idtype="pmpid">19356222</pubid><pubid idtype="doi">10.1186/gm39</pubid></pubidlist></xrefbib>
      </bibl>
      <history><rec><date><day>4</day><month>11</month><year>2008</year></date></rec><revrec><date><day>20</day><month>3</month><year>2009</year></date></revrec><acc><date><day>3</day><month>4</month><year>2009</year></date></acc><pub><date><day>3</day><month>4</month><year>2009</year></date></pub></history>
      <cpyrt><year>2009</year><collab>Daemen et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Although microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological conditions (for example, cancer) on transcription and translation. This increases the importance of fusing more than one source of genome-wide data, such as the genome, transcriptome, proteome, and epigenome. The current increase in the amount of available omics data emphasizes the need for a methodological integration framework.</p>
            </sec>
            <sec>
               <st>
                  <p>Methods</p>
               </st>
               <p>We propose a kernel-based approach for clinical decision support in which many genome-wide data sources are combined. Integration occurs within the patient domain at the level of kernel matrices before building the classifier. As supervised classification algorithm, a weighted least squares support vector machine is used. We apply this framework to two cancer cases, namely, a rectal cancer data set containing microarray and proteomics data and a prostate cancer data set containing microarray and genomics data. For both cases, multiple outcomes are predicted.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>For the rectal cancer outcomes, the highest leave-one-out (LOO) areas under the receiver operating characteristic curves (AUC) were obtained when combining microarray and proteomics data gathered during therapy and ranged from 0.927 to 0.987. For prostate cancer, all four outcomes had a better LOO AUC when combining microarray and genomics data, ranging from 0.786 for recurrence to 0.987 for metastasis.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>For both cancer sites the prediction of all outcomes improved when more than one genome-wide data set was considered. This suggests that integrating multiple genome-wide data sources increases the predictive performance of clinical decision support models. This emphasizes the need for comprehensive multi-modal data. We acknowledge that, in a first phase, this will substantially increase costs; however, this is a necessary investment to ultimately obtain cost-efficient models usable in patient tailored therapy.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Kernel methods are a powerful class of methods for pattern analysis. In recent years, they have become a standard tool in data analysis, computational statistics, and machine learning applications <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Based on a strong theoretical framework, their rapid uptake in applications such as bioinformatics <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, chemoinformatics, and even computational linguistics is due to their reliability, accuracy, and computational efficiency. In addition, they have the capability to handle a very wide range of data types (for example, kernel methods have been used to analyze sequences, vectors, networks, phylogenetic trees, and so on). The ability of kernel methods to deal with complex structured data makes them ideally positioned for heterogeneous data integration. More specifically, in this study we used a weighted least squares support vector machine (LS-SVM), an extension of the support vector machine (SVM) for supervised classification <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Compared to the SVM, the LS-SVM is easier and faster for high dimensional data because the quadratic programming problem is converted into a linear problem. To account for the unbalancedness in many two-class problems, this linear problem is extended with weights that are different for the positive and negative classes.</p>
         <p>The growing amount of data combined with factors such as time, cost, and personalized treatment is complicating clinical decision making. Using advanced mathematical models such as the above mentioned LS-SVM can aid clinical decision support because information arising from clinical risk factors (for example, tumor size, number of positive lymph nodes) is not accurate enough to reliably predict patient prognoses. Patients with the same clinical and pathological characteristics but different clinical outcomes can potentially be discerned with microarray technology. This technology investigates the transcriptomic make-up of a tumor in one experiment. A decade ago, it was first used in cancer studies to classify tissues as cancerous or non-cancerous <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. Within the domain of cancer, microarray technology has earned a prominent place for its capacity to characterize underlying tumor behavior in detail. Although the first gene expression profile signature is being validated in clinical trials <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>, microarray technology can not measure the complete transcription profile due to the limited number of probes per gene on a chip; nor does the transcriptome completely reflect the biology underlying a disease.</p>
         <p>Besides transcription, pathological conditions such as cancer also influence alternative splicing, chromosomal aberrations, and methylation <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. For example, chromosomal aberrations have been found in the general population as well as in all major tumor types <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. These regions of increased or decreased DNA copy number can be detected using, for example, array comparative genomic hybridization (CGH) technology. This technique measures copy number variations (CNVs) within the entire genome of a disease sample compared to a normal sample <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Many small aberrations have emerged as prognostic and predictive markers. Numerous aberrations, however, also affect large genomic regions, encompassing multiple genes or whole chromosome arms.</p>
         <p>Due to differential splicing or post-translational modifications such as phosphorylation or acetylation, the proteome is many orders of magnitude bigger than the transcriptome. This makes the proteome, which reflects the functional state of the cell, a potentially richer source of data for unraveling diseases <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. It can be measured using mass spectrometry <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, or protein or antibody microarrays <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Additionally, other available omics data, such as epigenomics - the study of epigenetic changes such as DNA methylation and histone modifications <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> - and single nucleotide polymorphism genotyping <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, should be considered as they promise to be useful in unraveling cancer mechanisms and the refinement of their molecular descriptions. Although the technologies are available, joint analysis of multiple hierarchical layers of biological regulation is at a preliminary stage.</p>
         <p>In this study we investigate whether the integration of information from multiple layers of biological regulation improves the prediction of cancer outcome.</p>
         <sec>
            <st>
               <p>Related work</p>
            </st>
            <p>Other research groups have already proposed the idea of data integration, but most groups have only investigated the integration of clinical and microarray data. Tibshirani and colleagues <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> proposed such a framework by reducing the microarray data to one variable, addable to models based on clinical characteristics such as age, grade, and size of the tumor. Nevins and colleagues <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> combined clinical risk factors with metagenes (that is, the weighted average expression of a group of genes) in a tree-based classification system. Wang <it>et al. </it>combined microarray data with knowledge on two clinicopathological variables by defining a gene signature only for the subset of patients for whom the clinicopathological variables were not sufficient to predict outcome <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
            <p>A further evolution can be seen in studies in which two omics data sources are simultaneously considered, in most cases microarray data combined with proteomics or array CGH data. Much literature on such studies involving data integration already exists. However, the current definition of the integration of high-throughput data sources as it is used in the literature differs from our point of view.</p>
            <p>In a first group of integration studies, heterogeneous data from different sources were analyzed sequentially; that is, one data source was analyzed while the second was used as confirmation of the found results or for further deepening the understanding of the results <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Such approaches are used for biological discovery and a better understanding of the development of a disease, but not for predictive purposes. For example, Fridlyand and colleagues <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> found three breast tumor subtypes with a distinct CNV pattern based on array CGH data. Microarray data were subsequently analyzed to identify the functional categories that characterized these subtypes. Tomioka <it>et al. </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp> analyzed microarray and array CGH data of patients with neuroblastoma in a similar way. Genomic signatures resulted from the array CGH data, while molecular signatures were found after the microarray analysis. The authors suggested that a combination of these independent prognostic indicators would be clinically useful.</p>
            <p>The term data integration has also been used as a synonym for data merging in which different data sets are concatenated at the database level by cross-referencing the sequence identifiers, which requires semantic compatibility among data sets <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Data merging is a complex task due to, for example, the use of different identifiers, the absence of a 'one gene-one protein' relationship, alternative splicing, and measurement of multiple signals for one gene. In most studies, the concordance between the merged data sets and their interpretation in the context of biological pathways and regulatory mechanisms are investigated. Analyses of the merged data set by clustering or correlating the protein and microarray data can help identify candidate targets when changes in expression occur at both the gene and protein levels. However, there has been only modest success from correlation studies of gene and protein expression. Bitton <it>et al</it>. <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> combined proteomics data with exon array data, which allowed a much more fine-grained analysis by assigning peptides to their originating exons instead of mapping transcripts and proteins based on their IDs.</p>
            <p>Our definition for the combination of heterogeneous biological data is different. We integrate multiple layers of experimental data into one mathematical model for the development of more homogeneous classifiers in clinical decision support. For this purpose, we present a kernel-based integration framework. Integration occurs within the patient domain at a level not so far described in the literature. Instead of merging data sets or analyzing them in turn, the variables from different omics data are treated equally. This leads to the selection of the most relevant features from all available data sources, which are combined in a machine learning-based model. We were inspired by the idea of Lanckriet and colleagues <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. They presented an integration framework in which each data set is transformed into a kernel matrix. Integration occurs on this kernel level without referring back to the data. They applied their framework to amino acid sequence information, expression data, protein-protein interaction data, and other types of genomic information to solve a single classification problem: the classification of transmembrane versus non-transmembrane proteins. In this study by Lanckriet and colleagues, all considered data sets were publicly available. This requires a computationally intensive framework for determining the relevance of each data set by solving an optimization problem. Within our set-up, however, all data sources are derived from the patients themselves. This makes the gathering of these data sets highly costly and limits the number of data sets, but guarantees more relevance for the problem at hand.</p>
            <p>We previously investigated whether the prediction of distant metastasis in breast cancer patients could be improved when considering microarray data besides clinical data <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. In this manuscript, we consider not only microarray data but also high-throughput data from multiple biological levels. Three different strategies for clinical decision support are proposed: the use of individual data sets (referred to as step A); an integration of each data type over time by manually calculating the change in expression (step B); and an approach in which data sets are integrated over multiple layers in the genome (and over time) by treating variables from the different data sets equally (step C).</p>
            <p>We apply our framework to two cases, summarized in Table <tblr tid="T1">1</tblr>. In the first case on rectal cancer, tumor regression grade, lymph node status, and circumferential margin involvement (CRM) are predicted for 36 patients based on microarray and proteomics data, gathered at two time points during therapy. The second case on prostate cancer involves microarray and copy number variation data from 55 patients. Tumor grade, stage, metastasis, and occurrence of recurrence were available for prediction <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>.</p>
            <tbl id="T1"><title><p>Table 1</p></title><caption><p>Overview of the two case studies on rectal and prostate cancer</p></caption><tblbdy cols="3">
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>Data set I: rectal cancer</p>
         </c>
         <c ca="left">
            <p>Data set II: prostate cancer</p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Number of samples</p>
         </c>
         <c ca="left">
            <p>36</p>
         </c>
         <c ca="left">
            <p>55</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Data sources</p>
         </c>
         <c ca="left">
            <p>Microarray</p>
         </c>
         <c ca="left">
            <p>Microarray</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>Proteomics</p>
         </c>
         <c ca="left">
            <p>Genomics</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Number of features (after preprocessing)</p>
         </c>
         <c ca="left">
            <p><it>T</it><sub>0</sub>: 6,913 genes; 90 proteins</p>
         </c>
         <c ca="left">
            <p>6,974 genes</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p><it>T</it><sub>1</sub>: 6,913 genes; 92 proteins</p>
         </c>
         <c ca="left">
            <p>7,305 CNVs</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Outcomes</p>
         </c>
         <c ca="left">
            <p>WHEELER</p>
         </c>
         <c ca="left">
            <p>GRADE</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>pN-STAGE</p>
         </c>
         <c ca="left">
            <p>STAGE</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>CRM</p>
         </c>
         <c ca="left">
            <p>METASTASIS</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>RECURRENCE</p>
         </c>
      </r>
   </tblbdy></tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Data set I: rectal cancer</p>
            </st>
            <sec>
               <st>
                  <p>Patients and treatment</p>
               </st>
               <p>Forty patients with rectal cancer (T3-T4 and/or N+) from seven Belgian centers were enrolled in a phase I/II study investigating the combination of cetuximab, capecitabine, and external beam radiotherapy in the preoperative treatment of patients with rectal cancer <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. These patients received preoperative radiotherapy (1.8 Gy, 5 days/week for 5 weeks) in combination with cetuximab (initial dose 400 mg/m<sup>2 </sup>intravenous given 1 week before the beginning of radiation followed by 250 mg/m<sup>2</sup>/week for 5 weeks) and capecitabine for the duration of radiotherapy (first dose level, 650 mg/m<sup>2</sup>orally twice-daily; second dose level, 825 mg/m<sup>2 </sup>twice-daily; including weekends). Details of the eligibility criteria, pretreatment evaluation, radiotherapy, chemotherapy and cetuximab administration, surgery, follow-up, and histopathological assessment of response to chemoradiation have been published <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Data preprocessing</p>
               </st>
               <p>Tissue and plasma samples were gathered at three time points: before treatment (<it>T</it><sub>0</sub>); after the first loading dose of cetuximab but before the start of radiotherapy with capecitabine (<it>T</it><sub>1</sub>); and at the moment of surgery (<it>T</it><sub>2</sub>). All experimental procedures were done following standard laboratory procedures, or following the manufacturers' instructions. Because of the exclusion of some patients due to a missing outcome value, death before surgery, or not having surgery, the data set ultimately contained 36 patients.</p>
               <p>The frozen tissue samples were hybridized to Affymetrix human U133 2.0 plus gene chip arrays. The resulting data were first preprocessed for each time point separately using robust multichip analysis <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Secondly, the number of features was reduced from 54,613 probe sets to 27,650 genes by taking the median of all probe sets that matched on the same gene. Probe sets that matched on multiple genes were excluded because of the danger of cross-hybridization. Taking into account the low signal-to-noise ratio of microarray data, we finally filtered out genes with low variation across all samples. Only retaining the genes with a variance in the top 25% reduced the number of features to 6,913 genes.</p>
               <p>Ninety-six proteins known to be involved in cancer were measured in the plasma samples using a Luminex 100 instrument. Proteins that had absolute values above the detection limit in less than 20% of the samples were excluded for each time point separately. This resulted in the exclusion of six proteins at <it>T</it><sub>0</sub>, four at <it>T</it><sub>1</sub>, and six at <it>T</it><sub>2</sub>. The proteomics expression values of transforming growth factor alpha, which had too many values below the detection limit, were replaced by the results of ELISA tests performed at the Department of Experimental Oncology in Leuven, Belgium. For the remaining proteins the missing values were replaced by half of the minimum detected for each protein over all samples, and values exceeding the upper limit were replaced by the upper limit value. Because most of the proteins had a positively skewed distribution, a log transformation (base 2) was performed.</p>
               <p>In this paper, only the data sets at <it>T</it><sub>0 </sub>and <it>T</it><sub>1 </sub>were used because our goal is to predict the four different outcomes before therapy or early in therapy.</p>
            </sec>
            <sec>
               <st>
                  <p>Response classification</p>
               </st>
               <p>A semiquantitative classification system has been described by Wheeler <it>et al. </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp> for determining histopathological tumor regression (that is, the therapy response). There are also two prognostic factors important in rectal cancer: pathologic lymph node involvement and CRM <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Because the completeness of tumor resection relies on the assessment of resection margins by the pathologist, knowledge of the CRM before therapy provides important prognostic information for local recurrence and for development of distant metastasis and survival <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
               <p>These three outcomes were registered for 36 patients at the moment of surgery. For all these outcomes, 'responders' are distinguished from 'non-responders'. The grading of regression established by Wheeler and colleagues <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> (from now on referred to as WHEELER) is a modified pathological staging system for irradiated rectal cancer. It includes a measurement of tumor response after preoperative therapy: grade 1, good responsiveness (tumor is sterilized or only microscopic foci of adenocarcinoma remain); grade 2, moderate responsiveness (marked fibrosis but still with a macroscopic tumor); grade 3, poor responsiveness (little or no fibrosis with abundant macroscopic tumor). Tumors are classified as 'responder' when assigned to grade 1 (26 patients) and 'non-responder' when assigned to grade 2 or 3 (10 patients). Response can also be evaluated with the pathologic lymph node stage at surgery (pN-STAGE). The 'responder' class contains 22 patients with no lymph nodes found at surgery while the 'non-responder' class contains 14 patients with at least 1 regional lymph node. CRM was measured according to the guidelines of Quirke <it>et al. </it><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. CRM was considered positive when the distance between the tumor and the mesorectal fascia was &#8804; 2 mm. Tumors with a negative CRM are classified as 'responder' (27 patients), while tumors with a positive CRM belong to the 'non-responder' class (9 patients). Thirteen patients belong to the 'responder' class for all three outcomes, while there is an overlap of two patients between the 'non-responder' classes.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Data set II: prostate cancer</p>
            </st>
            <sec>
               <st>
                  <p>Patients and treatment</p>
               </st>
               <p>We also applied our method to a publicly available data set of prostate cancer. Lapointe and colleagues <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> first profiled gene expression in 71 prostate tumor cases of which 62 were primary and 9 had lymph node metastases. All tumors were removed by radical prostatectomy (that is, the surgical removal of the prostate gland). A cDNA microarray was used, containing 39,711 human cDNAs representing 26,260 mapped genes. Additionally, DNA CNVs were profiled on cDNA microarrays for CGH for 64 prostate tumor cases, among which 55 were primary tumors and 9 had pelvic lymph node metastases. The arrays were obtained from the Stanford Functional Genomics Facility and included 39,632 human cDNAs corresponding to 22,279 genes <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Among the primary tumors, the available gene expression and genomics data were in common for 55.</p>
            </sec>
            <sec>
               <st>
                  <p>Data preprocessing</p>
               </st>
               <p>Median fluorescence ratios were calculated for genes represented by multiple arrayed cDNAs. Missing gene expression values were imputed unsupervised using the k-nearest neighbors method of Troyanskaya <it>et al. </it><abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The parameter k was set to 15 such that a missing value for a spot S in a sample was estimated as the weighted average of the 15 spots that are most similar to spot S in the remaining samples. The same unsupervised prefiltering as applied on the rectal cancer data set was used for both the microarray and genomics data sets. Features with a variance in the top 50% were retained, reducing the data sets to 6,974 genes and 7,305 CNVs, respectively.</p>
            </sec>
            <sec>
               <st>
                  <p>Response classification</p>
               </st>
               <p>Two pathological variables, stage and grade, metastasis of the tumor, as well as the outcome after prostatectomy defined as recurrence were considered. For grade (from now on referred to as GRADE), the Gleason Grading system was used, which is based on the most common and second most common architectural patterns of the glands of the tumor <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Two groups could be distinguished based on the architecture of the most common pattern: 36 tumors were well differentiated (that is, low-grade), 19 were poorly differentiated (that is, high-grade). According to the extent of the primary tumor (STAGE), 25 samples were of stage T2 (that is, the cancer is confined within one lobe of the prostate gland), while 25 samples were of advanced stage T3 (that is, the tumor has extended through the fibrous tissue surrounding the prostate gland but no other organs are affected). The stage of the remaining five patients was not known. The cancer had metastasized to distant lymph nodes in 12 tumors, while the cancer had not spread beyond the regional lymph nodes in 38 of the tumors (METASTASIS). Tumor recurrence was defined as a rise in prostate-specific antigen of at least 0.07 ng/ml or as occurrence of clinical metastasis (RECURRENCE). Seven tumors recurred while 22 tumors did not. The recurrence status of the remaining 26 patients was not available.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Kernel methods and weighted least squares support vector machines</p>
            </st>
            <p>Kernel methods are a group of algorithms that can handle a very wide range of data types, such as vectors, sequences, networks, and so on. They map the data <it>x </it>from the original input space to a high dimensional feature space with the mapping function &#934;(<it>x</it>). This embedding into the feature space is performed by a mathematical object <it>K</it>(<it>x</it><sub><it>k</it></sub>, <it>x</it><sub><it>l</it></sub>), called a 'kernel function'. This function efficiently computes the inner product &#10216;&#934;(<it>x</it><sub><it>k</it></sub>), &#934;(<it>x</it><sub><it>l</it></sub>)&#10217; between all pairs of data items x<sub><it>k </it></sub>and x<sub><it>l </it></sub>in the feature space, resulting in the kernel matrix. The size of this matrix is determined only by the number of data items, whatever the nature or the complexity of these items. For example, a set of 100 patients each characterized by 6,913 gene expression values is still represented by a 100 &#215; 100 kernel matrix <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The representation of all data sets by this real-valued square matrix, independent of the nature or complexity of the data to be analyzed, makes kernel methods ideally positioned for heterogeneous data integration.</p>
            <p>Any symmetric, positive semidefinite function is a valid kernel function, resulting in many possible kernels - for example, linear, polynomial, and diffusion kernels. They all correspond to a different transformation of the data, meaning that they extract a specific type of information from the data set. In this paper, the normalized linear kernel function:</p>
            <p>
               <display-formula>
                  <graphic file="gm39-i1.gif"/>
               </display-formula>
            </p>
            <p>where <inline-formula><graphic file="gm39-i2.gif"/></inline-formula> is used instead of the linear kernel function <inline-formula><graphic file="gm39-i3.gif"/></inline-formula>. With the normalized version, the values in the kernel matrix will be bounded because the data points are projected onto the unit sphere while these elements can take very large values without normalization. Normalizing is thus required when combining multiple data sources to guarantee the same order of magnitude for the kernel matrices of the data sets.</p>
            <p>A kernel algorithm for supervised classification is the SVM developed by Vapnik <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> and others. Contrary to most other classification methods and due to the way data are represented through kernels, SVMs can tackle high dimensional data (for example microarray data). Given a training set <inline-formula><graphic file="gm39-i4.gif"/></inline-formula> of N samples with feature vectors <it>x</it><sub><it>k </it></sub>&#8712; <it>R</it><sup><it>n </it></sup>and output labels <it>y</it><sub><it>k </it></sub>&#8712; {-1, +1}, the SVM forms a linear discriminant boundary <it>y</it>(<it>x</it>) = sign[<it>W</it><sup><it>T</it></sup>&#934;(<it>x</it>)+<it>b</it>] in the feature space with maximum distance between samples of the two considered classes, with <it>w </it>representing the weights for the data items in the feature space and <it>b </it>the bias term. This corresponds to a non-linear discriminant function in the original input space. A modified version of SVM, LS-SVM, was developed by Suykens <it>et al. </it><abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. On high dimensional data sets, this modified version is much faster for classification because a linear system instead of a quadratic programming problem needs to be solved.</p>
            <p>The constrained optimization problem for an LS-SVM has the following form:</p>
            <p>
               <display-formula>
                  <graphic file="gm39-i5.gif"/>
               </display-formula>
            </p>
            <p>subject to:</p>
            <p>
               <display-formula>
                  <graphic file="gm39-i6.gif"/>
               </display-formula>
            </p>
            <p>with <it>e</it><sub><it>k </it></sub>the error variables, tolerating misclassifications in cases of overlapping distributions, and <it>&#947; </it>the regularization parameter, which allows tackling the problem of overfitting. It has been shown that regularization seems to be very important when applying classification methods on high dimensional data <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>.</p>
            <p>In many two-class problems, data sets are skewed in favor of one class such that the contribution of false negative and false positive errors to the performance assessment criterion are not balanced. We therefore used a weighted LS-SVM in which a different weight <it>&#950;</it><sub><it>k </it></sub>is given to positive and negative samples in order to account for the unbalancedness in the data set <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The objective function changes into:</p>
            <p>
               <display-formula>
                  <graphic file="gm39-i7.gif"/>
               </display-formula>
            </p>
            <p>with</p>
            <p>
               <display-formula>
                  <graphic file="gm39-i8.gif"/>
               </display-formula>
            </p>
            <p>and <it>N</it><sub><it>P </it></sub>and <it>N</it><sub><it>N </it></sub>representing the number of positive and negative samples, respectively.</p>
         </sec>
         <sec>
            <st>
               <p>Feature selection</p>
            </st>
            <p>Univariate feature selection techniques are computationally simple but do not incorporate feature-feature interactions. However, due to small sample size limitations, multivariate approaches are often not appropriate for discovering the underlying complex, multivariate correlations. Because it has been shown that univariate gene selection methods lead to good and stable performances across many cancer types and yield in many cases consistently better results than multivariate approaches <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, we used the method DEDS (differential expression via distance synthesis) <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. This technique is based on the integration of different univariate test statistics via a distance synthesis scheme because features highly ranked simultaneously by multiple statistics are more likely to be differentially expressed than features highly ranked by a single test statistic. The statistical tests combined are ordinary fold changes, ordinary <it>t</it>-statistics, SAM (significance analysis for microarrays) statistics and moderated <it>t</it>-statistics. DEDS is available as a BioConductor package in R.</p>
            <p>We applied DEDS to the microarray data sets as well as the genomics data set. From our experience, DEDS is less appropriate for data with a limited set of features (data not shown). Since the proteomics data on rectal cancer contain only 90-92 cancer-related proteins, one test statistic suffices, for which we chose the Wilcoxon rank sum test.</p>
         </sec>
         <sec>
            <st>
               <p>Model building</p>
            </st>
            <p>To determine the optimal number of features, we use a leave-one-out (LOO) cross-validation approach in which we increase the number of included features iteratively according to the obtained feature ranking but in which we do not include more features than the number of samples in the data set on which the optimal number of features is determined, as discussed by Li and Yang <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Besides the number of features, the parameters of the kernel method (parameter <it>&#947; </it>for LS-SVM with normalized linear kernel) also need to be selected. This selection occurs on a k-dimensional grid with k - 1 the number of data sets included. We considered 40 possible values for <it>&#947;</it>, ranging from 10<sup>-4 </sup>to 10<sup>6 </sup>on a logarithmic scale. In each LOO iteration, a sample is left out, feature selection is performed on the remaining n - 1 samples, and models are built for all possible combinations of parameters on this grid. Each model with the instantiated parameters is evaluated on the left out sample. This whole procedure is repeated for all samples. The model parameters are chosen corresponding to the model with the highest LOO area under the receiver operating characteristic (ROC) curve (AUC). If multiple models have the same AUC, the model with the lowest balanced error rate and an as high as possible sum of sensitivity and specificity is chosen. For each considered outcome, the AUC of the best performing model is compared with the AUC of the other models using the method of Hanley and McNeil <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. The final features are chosen as those that occurred most often in the top rankings determined in each LOO iteration.</p>
            <p>Three kinds of model building strategies are proposed, different in the degree of integration. Figure <figr fid="F1">1</figr> shows these strategies in more detail. The data sets are represented as matrices with rows corresponding to patients and columns corresponding to genes, proteins, or CNVs. The matrices representing microarray or genomics data are larger than those for the proteomics data to emphasize the difference in dimensionality.</p>
            <fig id="F1"><title><p>Figure 1</p></title><caption><p>Overview of the three applied model building strategies</p></caption><text>
   <p>Overview of the three applied model building strategies. <b>(a) </b>Use of a single data set; <b>(b) </b>manual integration of data over time; <b>(c) </b>a genome-wide integration approach. The data sets are represented as matrices with rows corresponding to patients and columns corresponding to genes, proteins, or CNVs. In step A, LS-SVM models are built on each data set separately. A two-dimensional grid is used for the optimization of the regularization parameter and the number of features. For step B, data sets over time are combined. By using the changes in expression or abundance as features, a two-dimensional grid is suficient. In step C, an intermediate integration method is used for the integration of all available data sets. A k-dimensional grid is required for optimizing the regularization parameter and the number of features selected from the (k - 1) integrated data sets. FS, feature selection; M<sub><it>i</it></sub>, model for parameter combination i; NF, number of features; T, time point.</p>
</text><graphic file="gm39-1"/></fig>
            <p>All three strategies were applied to the microarray and proteomics data sets of rectal cancer. For the prostate cancer data set, however, only two strategies were applicable due to a lack of measurements repeated over time. For all models the parameters were trained according to the same approach, which makes the corresponding LOO results comparable for each outcome separately.</p>
            <sec>
               <st>
                  <p>Step A models: single data set</p>
               </st>
               <p>In a first step, LS-SVM models are built on each data set separately, mimicking the results that would have been obtained when only static data from one platform were available. For rectal cancer, the single data sets are microarray at <it>T</it><sub>0</sub>, microarray at <it>T</it><sub>1</sub>, proteomics at <it>T</it><sub>0</sub>, and proteomics at <it>T</it><sub>1 </sub>for the prediction of a regression grading system and two prognostic factors (Figure <figr fid="F1">1a</figr>). For prostate cancer, LS-SVM models are built on the microarray and genomics data separately for the prediction of grade, stage, metastasis, and recurrence. Because of only one set of features, a two-dimensional grid is used for the optimization of the regularization parameter and the number of features.</p>
            </sec>
            <sec>
               <st>
                  <p>Step B models: manual integration of data over time</p>
               </st>
               <p>When measurements are repeated at multiple time points, knowledge over time can be exploited. For rectal cancer, data were available before and early in therapy and, therefore, can be combined in the models. This is done for each data type separately by manually calculating the change in gene expression or protein abundance between the first two time points (<it>T</it><sub>0</sub>-<it>T</it><sub>1</sub>). These changes over time are used as features for the models as shown in Figure <figr fid="F1">1b</figr>. Also for these models, a two-dimensional grid suffices for the optimization of the regularization parameter and the number of features.</p>
            </sec>
            <sec>
               <st>
                  <p>Step C models: multiple omics integration approach</p>
               </st>
               <p>The previous two types of models (steps A and B) are considered to verify whether complex integration of data over multiple layers of biological regulation is crucial. The ability of kernel methods to deal with complexly structured data makes them ideally positioned for more advanced integration of heterogeneous data sources. We will use the intermediate integration method proposed in <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> in which a kernel matrix is computed for each data source separately. Subsequently, these data sources can be integrated in a straightforward way by summing the multiple kernel matrices. Positive semidefiniteness of the linear combination of kernel matrices is guaranteed by constraining the weights of the kernels to be non-negative. A weighted LS-SVM is trained on the explicitly heterogeneous kernel matrix. The choice of the weights to give to each data set is important. A kernel framework for optimizing weights is proposed in <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. This optimization is important when dealing with many data sets of which only several are relevant. However, when the number of data sets is limited and most of them are reliable and relevant to the problem at hand, a trade-off needs to be made between performance and computational burden (for example, extra required cross-validation loops). Due to the rather small sample size in both case studies, weights were chosen equally. Moreover, our aim is to emphasize that classification becomes more accurate when data from multiple layers in the genome are available and to offer a machine learning-based method for integrating these data sources, rather than to improve an algorithm for the optimization of weights (for example, <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>). A three-dimensional grid is used for the optimization of the parameters, that is, the regularization parameter, the number of genes selected from the microarray data sets, and the number of proteins or CNVs obtained from the proteomics data sets or the genomics data set, respectively. For the data on rectal cancer, the number of genes/proteins selected at <it>T</it><sub>0 </sub>and <it>T</it><sub>1 </sub>were taken equally when data from both time points were considered. Figure <figr fid="F1">1c</figr> gives an overview of the strategy.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Study I: rectal cancer</p>
            </st>
            <p>Using the methodologies shown in Figure <figr fid="F1">1</figr>, models were built using microarray and proteomics data of 36 rectal cancer patients at two time points during therapy for the prediction of three outcomes registered at the moment of surgery: a tumor regression grading system (WHEELER) and two prognostic factors, pathologic N stage at surgery (pN-STAGE) and the circumferential margin involvement (CRM). The models with the highest AUC, lowest balanced error rate and an as high as possible sum of sensitivity and specificity are shown in Table <tblr tid="T2">2</tblr>. The step A models are <it>MT</it><sub>0 </sub>(model based on microarray data at <it>T</it><sub>0</sub>), <it>MT</it><sub>1 </sub>(model based on microarray data at <it>T</it><sub>1</sub>), <it>PT</it><sub>0 </sub>(model based on proteomics data at <it>T</it><sub>0</sub>), and <it>PT</it><sub>1 </sub>(model based on proteomics data at <it>T</it><sub>1</sub>). The step B models consist of <it>MT</it><sub>0</sub>-<it>T</it><sub>1 </sub>(model based on change in gene expression between <it>T</it><sub>0 </sub>and <it>T</it><sub>1</sub>) and <it>PT</it><sub>0</sub>-<it>T</it><sub>1 </sub>(model based on change in protein abundances between <it>T</it><sub>0 </sub>and <it>T</it><sub>1</sub>). Finally, the step C models comprise <it>MT</it><sub>01 </sub>(model based on microarray data at both time points), <it>PT</it><sub>01 </sub>(model based on proteomics data at both time points), <it>MPT</it><sub>0 </sub>(model based on microarray and proteomics data at <it>T</it><sub>0</sub>), <it>MPT</it><sub>1 </sub>(model based on microarray and proteomics data at <it>T</it><sub>1</sub>), all possible combinations of three data sets (using the same name convention), and <it>MPT</it><sub>01 </sub>(model based on all data (microarray and proteomics data at both time points)). The numbers of genes and proteins were chosen to optimize the LOO performance of the LS-SVM models. The features selected most often in the 36 LOO iterations are listed and discussed. For each outcome, the ROC curve of the best model was compared with the ROC curves of all other models <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. The <it>P</it>-values of these significance tests are reported as well.</p>
            <tbl id="T2"><title><p>Table 2</p></title><caption><p>LS-SVM models for the prediction of WHEELER, pN-STAGE and CRM in rectal cancer</p></caption><tblbdy cols="6">
      <r>
         <c ca="left">
            <p>Outcome</p>
         </c>
         <c ca="left">
            <p>Model</p>
         </c>
         <c ca="center">
            <p>NG*</p>
         </c>
         <c ca="center">
            <p>NP<sup>&#8224;</sup></p>
         </c>
         <c ca="center">
            <p>AUC (SE)<sup>&#8225;</sup></p>
         </c>
         <c ca="center">
            <p><it>p</it>-value<sup>&#167;</sup></p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>WHEELER</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>0</sub>
            </p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.7538 (0.1085)</p>
         </c>
         <c ca="center">
            <p>0.0987</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>29</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.9038 (0.0502)</p>
         </c>
         <c ca="center">
            <p>0.6861</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>0</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>35</p>
         </c>
         <c ca="center">
            <p>0.7423 (0.0867)</p>
         </c>
         <c ca="center">
            <p>0.0540</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>1</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>11</p>
         </c>
         <c ca="center">
            <p>0.9038 (0.0575)</p>
         </c>
         <c ca="center">
            <p>0.7273</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>B</p>
         </c>
         <c ca="left">
            <p><it>MT</it><sub>0</sub>-<it>T</it><sub>1</sub></p>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.6846 (0.1215)</p>
         </c>
         <c ca="center">
            <p>0.0598</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p><it>PT</it><sub>0</sub>-<it>T</it><sub>1</sub></p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>5</p>
         </c>
         <c ca="center">
            <p>0.8654 (0.0621)</p>
         </c>
         <c ca="center">
            <p>0.4135</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>01</sub>
            </p>
         </c>
         <c ca="center">
            <p>3<sup>&#182;</sup></p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.7808 (0.0985)</p>
         </c>
         <c ca="center">
            <p>0.1320</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>01</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>21<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>0.7692 (0.0831)</p>
         </c>
         <c ca="center">
            <p>0.0831</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>0</sub>
            </p>
         </c>
         <c ca="center">
            <p>3</p>
         </c>
         <c ca="center">
            <p>35</p>
         </c>
         <c ca="center">
            <p>0.8461 (0.0718)</p>
         </c>
         <c ca="center">
            <p>0.2760</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>25</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>12</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.9269 (0.0425)</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>01</sub>
            </p>
         </c>
         <c ca="center">
            <p>2<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>31<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>0.8846 (0.0558)</p>
         </c>
         <c ca="center">
            <p>0.4858</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>0</sub>
               <it>PT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c ca="center">
            <p>0.9385 (0.0444)</p>
         </c>
         <c ca="center">
            <p>0.8101<sup>&#165;</sup></p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>pN-STAGE</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>0</sub>
            </p>
         </c>
         <c ca="center">
            <p>25</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.6493 (0.0914)</p>
         </c>
         <c ca="center">
            <p>2.315e-4</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>22</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.8506 (0.0665)</p>
         </c>
         <c ca="center">
            <p>0.0362</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>0</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>0.6753 (0.0906)</p>
         </c>
         <c ca="center">
            <p>6.659e-4</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>1</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>0.8409 (0.0652)</p>
         </c>
         <c ca="center">
            <p>0.0238</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>B</p>
         </c>
         <c ca="left">
            <p><it>MT</it><sub>0</sub>-<it>T</it><sub>1</sub></p>
         </c>
         <c ca="center">
            <p>4</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.6071 (0.0986)</p>
         </c>
         <c ca="center">
            <p>1.359e-4</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p><it>PT</it><sub>0</sub>-<it>T</it><sub>1</sub></p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
         <c ca="center">
            <p>0.7662 (0.0900)</p>
         </c>
         <c ca="center">
            <p>0.0153</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>01</sub>
            </p>
         </c>
         <c ca="center">
            <p>24<sup>&#182;</sup></p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.9286 (0.0450)</p>
         </c>
         <c ca="center">
            <p>0.1998</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>01</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>34<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>0.8182 (0.0695)</p>
         </c>
         <c ca="center">
            <p>0.0145</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>0</sub>
            </p>
         </c>
         <c ca="center">
            <p>27</p>
         </c>
         <c ca="center">
            <p>27</p>
         </c>
         <c ca="center">
            <p>0.9188 (0.0469)</p>
         </c>
         <c ca="center">
            <p>0.1591</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>21</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>14</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.9870 (0.0135)</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>01</sub>
            </p>
         </c>
         <c ca="center">
            <p>23<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>16<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>0.9610 (0.0280)</p>
         </c>
         <c ca="center">
            <p>0.3421</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>0</sub>
               <it>PT</it>
               <sub>01</sub>
            </p>
         </c>
         <c ca="center">
            <p>26</p>
         </c>
         <c ca="center">
            <p>20<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>1 (0)</p>
         </c>
         <c ca="center">
            <p>0.3347<sup>&#165;</sup></p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>CRM</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>0</sub>
            </p>
         </c>
         <c ca="center">
            <p>33</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.6790 (0.1016)</p>
         </c>
         <c ca="center">
            <p>0.0072</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>9</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.9259 (0.0472)</p>
         </c>
         <c ca="center">
            <p>0.4955</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>0</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>34</p>
         </c>
         <c ca="center">
            <p>0.8518 (0.0624)</p>
         </c>
         <c ca="center">
            <p>0.0935</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>1</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>34</p>
         </c>
         <c ca="center">
            <p>0.7654 (0.0831)</p>
         </c>
         <c ca="center">
            <p>0.0281</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>B</p>
         </c>
         <c ca="left">
            <p><it>MT</it><sub>0</sub>-<it>T</it><sub>1</sub></p>
         </c>
         <c ca="center">
            <p>6</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.9136 (0.0480)</p>
         </c>
         <c ca="center">
            <p>0.4030</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p><it>PT</it><sub>0</sub>-<it>T</it><sub>1</sub></p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>2</p>
         </c>
         <c ca="center">
            <p>0.8272 (0.0709)</p>
         </c>
         <c ca="center">
            <p>0.0849</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>01</sub>
            </p>
         </c>
         <c ca="center">
            <p>16<sup>&#182;</sup></p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.8066 (0.0846)</p>
         </c>
         <c ca="center">
            <p>0.0468</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>PT</it>
               <sub>01</sub>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>3<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>0.7531 (0.0865)</p>
         </c>
         <c ca="center">
            <p>0.0227</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>0</sub>
            </p>
         </c>
         <c ca="center">
            <p>7</p>
         </c>
         <c ca="center">
            <p>27</p>
         </c>
         <c ca="center">
            <p>0.8477 (0.0688)</p>
         </c>
         <c ca="center">
            <p>0.1340</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>7</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>33</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.9630 (0.0344)</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MPT</it>
               <sub>01</sub>
            </p>
         </c>
         <c ca="center">
            <p>2<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>3<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>0.8230 (0.0771)</p>
         </c>
         <c ca="center">
            <p>0.0973</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>1</sub>
               <it>PT</it>
               <sub>0</sub>
            </p>
         </c>
         <c ca="center">
            <p>16</p>
         </c>
         <c ca="center">
            <p>14</p>
         </c>
         <c ca="center">
            <p>0.9630 (0.0376)</p>
         </c>
         <c ca="center">
            <p>1</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>MT</it>
               <sub>01</sub>
               <it>PT</it>
               <sub>1</sub>
            </p>
         </c>
         <c ca="center">
            <p>9<sup>&#182;</sup></p>
         </c>
         <c ca="center">
            <p>29</p>
         </c>
         <c ca="center">
            <p>0.9876 (0.0146)</p>
         </c>
         <c ca="center">
            <p>0.4924<sup>&#165;</sup></p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>*Number of genes selected in each LOO iteration. <sup>&#8224;</sup>Number of proteins selected in each LOO iteration. <sup>&#8225;</sup>Area under the ROC curve (standard error) obtained with leave-one-out. <sup>&#167;</sup>Comparison of AUC between each model and the best model in bold <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. <sup>&#182;</sup>Number of features used at both time points. <sup>&#165;</sup>This model is better than the model in bold we compare with.</p>
   </tblfn></tbl>
            <p>Table <tblr tid="T2">2</tblr> shows the LS-SVM models for the considered combinations of data sets to predict WHEELER, pN-STAGE, and CRM with the optimal number of genes and proteins selected with DEDS and the Wilcoxon rank sum test, respectively. The corresponding ROC curves are shown in Additional data file 1. The performance of the models based on three data sets is given in Additional data file 2. Due to the slightly, but not significantly, better performance for each outcome of one model based on three data sets compared to models based on two data sets, we report the results for the best model combining two data sets. Such models would only require a sample to be taken at one time point (<it>MPT</it><sub>0</sub>, <it>MPT</it><sub>1</sub>) or one technology to be applied on two time points (<it>MT</it><sub>01</sub>, <it>PT</it><sub>01</sub>). For the prediction of WHEELER, the expression of 25 genes and 12 proteins at <it>T</it><sub>1 </sub>was best, although not significantly, with an AUC of 0.9269. Also for pN-STAGE, combining both data sets at <it>T</it><sub>1 </sub>using the expression of 21 genes and 14 proteins resulted in the best LOO AUC of 0.9870. This performance is significantly better than all step A and B models as well as <it>PT</it><sub>01</sub>. Finally, the inclusion of 7 genes and 33 proteins at <it>T</it><sub>1 </sub>led to an AUC of 0.9630 for the prediction of CRM. Four models based on only one data type perform significantly worse compared to <it>MPT</it><sub>1</sub>. For all outcomes, none of the selected proteins are a product of the selected genes.</p>
            <p>The contribution of the genes and/or proteins in rectal or colorectal cancer that were selected most often in the LOO iterations of <it>MPT</it><sub>1 </sub>and predicted most accurately WHEELER, pN-STAGE, or CRM are shown in Table <tblr tid="T3">3</tblr>. A protein important for CRM, for example, is the epidermal growth factor receptor (EGFR), involved in signaling pathways affecting cellular growth, differentiation, and proliferation. This protein represents one of the most promising targets allowing progress in colorectal cancer treatment. It has been suggested that EGFR polymorphisms as well as polymorphisms of other genes active in the EGFR pathway may be potential indicators of radiosensitivity in patients with rectal cancer treated with chemoradiation <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. In colorectal cancer, pro-inflammatory cytokines such as interleukin-1 beta and interleukin-6 may be accountable for the overexpression of <it>Cox-2</it>, important in the early stage and for progression <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Transforming growth factor alpha, down-regulated in our patients with a good responsiveness to preoperative therapy, is implicated in metastatic spread of colon cancer cells <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. The expression of interleukin-8 is associated with induction and progression of colorectal carcinoma and the development of colorectal liver metastases <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. In our data set, it is down-regulated in the group of patients with no lymph nodes found at surgery. Finally, elevated carcinoembryonic antigen and cancer antigen 19-9 are related to poor outcome in colorectal cancer <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Their levels are low in patients with no lymph nodes, while carcinoembryonic antigen is also less expressed in patients with a negative CRM, that is, belonging to the class of 'responders'. A complete list of the genes and proteins chosen by the models <it>MPT</it><sub>1 </sub>are shown, for each outcome separately, in Additional data file 3. The predictions seem to depend on mainly different subsets of features. The gene encoding PAI-2 is important for both WHEELER and CRM, while the proteins important for two of the three outcomes are interleukin-4, ferritin, apolipoprotein H, epidermal growth factor, matrix metalloproteinase-2, and lymphotactin. Notably, these genes and proteins were also selected by the other models based on microarray and/or proteomics data at <it>T</it><sub>1</sub>, although the specific feature ranking depends on the number of features included. Some of these genes and proteins were also included in the models based on data at <it>T</it><sub>0</sub>.</p>
            <tbl id="T3"><title><p>Table 3</p></title><caption><p>Features for (colo)rectal cancer selected by <it>MPT</it><sub>1 </sub>and known to be involved in this type of cancer</p></caption><tblbdy cols="7">
      <r>
         <c ca="left">
            <p>Outcome*</p>
         </c>
         <c ca="left">
            <p>Gene/protein</p>
         </c>
         <c ca="center">
            <p>Hits<sup>&#8224;</sup></p>
         </c>
         <c ca="left">
            <p>Region</p>
         </c>
         <c ca="left">
            <p>Function</p>
         </c>
         <c ca="left">
            <p>Up/down<sup>&#8225;</sup></p>
         </c>
         <c ca="center">
            <p>Reference</p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>Cox-2</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>1q25.2-q25.3</p>
         </c>
         <c ca="left">
            <p>Progression</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B50">50</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>IL-1B</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>2q14</p>
         </c>
         <c ca="left">
            <p>Inflammatory response</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B50">50</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>Ferritin</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>11q13; 19q13.3-q13.4</p>
         </c>
         <c ca="left">
            <p>Iron storage</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B63">63</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>EGF</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>4q25</p>
         </c>
         <c ca="left">
            <p>Cell growth/proliferation/differentiation</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B64">64</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>MMP-2</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>16q13-q21</p>
         </c>
         <c ca="left">
            <p>Invasion/metastasis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B65">65</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>TGF<it>&#945;</it></p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>2p13</p>
         </c>
         <c ca="left">
            <p>Angiogenesis/cell proliferation</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B51">51</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>SELE</p>
         </c>
         <c ca="center">
            <p>25</p>
         </c>
         <c ca="left">
            <p>1q22-q25</p>
         </c>
         <c ca="left">
            <p>Progression/metastasis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B66">66</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>GM-CSF</p>
         </c>
         <c ca="center">
            <p>24</p>
         </c>
         <c ca="left">
            <p>5q31.1</p>
         </c>
         <c ca="left">
            <p>Maintenance of granulocytes/macrophages</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B67">67</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>W</p>
         </c>
         <c ca="left">
            <p>MMP-1</p>
         </c>
         <c ca="center">
            <p>15</p>
         </c>
         <c ca="left">
            <p>11q22.3</p>
         </c>
         <c ca="left">
            <p>Tumor invasion/metastasis/poor prognosis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B68">68</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>Reg4</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>1p13.1-p12</p>
         </c>
         <c ca="left">
            <p>Early carcinogenesis</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B69">69</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>MUC2</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>11p15.5</p>
         </c>
         <c ca="left">
            <p>Deregulated by TNF&#945;</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B70">70</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>CA1</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>8q13-q22.1</p>
         </c>
         <c ca="left">
            <p>Carbonate dehydratase activity</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B71">71</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>CA2</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>8q22</p>
         </c>
         <c ca="left">
            <p>Carbonate dehydratase activity</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B71">71</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>CLDN8</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>21q22.11</p>
         </c>
         <c ca="left">
            <p>Tumorigenesis</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B72">72</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>CEA</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>19q13.1-q13.2</p>
         </c>
         <c ca="left">
            <p>Cell adhesion; tumor marker for recurrence</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B53">53</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>IL-1ra</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>2q14.2</p>
         </c>
         <c ca="left">
            <p>Carcinogenesis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B73">73</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>CA19-9</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>Tumor marker for recurrence</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B53">53</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>Ferritin</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>11q13; 19q13.3-q13.4</p>
         </c>
         <c ca="left">
            <p>Iron storage</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B63">63</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>IL-1beta</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>2q14</p>
         </c>
         <c ca="left">
            <p>Inflammatory response</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B50">50</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>beta2-microglobulin</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>15q21-q22.2</p>
         </c>
         <c ca="left">
            <p>Metastasis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B74">74</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>RARRES1</p>
         </c>
         <c ca="center">
            <p>31</p>
         </c>
         <c ca="left">
            <p>3q25.32-q25.33</p>
         </c>
         <c ca="left">
            <p>Cell proliferation</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B75">75</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>IL-8</p>
         </c>
         <c ca="center">
            <p>28</p>
         </c>
         <c ca="left">
            <p>4q13-q21</p>
         </c>
         <c ca="left">
            <p>Progression/metastasis</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B52">52</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>N</p>
         </c>
         <c ca="left">
            <p>TNFRII</p>
         </c>
         <c ca="center">
            <p>24</p>
         </c>
         <c ca="left">
            <p>1p36.3-p36.2</p>
         </c>
         <c ca="left">
            <p>Apoptosis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B76">76</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>ICAM-1</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>19p13.3-p13.2</p>
         </c>
         <c ca="left">
            <p>Metastasis</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B77">77</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>CEA</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>19q13.1-q13.2</p>
         </c>
         <c ca="left">
            <p>Cell adhesion; tumor marker for recurrence</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B53">53</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>MMP-2</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>16q13-q21</p>
         </c>
         <c ca="left">
            <p>Invasion/metastasis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B65">65</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>Adiponectin</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>3q27</p>
         </c>
         <c ca="left">
            <p>Metabolic/hormonal processes</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B78">78</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>Thrombospondin-1</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>15q15</p>
         </c>
         <c ca="left">
            <p>Angiogenesis/tumor growth</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B79">79</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>EGFR</p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>7p12</p>
         </c>
         <c ca="left">
            <p>Cell growth/proliferation/differentiation</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B49">49</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>Tissue factor</p>
         </c>
         <c ca="center">
            <p>35</p>
         </c>
         <c ca="left">
            <p>1p22-p21</p>
         </c>
         <c ca="left">
            <p>Angiogenesis/metastasis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B80">80</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>CYP1B1</p>
         </c>
         <c ca="center">
            <p>35</p>
         </c>
         <c ca="left">
            <p>2p21</p>
         </c>
         <c ca="left">
            <p>Drug metabolism</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B81">81</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>EGF</p>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
         <c ca="left">
            <p>4q25</p>
         </c>
         <c ca="left">
            <p>Cell growth/proliferation/differentiation</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B64">64</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>*W, WHEELER; N, pN-STAGE; C, CRM. <sup>&#8224;</sup>Number of occurrences of the gene/protein in the 36 LOO iterations. <sup>&#8225;</sup>Up/down-regulation in the good responders with respect to moderate or poor responders; no lymph nodes with respect to at least one regional lymph node; negative CRM with respect to positive CRM. CRC, (colo)rectal cancer.</p>
   </tblfn></tbl>
         </sec>
         <sec>
            <st>
               <p>Study II: prostate cancer</p>
            </st>
            <p>The same methodology was applied to microarray and genomics data of 55 patients with prostate cancer. Table <tblr tid="T4">4</tblr> shows the results for the prediction of the grade and stage of the tumor (GRADE and STAGE), as well as the tumors that metastasized to distant lymph nodes (METASTASIS) or that recurred (RECURRENCE). Because the data were gathered at one time point, only step A and C models are applicable. The step A models are represented as <it>M </it>(model based on microarray data) and <it>G </it>(model based on genomics data), and the step C model based on both microarray and genomics data as <it>MG</it>. Also, after having optimized the essential number of features to be included using a LOO cross-validation, the final genes and CNVs were selected based on their position and number of occurrences in the 55 LOO rankings.</p>
            <tbl id="T4"><title><p>Table 4</p></title><caption><p>LS-SVM models for the prediction of GRADE, STAGE, METASTASIS and RECURRENCE in prostate cancer</p></caption><tblbdy cols="6">
      <r>
         <c ca="left">
            <p>Outcome</p>
         </c>
         <c ca="left">
            <p>Model</p>
         </c>
         <c ca="center">
            <p>NG*</p>
         </c>
         <c ca="center">
            <p>NC<sup>&#8224;</sup></p>
         </c>
         <c ca="center">
            <p>AUC (SE)<sup>&#8225;</sup></p>
         </c>
         <c ca="center">
            <p><it>p</it>-value<sup>&#167;</sup></p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>GRADE</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>
               <it>M</it>
            </p>
         </c>
         <c ca="center">
            <p>24</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.8304 (0.0623)</p>
         </c>
         <c ca="center">
            <p>0.2727</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>G</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>8</p>
         </c>
         <c ca="center">
            <p>0.7822 (0.0632)</p>
         </c>
         <c ca="center">
            <p>0.0503</p>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>
               <it>MG</it>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>6</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>8</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.9006 (0.0413)</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>STAGE</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>
               <it>M</it>
            </p>
         </c>
         <c ca="center">
            <p>18</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.6576 (0.0778)</p>
         </c>
         <c ca="center">
            <p>0.0191</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>G</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>32</p>
         </c>
         <c ca="center">
            <p>0.7936 (0.0631)</p>
         </c>
         <c ca="center">
            <p>0.3466</p>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>
               <it>MG</it>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>42</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>22</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.8528 (0.0550)</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>METASTASIS</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>
               <it>M</it>
            </p>
         </c>
         <c ca="center">
            <p>18</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.9759 (0.0178)</p>
         </c>
         <c ca="center">
            <p>0.4392</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>G</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>12</p>
         </c>
         <c ca="center">
            <p>0.8114 (0.0755)</p>
         </c>
         <c ca="center">
            <p>0.0166</p>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>
               <it>MG</it>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>18</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>3</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.9868 (0.0121)</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>RECURRENCE</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>A</p>
         </c>
         <c ca="left">
            <p>
               <it>M</it>
            </p>
         </c>
         <c ca="center">
            <p>24</p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>0.7208 (0.0936)</p>
         </c>
         <c ca="center">
            <p>0.5392</p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <it>G</it>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>26</p>
         </c>
         <c ca="center">
            <p>0.4481 (0.1433)</p>
         </c>
         <c ca="center">
            <p>0.0354</p>
         </c>
      </r>
      <r>
         <c indent="1" ca="left">
            <p>C</p>
         </c>
         <c ca="left">
            <p>
               <it>MG</it>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>32</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>2</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.7857 (0.0934)</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>*Number of genes selected in each LOO iteration. <sup>&#8224;</sup>Number of copy number variations selected in each LOO iteration. <sup>&#8225;</sup>Area under the ROC curve (standard error) obtained with leave-one-out. <sup>&#167;</sup>Comparison of AUC between each model and the best model in bold <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
   </tblfn></tbl>
            <p>We obtained similar results as for rectal cancer. Combining gene expression with measurements at the DNA level (<it>MG</it>) led, for all four outcomes, to an improvement in classification accuracy and was significant in some cases (Table <tblr tid="T4">4</tblr>). For the prediction of GRADE, six genes and eight CNVs selected with DEDS resulted in an AUC of 0.9006. For STAGE, 42 genes and 22 CNVs were needed for a performance of 0.8528. The model <it>MG </it>for the prediction of METASTASIS had an AUC of 0.9868 when fusing the expression of 18 genes with 3 CNVs. Finally, the prediction of RECURRENCE was most difficult, with an AUC of 0.7857 when combining 32 genes and 2 CNVs. Additional data file 1 shows the ROC curves of the models listed in Table <tblr tid="T4">4</tblr>.</p>
            <p>Several genes and CNVs have been selected by <it>MG </it>and are known to be involved in, and important for, prostate cancer (Table <tblr tid="T5">5</tblr>). The gene <it>ALOX15B </it>is a suppressor of prostate tumor development <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> and in this data set is down-regulated in tumors of high-grade and in tumors that recurred. Both <it>SFRP4 </it>and <it>CXCL14 </it>on the other hand are inhibitors of prostate tumor growth <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. <it>SFRP4 </it>is up-regulated in tumors of high-grade, and <it>CXCL14 </it>in tumors of advanced stage. A small deletion involving chromosomal band 21q22.3 fuses all coding exons of <it>ERG </it>to androgen-related sequences in the promoter of the prostate-specific <it>TMPRSS2 </it>gene. This chromosomal rearrangement is a highly prevalent oncogenic alteration in prostate tumor cells and leads to an aberrant expression of the <it>ERG </it>proto-oncogene, important for early prostate carcinogenesis <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. In this data set, <it>ERG </it>is overexpressed in tumors in which the cancer metastasized to distant lymph nodes. It has been shown that this genetic biomarker is a strong prognostic factor for disease recurrence, and can be used for early detection and outcome prediction in prostate cancer <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. <it>VAV3</it>, an oncogene involved in development and progression of prostate cancer, is up-regulated in tumors that metastasized <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. It has previously been shown that strong overexpression of <it>TIAM1 </it>is significantly associated with disease recurrence and a decreased disease-free survival <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. Also, <it>JAG1 </it>is significantly associated with recurrence <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> and plays a role in cell growth, progression, and metastasis. In this data set, both genes are up-regulated in the group of tumors that recurred. Finally, several germline mutations or variants in <it>RNASEL </it>have been observed among hereditary prostate cancer cases, indicating that polymorphic changes within the <it>RNASEL </it>gene may be associated with increased risk of familial but not sporadic prostate cancer <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. A list of all the genes and CNVs selected by the models <it>MG </it>are shown in Additional data file 3. As for rectal cancer, the outcomes for prostate cancer seem to be characterized by mainly different sets of features. Five genes overlap between at least two outcomes (<it>ERG</it>, <it>AHSG</it>, <it>SEMA4G</it>, <it>F5</it>, and <it>ALOX15B</it>), while the same holds for four CNVs of the genes <it>GPD1L</it>, <it>KCTD12</it>, <it>SMYD5</it>, and <it>TRO</it>.</p>
            <tbl id="T5"><title><p>Table 5</p></title><caption><p>Features for prostate cancer selected by <it>MG </it>and known to be involved in this type of cancer</p></caption><tblbdy cols="7">
      <r>
         <c ca="left">
            <p>Outcome*</p>
         </c>
         <c ca="left">
            <p>Gene/CNV</p>
         </c>
         <c ca="center">
            <p>Hits<sup>&#8224;</sup></p>
         </c>
         <c ca="left">
            <p>Region</p>
         </c>
         <c ca="left">
            <p>Function</p>
         </c>
         <c ca="left">
            <p>Up/down<sup>&#8225;</sup></p>
         </c>
         <c ca="center">
            <p>Reference</p>
         </c>
      </r>
      <r>
         <c cspan="7">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>G</p>
         </c>
         <c ca="left">
            <p>
               <it>SFRP4</it>
            </p>
         </c>
         <c ca="center">
            <p>55</p>
         </c>
         <c ca="left">
            <p>7p14.1</p>
         </c>
         <c ca="left">
            <p>Inhibitor of PT growth/invasion</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B55">55</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>G</p>
         </c>
         <c ca="left">
            <p>
               <it>VCAN</it>
            </p>
         </c>
         <c ca="center">
            <p>55</p>
         </c>
         <c ca="left">
            <p>5q14.3</p>
         </c>
         <c ca="left">
            <p>Contributor to PC pathology</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B82">82</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>G</p>
         </c>
         <c ca="left">
            <p>
               <it>ALOX15B</it>
            </p>
         </c>
         <c ca="center">
            <p>36</p>
         </c>
         <c ca="left">
            <p>17p13.1</p>
         </c>
         <c ca="left">
            <p>Suppressor of PT development</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B54">54</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S</p>
         </c>
         <c ca="left">
            <p>
               <it>MAGEA4</it>
            </p>
         </c>
         <c ca="center">
            <p>50</p>
         </c>
         <c ca="left">
            <p>Xq28</p>
         </c>
         <c ca="left">
            <p>Only expressed in PC (diagnosis and therapy)</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B83">83</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S</p>
         </c>
         <c ca="left">
            <p>
               <it>ANPEP</it>
            </p>
         </c>
         <c ca="center">
            <p>50</p>
         </c>
         <c ca="left">
            <p>15q25-q26</p>
         </c>
         <c ca="left">
            <p>PT cell invasion</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B84">84</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S</p>
         </c>
         <c ca="left">
            <p>
               <it>POU4F1</it>
            </p>
         </c>
         <c ca="center">
            <p>50</p>
         </c>
         <c ca="left">
            <p>13q31.1</p>
         </c>
         <c ca="left">
            <p>PC cell growth</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B85">85</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S</p>
         </c>
         <c ca="left">
            <p>
               <it>CXCL14</it>
            </p>
         </c>
         <c ca="center">
            <p>48</p>
         </c>
         <c ca="left">
            <p>5q31</p>
         </c>
         <c ca="left">
            <p>Inhibitor of PT growth</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B56">56</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S</p>
         </c>
         <c ca="left">
            <p>
               <it>RNASEL</it>
            </p>
         </c>
         <c ca="center">
            <p>48</p>
         </c>
         <c ca="left">
            <p>1q25</p>
         </c>
         <c ca="left">
            <p>Polymorphic changes as tumor; suppressor in hereditary PC</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B62">62</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>S</p>
         </c>
         <c ca="left">
            <p>
               <it>GDEP</it>
            </p>
         </c>
         <c ca="center">
            <p>41</p>
         </c>
         <c ca="left">
            <p>4q21.1</p>
         </c>
         <c ca="left">
            <p>Prostate-specific gene</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B86">86</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>M</p>
         </c>
         <c ca="left">
            <p>
               <it>ERG</it>
            </p>
         </c>
         <c ca="center">
            <p>50</p>
         </c>
         <c ca="left">
            <p>21q22.3</p>
         </c>
         <c ca="left">
            <p>Proto-oncogene; early prostate carcinogenesis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B57">57</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>M</p>
         </c>
         <c ca="left">
            <p>
               <it>AREG</it>
            </p>
         </c>
         <c ca="center">
            <p>49</p>
         </c>
         <c ca="left">
            <p>4q13-q21</p>
         </c>
         <c ca="left">
            <p>PC progression/growth via TARP</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B87">87</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>M</p>
         </c>
         <c ca="left">
            <p>
               <it>VAV3</it>
            </p>
         </c>
         <c ca="center">
            <p>49</p>
         </c>
         <c ca="left">
            <p>1p13.3</p>
         </c>
         <c ca="left">
            <p>Oncogene; PC development/progression</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B59">59</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>M</p>
         </c>
         <c ca="left">
            <p>
               <it>ADAMTS1</it>
            </p>
         </c>
         <c ca="center">
            <p>26</p>
         </c>
         <c ca="left">
            <p>21q21.2</p>
         </c>
         <c ca="left">
            <p>Negatively affected by TGFbeta1, which increases VCAN-expression</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B82">82</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>R</p>
         </c>
         <c ca="left">
            <p>
               <it>AZGP1</it>
            </p>
         </c>
         <c ca="center">
            <p>29</p>
         </c>
         <c ca="left">
            <p>7q22.1</p>
         </c>
         <c ca="left">
            <p>Inversely associated to tumor stage; predictor of biochemical recurrence</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B88">88</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>R</p>
         </c>
         <c ca="left">
            <p>
               <it>TIAM1</it>
            </p>
         </c>
         <c ca="center">
            <p>29</p>
         </c>
         <c ca="left">
            <p>21q22.1-11</p>
         </c>
         <c ca="left">
            <p>Predictor of decreased disease-free survival/recurrence</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B60">60</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>R</p>
         </c>
         <c ca="left">
            <p>
               <it>FGG</it>
            </p>
         </c>
         <c ca="center">
            <p>28</p>
         </c>
         <c ca="left">
            <p>4q28</p>
         </c>
         <c ca="left">
            <p>PC cell growth</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B89">89</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>R</p>
         </c>
         <c ca="left">
            <p>
               <it>ATF3</it>
            </p>
         </c>
         <c ca="center">
            <p>26</p>
         </c>
         <c ca="left">
            <p>1q32.3</p>
         </c>
         <c ca="left">
            <p>Inversely related to invasion/angiogenesis; positively correlated to metastases</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B90">90</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>R</p>
         </c>
         <c ca="left">
            <p>
               <it>JAG1</it>
            </p>
         </c>
         <c ca="center">
            <p>26</p>
         </c>
         <c ca="left">
            <p>20p12.1-11.23</p>
         </c>
         <c ca="left">
            <p>Cell growth/progression/metastasis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B61">61</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>R</p>
         </c>
         <c ca="left">
            <p>
               <it>ERG</it>
            </p>
         </c>
         <c ca="center">
            <p>14</p>
         </c>
         <c ca="left">
            <p>21q22.3</p>
         </c>
         <c ca="left">
            <p>Proto-oncogene; early prostate carcinogenesis</p>
         </c>
         <c ca="left">
            <p>Up</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B57">57</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>R</p>
         </c>
         <c ca="left">
            <p>
               <it>ALOX15B</it>
            </p>
         </c>
         <c ca="center">
            <p>14</p>
         </c>
         <c ca="left">
            <p>17p13.1</p>
         </c>
         <c ca="left">
            <p>Suppressor of PT development</p>
         </c>
         <c ca="left">
            <p>Down</p>
         </c>
         <c ca="center">
            <p>
               <abbrgrp>
                  <abbr bid="B54">54</abbr>
               </abbrgrp>
            </p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>*G, GRADE; S, STAGE; M, METASTASIS; R, RECURRENCE. <sup>&#8224;</sup>Number of occurrences of the gene/CNV in all LOO iterations (number of LOO iterations for G = 55, S = 50, M = 50, R = 29). <sup>&#8225;</sup>Up/down-regulation in high-grade with respect to low-grade; advanced stage with respect to early stage; metastasis with respect to no metastasis; recurrence with respect to no recurrence. PC, prostate cancer; PT, prostate tumor.</p>
   </tblfn></tbl>
         </sec>
         <sec>
            <st>
               <p>Comparison with an ensemble approach</p>
            </st>
            <p>To assess the benefit of our kernel-based integration approach over standard data fusion techniques, we implemented an ensemble approach in which each data set gives rise to a separate LS-SVM classifier. These individual LS-SVM models were built similarly to the step A models, with the same number of genes, proteins or CNVs selected as included in the best models <it>MPT</it><sub>1 </sub>and <it>MG</it>. Subsequently, as a late integration step, the continuous outputs of these models were added.</p>
            <p>For the study on rectal cancer, the AUC values of the ensemble models integrating the microarray and proteomics data set gathered at <it>T</it><sub>1</sub>, and the corresponding AUC values of the best model obtained with our strategy (<it>MPT</it><sub>1</sub>) are shown in Table <tblr tid="T6">6</tblr>. The <it>P</it>-values of the significance tests comparing the ROC curves are reported as well <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. For CRM, our strategy was significantly better than the ensemble approach at a significance level of 0.05. For WHEELER and pN-STAGE, the AUC values did not differ significantly. Similarly for the study on prostate cancer, the AUC values of <it>MG </it>were compared with the AUC values of the ensemble models combining microarray and genomics data (Table <tblr tid="T6">6</tblr>). For all four outcomes, the AUC of <it>MG </it>was better than the AUC of the ensemble models, although being significantly better for RECURRENCE only.</p>
            <tbl id="T6"><title><p>Table 6</p></title><caption><p>Comparison of our kernel-based integration approach with the ensemble approach</p></caption><tblbdy cols="4">
      <r>
         <c ca="left">
            <p>Outcome</p>
         </c>
         <c ca="center">
            <p>AUC (SE)*:<it>MPT</it><sub>1</sub>/<it>MG</it></p>
         </c>
         <c ca="center">
            <p>AUC (SE)*: ensemble approach</p>
         </c>
         <c ca="center">
            <p><it>p</it>-value</p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>WHEELER</p>
         </c>
         <c ca="center">
            <p>0.9269 (0.0425)</p>
         </c>
         <c ca="center">
            <p>0.9500 (0.0339)</p>
         </c>
         <c ca="center">
            <p>0.6160</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>pN-STAGE</p>
         </c>
         <c ca="center">
            <p>0.9870 (0.0135)</p>
         </c>
         <c ca="center">
            <p>0.9253 (0.0432)</p>
         </c>
         <c ca="center">
            <p>0.1422</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>CRM</p>
         </c>
         <c ca="center">
            <p>0.9630 (0.0344)</p>
         </c>
         <c ca="center">
            <p>0.7860 (0.0783)</p>
         </c>
         <c ca="center">
            <p>
               <b>0.0384</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>GRADE</p>
         </c>
         <c ca="center">
            <p>0.9006 (0.0413)</p>
         </c>
         <c ca="center">
            <p>0.8567 (0.0521)</p>
         </c>
         <c ca="center">
            <p>0.3745</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>STAGE</p>
         </c>
         <c ca="center">
            <p>0.8528 (0.0550)</p>
         </c>
         <c ca="center">
            <p>0.8304 (0.0582)</p>
         </c>
         <c ca="center">
            <p>0.6836</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>METASTASIS</p>
         </c>
         <c ca="center">
            <p>0.9868 (0.0121)</p>
         </c>
         <c ca="center">
            <p>0.9452 (0.0309)</p>
         </c>
         <c ca="center">
            <p>0.1313</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>RECURRENCE</p>
         </c>
         <c ca="center">
            <p>0.7857 (0.0934)</p>
         </c>
         <c ca="center">
            <p>0.4545 (0.1352)</p>
         </c>
         <c ca="center">
            <p>
               <b>0.0182</b>
            </p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>*Area under the ROC curve (standard error) obtained with leave-one-out. <sup>&#8224;</sup>Comparison in AUC between the best models obtained with our strategy (<it>MPT</it><sub>1 </sub>for rectal cancer, <it>MG </it>for prostate cancer) and the corresponding ensemble models based on the same number of features <abbrgrp><abbr bid="B46">46</abbr></abbrgrp></p>
   </tblfn></tbl>
         </sec>
         <sec>
            <st>
               <p>Correlation analysis</p>
            </st>
            <p>We additionally verified whether, in both cases, data from multiple layers of molecular biology were complementary. After mapping the entities of the data sets based on their entrez gene IDs, we investigated the correlation between the microarray and proteomics data of rectal cancer on the one hand, and between the microarray and genomics data of prostate cancer on the other hand. Using the Spearman correlation coefficient, there was no significant correlation for rectal cancer between the abundances of the 90-92 proteins and their corresponding transcripts at a significance level of 0.05. The microarray and genomics data sets for prostate cancer were slightly more correlated. While for GRADE the 6 genes selected by the model <it>MG </it>did not correlate with their DNA expression, 2 of the 42 selected genes for STAGE were significantly correlated (<it>P </it>&lt; 0.05). For METASTASIS and RECURRENCE, there was a significant correlation for one and three genes, respectively. The regions, with involved CNVs selected from the genomics data, were also compared with the regions in which the selected genes from the microarray data were located. For the majority of regions, there was no overlap. For the other regions with the same rough chromosomal location, the genes selected by both data sets were different.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The proposed integration approach has been applied to two patient data sets, each with two high-throughput data sources. Microarray and proteomics were gathered from 36 patients with rectal cancer at two time points during preoperative treatment, while microarray and genomics were gathered from 55 patients with prostate cancer. To verify the merit of our integration approach over the use of a single omics data source, models were built for classifying cancer patients according to therapy response, prognostic factors, metastasis, or recurrence. In many studies, only single data sources are explored for the development of such profiles. However, in our opinion, a single layer of molecular information is inadequate to explain the complete network of molecules underlying a disease. In this study, LS-SVMs were first built on all data sets individually (Figure <figr fid="F1">1</figr>). Next, we manually integrated data measured at multiple time points by building LS-SVMs using the change in expression between two time points. Because the integration of data may be more complex than the change in expression over time, we subsequently applied an intermediate integration approach in which data from multiple omics were combined at the kernel level within the patient domain.</p>
         <p>For the data on rectal cancer, all three outcomes - a tumor regression grading system and two prognostic factors - could be predicted most accurately and most cost-efficiently with an AUC ranging from 0.9269 to 0.9870 when fusing microarray and proteomics data gathered during therapy (<it>MPT</it><sub>1</sub>; Table <tblr tid="T2">2</tblr>). For WHEELER, for example, <it>MPT</it><sub>0 </sub>performance is better than each of the models based on data from an individual technology (<it>MT</it><sub>0 </sub>and <it>PT</it><sub>0</sub>), as is the case for <it>MPT</it><sub>01 </sub>compared to <it>MT</it><sub>01 </sub>and <it>PT</it><sub>01</sub>. This trend of increased performance when combining data from two different technologies was further confirmed by our second data set for prostate cancer patients. Best results for the prediction of grade, stage, metastasis, and recurrence were obtained when integrating microarray and genomics data (<it>MG</it>). The corresponding AUC values were 0.9006, 0.8528, 0.9868, and 0.7857, respectively (Table <tblr tid="T4">4</tblr>). For many of the genes, proteins, and CNVs included in these models, involvement in rectal or prostate cancer has been defined, indicating the reliability of the selected features (Tables <tblr tid="T3">3</tblr> and <tblr tid="T5">5</tblr>). These models were compared with models obtained with an ensemble approach in which classiffiers are combined instead of data sets at the kernel level. Globally, our approach performed better, although not always significantly (Table <tblr tid="T6">6</tblr>).</p>
         <p>By looking at the correlation between two data sets gathered from the same set of patients, we show that data from different layers are mainly complementary. For rectal cancer, there was a lack of correlation between the selected genes and their corresponding proteins. Also, the selected proteins did not significantly correlate with their transcript level, suggesting alternative splicing and post-translational modification. With newer technologies such as mass spectrometry, the whole proteome will become measurable. For prostate cancer, up to three genes included in the model <it>MG </it>were significantly correlated with their corresponding CNV.</p>
         <p>More specific for the study on rectal cancer, we can conclude from Table <tblr tid="T2">2</tblr> that data gathered after an initial dose of cetuximab are more informative for prediction of therapy response than data gathered before the start of the therapy. Neither microarray nor proteomics data can predict the outcomes more accurately at <it>T</it><sub>0 </sub>than <it>T</it><sub>1</sub>, except for the proteomics data at <it>T</it><sub>0 </sub>being more informative for the prediction of CRM. Moreover, when combining both data types at one time point (<it>MPT</it><sub>0 </sub>and <it>MPT</it><sub>1</sub>), the models applicable after the initial dose of cetuximab outperform those at <it>T</it><sub>0</sub>.</p>
         <p>We acknowledge that the models proposed in this manuscript are quite expensive. Applying a model for rectal cancer would require microarray and/or proteomics data, gathered at one or two time points during therapy. However, we have attempted to keep the cost to a minimum. The performance difference between models combining two data sets, only requiring a sample to be taken at one time point or one technology to be applied at two time points, and models requiring a sample to be taken at both time points and both technologies to be performed was minimal and not statistically significant. We therefore chose the best model among the models based on two data sets. We admit that there may exist other, less expensive data sources that can contain complementary information as well. Firstly, clinical information is routinely gathered during therapy, such as tumor size, tumor location and number of positive lymph nodes. However, we only had access to the clinical parameter age, for which we performed an additional analysis to verify whether this parameter could be of use. A univariate analysis based on the Wilcoxon rank sum test showed no significant difference in age between the two classes of samples according to the considered outcomes. In a multivariate logistic regression model, the parameter age was not significant as well. Secondly, there is an increasing need for multi-modal studies in which, among others, clinical, genomic and genetic data are collected. Also, imaging, such as computed tomography (CT) and magnetic resonance imaging (MRI) can be a potential predictor to use in combination with high-throughput data sources. Such studies are required to determine which data sets are most relevant for the problem at hand and which data sets should be combined to become good performing, affordable models that are clinically applicable.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>The results suggest that the use of our integration approach on experimental data from multiple levels in the genome can improve the performance of decision support in cancer. For both data sets studied in this manuscript, combining high-throughput data sets (transcriptomics with proteomics, or genomics with transcriptomics) outperformed the models based on data from a single layer of biological information, independent of the outcome considered for prediction. These results emphasize the need for comprehensive multi-modal data gathered with high-throughput technologies as well as imaging, because it is unknown which technologies, and thus which levels of molecular biology, are the most relevant for prognostic prediction. We acknowledge that this will substantially increase costs in a first exploratory phase. However, this is a necessary investment to ultimately obtain cost-efficient models usable in patient tailored therapy.</p>
         <p>In the near future, we will compare our kernel-based integration method with a Bayesian network integration framework. These frameworks are complementary. We also plan to apply an ensemble approach for integrating these two frameworks because more accurate classifiers are not only obtained by combining different data types but also by combining individual decisions of multiple classifiers. In this way, the advantages of both methods can be exploited.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>AUC: area under the ROC curve; CGH: comparative genomic hybridization; CNV: copy number variation; CRM: circumferential margin involvement; DEDS: differential expression via distance synthesis; EGFR: epidermal growth factor receptor; <it>G</it>: model based on genomics data; LOO: leave-one-out; LS-SVM: least squares support vector machine; <it>M</it>: model based on microarray data; <it>MG</it>: model based on both microarray and genomics data; <it>MPT</it><sub>0</sub>: model based on microarray and proteomics data at <it>T</it><sub>0</sub>; <it>MPT</it><sub>1</sub>: model based on microarray and proteomics data at <it>T</it><sub>1</sub>; <it>MPT</it><sub>01</sub>: model based on all data (microarray and proteomics data at both timepoints); <it>MT</it><sub>0</sub>: model based on microarray data at <it>T</it><sub>0</sub>; <it>MT</it><sub>1</sub>: model based on microarray data at <it>T</it><sub>1</sub>; <it>MT</it><sub>01</sub>: model based on microarray data at both time points; <it>MT</it><sub>0</sub>-<it>T</it><sub>1</sub>: model based on change in gene expression between <it>T</it><sub>0 </sub>and <it>T</it><sub>1</sub>; <it>PT</it><sub>0</sub>: model based on proteomics data at <it>T</it><sub>0</sub>; <it>PT</it><sub>1</sub>: model based on proteomics data at <it>T</it><sub>1</sub>; <it>PT</it><sub>01</sub>: model based on proteomics data at both time points; <it>PT</it><sub>0</sub>-<it>T</it><sub>1</sub>: model based on change in protein abundances between <it>T</it><sub>0 </sub>and <it>T</it><sub>1</sub>; ROC: receiver operating characteristic; SVM: support vector machine;<it>T</it><sub>0</sub>: time point before treatment; <it>T</it><sub>1</sub>: time point after the first loading dose of cetuximab but before the start of radiotherapy with capecitabine; <it>T</it><sub>2</sub>: time point at moment of surgery.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>The authors declare that they have no competing interests.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>ADa performed the kernel-based integration modeling and drafted the manuscript. OG, FO, and JS participated in the design and implementation of the framework. ADa and OG performed pre-processing of the data. OG, JS, and BDM helped draft the manuscript. ADe, JPM, and KH provided clinical input, looked up patient records in the database, performed sample annotation, and gathered follow-up of patients. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data files are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> shows the ROC curves of the optimal LS-SVM models for all considered combinations of data sets shown in Tables 2 and 4. Additional data file <supplr sid="S2">2</supplr> shows the results for the prediction of WHEELER, pN-STAGE, and CRM in rectal cancer, using step C models for which a sample is required at both time points and for which both technologies need to be performed. Additional data file <supplr sid="S3">3</supplr> contains additional tables 1-3 showing all genes and proteins selected by the best performing models MPT1 for the prediction of WHEELER, pN-STAGE, and CRM in rectal cancer. Additional data file <supplr sid="S3">3</supplr> also contains additional tables 4-7 showing, for prostate cancer, the genes and CNVs selected by the best performing models MG for the prediction of GRADE, STAGE, METASTASIS, and RECURRENCE. All tables in additional data file <supplr sid="S3">3</supplr> show the number of LOO iterations in which each gene, protein, or CNV was selected, their chromosomal region, and whether it is up- or down-regulated.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>ROC curves of the models shown in Tables <tblr tid="T2">2</tblr> and <tblr tid="T4">4</tblr></p>
            </caption>
            <text>
               <p>The ROC curves of the optimal LS-SVM models for all considered combinations of data sets shown in Tables <tblr tid="T2">2</tblr> and <tblr tid="T4">4</tblr> are shown. Additional Figures 1-3 show the ROC curves for the prediction of WHEELER, pN-STAGE, and CRM in rectal cancer, respectively. For prostate cancer, the ROC curves for the prediction of GRADE, STAGE, METASTASIS, and RECURRENCE are shown in additional Figures 4-7, respectively.</p>
            </text>
            <file name="gm39-S1.pdf">
   <p>Click here for file</p>
</file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional data file 2</p>
            </title>
            <caption>
               <p>Additional LS-SVM models for rectal cancer</p>
            </caption>
            <text>
               <p>The results for the prediction of WHEELER, pN-STAGE, and CRM in rectal cancer, using step C models for which a sample is required at both time points and for which both technologies need to be performed. The AUC value and the number of included features are shown for each model. Significance tests were performed to compare these models with the best model based on two data sets shown in bold in Table <tblr tid="T2">2</tblr>.</p>
            </text>
            <file name="gm39-S2.pdf">
   <p>Click here for file</p>
</file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional data file 3</p>
            </title>
            <caption>
               <p>Genes, proteins, and CNVs selected by the models <it>MPT</it><sub>1 </sub>and <it>MG</it></p>
            </caption>
            <text>
               <p>Additional Tables 1-3 show all genes and proteins selected by the best performing models <it>MPT</it><sub>1 </sub>for the prediction of WHEELER (25 genes, 12 proteins), pN-STAGE (21 genes, 14 proteins), and CRM (7 genes, 33 proteins) in rectal cancer. Additional Tables 4-7 show, for prostate cancer, the genes and CNVs selected by the best performing models <it>MG </it>for the prediction of GRADE (6 genes, 8 CNVs), STAGE (42 genes, 22 CNVs), METASTASIS (18 genes, 3 CNVs), and RECURRENCE (32 genes, 2 CNVs). All tables additionally show the number of LOO iterations in which each gene, protein, or CNV was selected, their chromosomal region, and whether it is up- or down-regulated.</p>
            </text>
            <file name="gm39-S3.pdf">
   <p>Click here for file</p>
</file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>ADa is research assistant of the Fund for Scientific Research-Flanders (FWO-Vlaanderen). BDM is a full professor at the Katholieke Universiteit Leuven, Belgium. The authors are grateful to Anja von Heydebreck, Detlef Guessow and Christopher Stroh for their contribution at Merck Serono. This work is partially supported by the following. (1) Research Council KUL: GOA AMBioRICS, CoE EF/05/007 SymBioSys, PROMETA, several PhD/postdoc and fellow grants. (2) Flemish Government: (a) FWO: PhD/postdoc grants, projects G.0241.04 (Functional Genomics), G.0499.04 (Statistics), G.0318.05 (subfunctionalization), G.0302.07 (SVM/Kernel), research communities (ICCoS, ANMMM, MLDM); (b) IWT: PhD Grants, GBOU-McKnow-E (Knowledge management algorithms), GBOU-ANA (biosensors), TAD-BioScope-IT, Silicos; SBO-BioFrame, SBO-MoKa, TBM-Endometriosis. (3) Belgian Federal Science Policy Office: IUAP P6/25 (BioMaGNet, Bioinformatics and Modeling: from Genomes to Networks, 2007-2011). (4) EU-RTD: ERNSI: European Research Network on System Identification; FP6-NoE Biopattern; FP6-IP e-Tumors, FP6-MC-EST Bioptrain, FP6-STREP Strokemap.</p>
         </sec>
      </ack>
      <refgrp><bibl id="B1"><aug><au><snm>Shawe-Taylor</snm><fnm>J</fnm></au><au><snm>Cristianini</snm><fnm>N</fnm></au></aug><source>Kernel Methods for Pattern Analysis</source><publisher>Cambridge: Cambridge University Press</publisher><pubdate>2004</pubdate></bibl><bibl id="B2"><title><p>Machine learning in bioinformatics: a brief survey and recommendations for practitioners.</p></title><aug><au><snm>Bhaskar</snm><fnm>H</fnm></au><au><snm>Hoyle</snm><fnm>DC</fnm></au><au><snm>Singh</snm><fnm>S</fnm></au></aug><source>Comput Biol Med</source><pubdate>2006</pubdate><volume>36</volume><fpage>1104</fpage><lpage>1125</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.compbiomed.2005.09.002</pubid><pubid idtype="pmpid" link="fulltext">16226240</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Least squares support vector machine classifiers.</p></title><aug><au><snm>Suykens</snm><fnm>JAK</fnm></au><au><snm>Vandewalle</snm><fnm>J</fnm></au></aug><source>Neural Processing Lett</source><pubdate>1999</pubdate><volume>9</volume><fpage>293</fpage><lpage>300</lpage><xrefbib><pubid idtype="doi">10.1023/A:1018628609742</pubid></xrefbib></bibl><bibl id="B4"><aug><au><snm>Suykens</snm><fnm>JAK</fnm></au><au><snm>Van Gestel</snm><fnm>T</fnm></au><au><snm>De Brabanter</snm><fnm>J</fnm></au><au><snm>De Moor</snm><fnm>B</fnm></au><au><snm>Vandewalle</snm><fnm>J</fnm></au></aug><source>Least Squares Support Vector Machines</source><publisher>Singapore: World Scientific</publisher><pubdate>2002</pubdate></bibl><bibl id="B5"><title><p>Leave-one-out cross-validation based model selection criteria for weighted LS-SVMs.</p></title><aug><au><snm>Cawley</snm><fnm>GC</fnm></au></aug><source>Proc Int Joint Conf on Neural Networks</source><pubdate>2006</pubdate><fpage>1661</fpage><lpage>1668</lpage><xrefbib><pubid idtype="doi">full_text</pubid></xrefbib></bibl><bibl id="B6"><title><p>Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.</p></title><aug><au><snm>Alon</snm><fnm>A</fnm></au><au><snm>Barkai</snm><fnm>N</fnm></au><au><snm>Notterman</snm><fnm>DA</fnm></au><au><snm>Gish</snm><fnm>K</fnm></au><au><snm>Ybarra</snm><fnm>S</fnm></au><au><snm>Mack</snm><fnm>D</fnm></au><au><snm>Levine</snm><fnm>AJ</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1999</pubdate><volume>96</volume><fpage>6745</fpage><lpage>6750</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.96.12.6745</pubid><pubid idtype="pmcid">21986</pubid><pubid idtype="pmpid" link="fulltext">10359783</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.</p></title><aug><au><snm>Golub</snm><fnm>TR</fnm></au><au><snm>Slonim</snm><fnm>DK</fnm></au><au><snm>Tamayo</snm><fnm>P</fnm></au><au><snm>Huard</snm><fnm>C</fnm></au><au><snm>Gaasenbeek</snm><fnm>M</fnm></au><au><snm>Mesirov</snm><fnm>JP</fnm></au><au><snm>Coller</snm><fnm>H</fnm></au><au><snm>Loh</snm><fnm>ML</fnm></au><au><snm>Downing</snm><fnm>JR</fnm></au><au><snm>Caligiuri</snm><fnm>MA</fnm></au><au><snm>Bloomfield</snm><fnm>CD</fnm></au><au><snm>Lander</snm><fnm>ES</fnm></au></aug><source>Science</source><pubdate>1999</pubdate><volume>286</volume><fpage>531</fpage><lpage>537</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.286.5439.531</pubid><pubid idtype="pmpid" link="fulltext">10521349</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Clinical application of the 70-gene profile: the MINDACT trial.</p></title><aug><au><snm>Cardoso</snm><fnm>F</fnm></au><au><snm>van't Veer</snm><fnm>L</fnm></au><au><snm>Rutgers</snm><fnm>E</fnm></au><au><snm>Loi</snm><fnm>S</fnm></au><au><snm>Mook</snm><fnm>S</fnm></au><au><snm>Piccart-Gebhart</snm><fnm>MJ</fnm></au></aug><source>J Clin Oncol</source><pubdate>2008</pubdate><volume>26</volume><fpage>729</fpage><lpage>735</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1200/JCO.2007.14.3222</pubid><pubid idtype="pmpid" link="fulltext">18258980</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>TAILORx: trial assigning individualized options for treatment (Rx).</p></title><aug><au><snm>Sparano</snm><fnm>JA</fnm></au></aug><source>Clin Breast Cancer</source><pubdate>2006</pubdate><volume>7</volume><fpage>347</fpage><lpage>350</lpage><xrefbib><pubidlist><pubid idtype="doi">10.3816/CBC.2006.n.051</pubid><pubid idtype="pmpid">17092406</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Development of the 21-gene assay and its application in clinical practice and clinical trials.</p></title><aug><au><snm>Sparano</snm><fnm>JA</fnm></au><au><snm>Paik</snm><fnm>S</fnm></au></aug><source>J Clin Oncol</source><pubdate>2008</pubdate><volume>26</volume><fpage>721</fpage><lpage>728</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1200/JCO.2007.15.1068</pubid><pubid idtype="pmpid" link="fulltext">18258979</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Array comparative genomic hybridization and its applications in cancer.</p></title><aug><au><snm>Pinkel</snm><fnm>D</fnm></au><au><snm>Albertson</snm><fnm>DG</fnm></au></aug><source>Nat Genet</source><pubdate>2005</pubdate><volume>37</volume><fpage>S11</fpage><lpage>S17</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1569</pubid><pubid idtype="pmpid" link="fulltext">15920524</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Epigenetics in cancer.</p></title><aug><au><snm>Esteller</snm><fnm>M</fnm></au></aug><source>N Engl J Med</source><pubdate>2008</pubdate><volume>358</volume><fpage>1148</fpage><lpage>1159</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1056/NEJMra072067</pubid><pubid idtype="pmpid" link="fulltext">18337604</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Global variation in copy number in the human genome.</p></title><aug><au><snm>Redon</snm><fnm>R</fnm></au><au><snm>Ishikawa</snm><fnm>S</fnm></au><au><snm>Fitch</snm><fnm>KR</fnm></au><au><snm>Feuk</snm><fnm>L</fnm></au><au><snm>Perry</snm><fnm>GH</fnm></au><au><snm>Andrews</snm><fnm>TD</fnm></au><au><snm>Fiegler</snm><fnm>H</fnm></au><au><snm>Shapero</snm><fnm>MH</fnm></au><au><snm>Carson</snm><fnm>AR</fnm></au><au><snm>Chen</snm><fnm>W</fnm></au><au><snm>Cho</snm><fnm>EK</fnm></au><au><snm>Dallaire</snm><fnm>S</fnm></au><au><snm>Freeman</snm><fnm>JL</fnm></au><au><snm>Gonz&#225;lez</snm><fnm>JR</fnm></au><au><snm>Gratac&#242;s</snm><fnm>M</fnm></au><au><snm>Huang</snm><fnm>J</fnm></au><au><snm>Kalaitzopoulos</snm><fnm>D</fnm></au><au><snm>Komura</snm><fnm>D</fnm></au><au><snm>MacDonald</snm><fnm>JR</fnm></au><au><snm>Marshall</snm><fnm>CR</fnm></au><au><snm>Mei</snm><fnm>R</fnm></au><au><snm>Montgomery</snm><fnm>L</fnm></au><au><snm>Nishimura</snm><fnm>K</fnm></au><au><snm>Okamura</snm><fnm>K</fnm></au><au><snm>Shen</snm><fnm>F</fnm></au><au><snm>Somerville</snm><fnm>MJ</fnm></au><au><snm>Tchinda</snm><fnm>J</fnm></au><au><snm>Valsesia</snm><fnm>A</fnm></au><au><snm>Woodwark</snm><fnm>C</fnm></au><au><snm>Yang</snm><fnm>F</fnm></au><etal/></aug><source>Nature</source><pubdate>2006</pubdate><volume>444</volume><fpage>444</fpage><lpage>454</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05329</pubid><pubid idtype="pmcid">2669898</pubid><pubid idtype="pmpid" link="fulltext">17122850</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Chromosomal abnormalities in cancer.</p></title><aug><au><snm>Frohling</snm><fnm>S</fnm></au><au><snm>Dohner</snm><fnm>H</fnm></au></aug><source>N Engl J Med</source><pubdate>2008</pubdate><volume>359</volume><fpage>722</fpage><lpage>734</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1056/NEJMra0803109</pubid><pubid idtype="pmpid" link="fulltext">18703475</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>The molecular make-up of a tumor: proteomics in cancer research.</p></title><aug><au><snm>Kolch</snm><fnm>W</fnm></au><au><snm>Mischak</snm><fnm>H</fnm></au><au><snm>Pitt</snm><fnm>AR</fnm></au></aug><source>Clin Sci</source><pubdate>2005</pubdate><volume>108</volume><fpage>369</fpage><lpage>383</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1042/CS20050006</pubid><pubid idtype="pmpid" link="fulltext">15831087</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Mass spectrometry-based proteomics.</p></title><aug><au><snm>Aebersold</snm><fnm>R</fnm></au><au><snm>Mann</snm><fnm>M</fnm></au></aug><source>Nature</source><pubdate>2003</pubdate><volume>422</volume><fpage>198</fpage><lpage>207</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature01511</pubid><pubid idtype="pmpid" link="fulltext">12634793</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Printing proteins as microarrays for high-throughput function determination.</p></title><aug><au><snm>MacBeatch</snm><fnm>G</fnm></au><au><snm>Schreiber</snm><fnm>SL</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>289</volume><fpage>1760</fpage><lpage>1763</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">10976071</pubid></xrefbib></bibl><bibl id="B18"><title><p>Systematic assessment of copy number variant detection via genome-wide SNP genotyping.</p></title><aug><au><snm>Cooper</snm><fnm>GM</fnm></au><au><snm>Zerr</snm><fnm>T</fnm></au><au><snm>Kidd</snm><fnm>JM</fnm></au><au><snm>Eichler</snm><fnm>EE</fnm></au><au><snm>Nickerson</snm><fnm>DA</fnm></au></aug><source>Nat Genet</source><pubdate>2008</pubdate><volume>40</volume><fpage>1199</fpage><lpage>1203</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.236</pubid><pubid idtype="pmcid">2759751</pubid><pubid idtype="pmpid" link="fulltext">18776910</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Pre-validation and inference in microarrays.</p></title><aug><au><snm>Tibshirani</snm><fnm>RJ</fnm></au><au><snm>Efron</snm><fnm>B</fnm></au></aug><source>Stat Appl Genet Mol Biol</source><pubdate>2002</pubdate><volume>1</volume><fpage>Article 1</fpage><xrefbib><pubid idtype="pubmed">16646777</pubid></xrefbib></bibl><bibl id="B20"><title><p>Towards integrated clinico-genomic models for personalized medicine: combining gene expression signatures and clinical factors in breast cancer outcomes prediction.</p></title><aug><au><snm>Nevins</snm><fnm>JR</fnm></au><au><snm>Huang</snm><fnm>ES</fnm></au><au><snm>Dressman</snm><fnm>H</fnm></au><au><snm>Pittman</snm><fnm>J</fnm></au><au><snm>Huang</snm><fnm>AT</fnm></au><au><snm>West</snm><fnm>M</fnm></au></aug><source>Hum Mol Genet</source><pubdate>2003</pubdate><volume>12</volume><fpage>R153</fpage><lpage>R157</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/hmg/ddg287</pubid><pubid idtype="pmpid" link="fulltext">12928487</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Identification and validation of a novel gene signature associated with the recurrence of human hepatocellular carcinoma.</p></title><aug><au><snm>Wang</snm><fnm>SM</fnm></au><au><snm>Ooi</snm><fnm>LL</fnm></au><au><snm>Hui</snm><fnm>KM</fnm></au></aug><source>Clin Cancer Res</source><pubdate>2007</pubdate><volume>13</volume><fpage>6275</fpage><lpage>6283</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/1078-0432.CCR-06-2236</pubid><pubid idtype="pmpid" link="fulltext">17975138</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>From bytes to bedside: data integration and computational biology for translational cancer research.</p></title><aug><au><snm>Mathew</snm><fnm>JP</fnm></au><au><snm>Taylor</snm><fnm>BS</fnm></au><au><snm>Bader</snm><fnm>GD</fnm></au><au><snm>Pyarajan</snm><fnm>S</fnm></au><au><snm>Antoniotti</snm><fnm>M</fnm></au><au><snm>Chinnaiyan</snm><fnm>AM</fnm></au><au><snm>Sander</snm><fnm>C</fnm></au><au><snm>Burakoff</snm><fnm>SJ</fnm></au><au><snm>Mishra</snm><fnm>B</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2007</pubdate><volume>3</volume><fpage>e12</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.0030012</pubid><pubid idtype="pmcid">1808026</pubid><pubid idtype="pmpid" link="fulltext">17319736</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Breast tumor copy number aberration phenotypes and genomic instability.</p></title><aug><au><snm>Fridlyand</snm><fnm>J</fnm></au><au><snm>Snijders</snm><fnm>AM</fnm></au><au><snm>Ylstra</snm><fnm>B</fnm></au><au><snm>Li</snm><fnm>H</fnm></au><au><snm>Olshen</snm><fnm>A</fnm></au><au><snm>Segraves</snm><fnm>R</fnm></au><au><snm>Dairkee</snm><fnm>S</fnm></au><au><snm>Tokuyasu</snm><fnm>T</fnm></au><au><snm>Ljung</snm><fnm>BM</fnm></au><au><snm>Jain</snm><fnm>AN</fnm></au><au><snm>McLennan</snm><fnm>J</fnm></au><au><snm>Ziegler</snm><fnm>J</fnm></au><au><snm>Chin</snm><fnm>K</fnm></au><au><snm>Devries</snm><fnm>S</fnm></au><au><snm>Feiler</snm><fnm>H</fnm></au><au><snm>Gray</snm><fnm>JW</fnm></au><au><snm>Waldman</snm><fnm>F</fnm></au><au><snm>Pinkel</snm><fnm>D</fnm></au><au><snm>Albertson</snm><fnm>DG</fnm></au></aug><source>BMC Cancer</source><pubdate>2006</pubdate><volume>6</volume><fpage>96</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2407-6-96</pubid><pubid idtype="pmcid">1459181</pubid><pubid idtype="pmpid" link="fulltext">16620391</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Novel risk stratification of patients with neuroblastoma by genomic signature, which is independent of molecular signature.</p></title><aug><au><snm>Tomioka</snm><fnm>N</fnm></au><au><snm>Oba</snm><fnm>S</fnm></au><au><snm>Ohira</snm><fnm>M</fnm></au><au><snm>Misra</snm><fnm>A</fnm></au><au><snm>Fridlyand</snm><fnm>J</fnm></au><au><snm>Ishii</snm><fnm>S</fnm></au><au><snm>Nakamura</snm><fnm>Y</fnm></au><au><snm>Isogai</snm><fnm>E</fnm></au><au><snm>Hirata</snm><fnm>T</fnm></au><au><snm>Yoshida</snm><fnm>Y</fnm></au><au><snm>Todo</snm><fnm>S</fnm></au><au><snm>Kanedo</snm><fnm>Y</fnm></au><au><snm>Albertson</snm><fnm>DG</fnm></au><au><snm>Pinkel</snm><fnm>D</fnm></au><au><snm>Feuerstein</snm><fnm>BG</fnm></au><au><snm>Nakagawara</snm><fnm>A</fnm></au></aug><source>Oncogene</source><pubdate>2008</pubdate><volume>27</volume><fpage>441</fpage><lpage>449</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.onc.1210661</pubid><pubid idtype="pmpid" link="fulltext">17637744</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Data merging for integrated microarray and proteomic analysis.</p></title><aug><au><snm>Waters</snm><fnm>KM</fnm></au><au><snm>Pounds</snm><fnm>JG</fnm></au><au><snm>Thrall</snm><fnm>BD</fnm></au></aug><source>Brief Funct Genomic Proteomic</source><pubdate>2006</pubdate><volume>5</volume><fpage>261</fpage><lpage>272</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bfgp/ell019</pubid><pubid idtype="pmpid" link="fulltext">16772273</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>State of the nation in data integration for bioinformatics.</p></title><aug><au><snm>Goble</snm><fnm>C</fnm></au><au><snm>Stevens</snm><fnm>R</fnm></au></aug><source>J Biomed Inform</source><pubdate>2008</pubdate><volume>41</volume><fpage>687</fpage><lpage>693</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.jbi.2008.01.008</pubid><pubid idtype="pmpid" link="fulltext">18358788</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Exon level integration of proteomics and microarray data.</p></title><aug><au><snm>Bitton</snm><fnm>DA</fnm></au><au><snm>Okoniewski</snm><fnm>MJ</fnm></au><au><snm>Connolly</snm><fnm>Y</fnm></au><au><snm>Miller</snm><fnm>CJ</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2008</pubdate><volume>9</volume><fpage>118</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-9-118</pubid><pubid idtype="pmcid">2267708</pubid><pubid idtype="pmpid" link="fulltext">18298841</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>A statistical framework for genomic data fusion.</p></title><aug><au><snm>Lanckriet</snm><fnm>GRG</fnm></au><au><snm>De Bie</snm><fnm>T</fnm></au><au><snm>Cristianini</snm><fnm>N</fnm></au><au><snm>Jordan</snm><fnm>MI</fnm></au><au><snm>Noble</snm><fnm>WS</fnm></au></aug><source>Bioinformatics</source><pubdate>2004</pubdate><volume>20</volume><fpage>2626</fpage><lpage>2635</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bth294</pubid><pubid idtype="pmpid" link="fulltext">15130933</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Integration of clinical and microarray data with kernel methods.</p></title><aug><au><snm>Daemen</snm><fnm>A</fnm></au><au><snm>Gevaert</snm><fnm>O</fnm></au><au><snm>Moor</snm><fnm>BD</fnm></au></aug><source>Conf Proc IEEE Eng Med Biol Soc</source><pubdate>2007</pubdate><fpage>5411</fpage><lpage>5415</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">18003232</pubid></xrefbib></bibl><bibl id="B30"><title><p>Gene expression profiling identifies clinically relevant subtypes of prostate cancer.</p></title><aug><au><snm>Lapointe</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>C</fnm></au><au><snm>Higgins</snm><fnm>JP</fnm></au><au><snm>Rijn</snm><mnm>van de</mnm><fnm>M</fnm></au><au><snm>Bair</snm><fnm>E</fnm></au><au><snm>Montgomery</snm><fnm>K</fnm></au><au><snm>Ferrari</snm><fnm>M</fnm></au><au><snm>Egevad</snm><fnm>L</fnm></au><au><snm>Rayford</snm><fnm>W</fnm></au><au><snm>Bergerheim</snm><fnm>U</fnm></au><au><snm>Ekman</snm><fnm>P</fnm></au><au><snm>DeMarzo</snm><fnm>AM</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au><au><snm>Botstein</snm><fnm>D</fnm></au><au><snm>Brown</snm><fnm>PO</fnm></au><au><snm>Brooks</snm><fnm>JD</fnm></au><au><snm>Pollack</snm><fnm>JR</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2004</pubdate><volume>101</volume><fpage>811</fpage><lpage>816</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0304146101</pubid><pubid idtype="pmcid">321763</pubid><pubid idtype="pmpid" link="fulltext">14711987</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Genomic profiling reveals alternative genetic pathways of prostate tumorigenesis.</p></title><aug><au><snm>Lapointe</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>C</fnm></au><au><snm>Giacomini</snm><fnm>CP</fnm></au><au><snm>Salari</snm><fnm>K</fnm></au><au><snm>Huang</snm><fnm>S</fnm></au><au><snm>Wang</snm><fnm>P</fnm></au><au><snm>Ferrari</snm><fnm>M</fnm></au><au><snm>Hernandez-Boussard</snm><fnm>T</fnm></au><au><snm>Brooks</snm><fnm>JD</fnm></au><au><snm>Pollack</snm><fnm>JR</fnm></au></aug><source>Cancer Res</source><pubdate>2007</pubdate><volume>67</volume><fpage>8504</fpage><lpage>8510</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-07-0673</pubid><pubid idtype="pmpid" link="fulltext">17875689</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Phase I/II study of preoperative cetuximab, capecitabine, and external beam radiotherapy in patients with rectal cancer.</p></title><aug><au><snm>Machiels</snm><fnm>JP</fnm></au><au><snm>Sempoux</snm><fnm>C</fnm></au><au><snm>Scalliet</snm><fnm>P</fnm></au><au><snm>Coche</snm><fnm>JC</fnm></au><au><snm>Humblet</snm><fnm>Y</fnm></au><au><snm>Van Cutsem</snm><fnm>E</fnm></au><au><snm>Kerger</snm><fnm>J</fnm></au><au><snm>Canon</snm><fnm>JL</fnm></au><au><snm>Peeters</snm><fnm>M</fnm></au><au><snm>Aydin</snm><fnm>S</fnm></au><au><snm>Laurent</snm><fnm>S</fnm></au><au><snm>Kartheuser</snm><fnm>A</fnm></au><au><snm>Coster</snm><fnm>B</fnm></au><au><snm>Roels</snm><fnm>S</fnm></au><au><snm>Daisne</snm><fnm>JF</fnm></au><au><snm>Honhon</snm><fnm>B</fnm></au><au><snm>Duck</snm><fnm>L</fnm></au><au><snm>Kirkove</snm><fnm>C</fnm></au><au><snm>Bonny</snm><fnm>MA</fnm></au><au><snm>Haustermans</snm><fnm>K</fnm></au></aug><source>Ann Oncol</source><pubdate>2007</pubdate><volume>18</volume><fpage>738</fpage><lpage>744</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/annonc/mdl460</pubid><pubid idtype="pmpid" link="fulltext">17208931</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Exploration, normalization and summaries of high density oligonucleotide array probe level data.</p></title><aug><au><snm>Irizarry</snm><fnm>RA</fnm></au><au><snm>Hobbs</snm><fnm>B</fnm></au><au><snm>Collin</snm><fnm>F</fnm></au><au><snm>Beazer-Barclay</snm><fnm>YD</fnm></au><au><snm>Antonellis</snm><fnm>KJ</fnm></au><au><snm>Scherf</snm><fnm>U</fnm></au><au><snm>Speed</snm><fnm>TP</fnm></au></aug><source>Biostatistics</source><pubdate>2003</pubdate><volume>4</volume><fpage>249</fpage><lpage>264</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/biostatistics/4.2.249</pubid><pubid idtype="pmpid" link="fulltext">12925520</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Quantification of histologic regression of rectal cancer after irradiation.</p></title><aug><au><snm>Wheeler</snm><fnm>JMD</fnm></au><au><snm>Warren</snm><fnm>BF</fnm></au><au><snm>Mortensen</snm><fnm>NJ</fnm></au><au><snm>Ekanyaka</snm><fnm>N</fnm></au><au><snm>Kulacoglu</snm><fnm>H</fnm></au><au><snm>Jones</snm><fnm>AC</fnm></au><au><snm>George</snm><fnm>BD</fnm></au><au><snm>Kettlewell</snm><fnm>MGW</fnm></au></aug><source>Dis Colon Rectum</source><pubdate>2002</pubdate><volume>45</volume><fpage>1051</fpage><lpage>1056</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s10350-004-6359-x</pubid><pubid idtype="pmpid">12195189</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>What is the best to predict disease-free survival after preoperative radiochemotherapy for rectal cancer patients: tumor regression grading, nodal status or circumferential resection margin invasion?</p></title><aug><au><snm>Machiels</snm><fnm>JP</fnm></au><au><snm>Aydin</snm><fnm>S</fnm></au><au><snm>Bonny</snm><fnm>MA</fnm></au><au><snm>Hammouch</snm><fnm>F</fnm></au><au><snm>Sempoux</snm><fnm>C</fnm></au></aug><source>J Clin Oncol</source><pubdate>2006</pubdate><volume>24</volume><fpage>1319</fpage><lpage>1321</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1200/JCO.2005.05.0963</pubid><pubid idtype="pmpid" link="fulltext">16525188</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Role of circumferential margin involvement in the local recurrence of rectal cancer.</p></title><aug><au><snm>Adam</snm><fnm>IJ</fnm></au><au><snm>Mohamdee</snm><fnm>MO</fnm></au><au><snm>Martin</snm><fnm>IG</fnm></au><au><snm>Scott</snm><fnm>N</fnm></au><au><snm>Finan</snm><fnm>PJ</fnm></au><au><snm>Johnston</snm><fnm>D</fnm></au><au><snm>Dixon</snm><fnm>MF</fnm></au><au><snm>Quirke</snm><fnm>P</fnm></au></aug><source>Lancet</source><pubdate>1994</pubdate><volume>344</volume><fpage>707</fpage><lpage>711</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0140-6736(94)92206-3</pubid><pubid idtype="pmpid">7915774</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Local recurrence of rectal adenocarcinoma due to inadequate surgical resection: histopathological study of lateral tumor spread and surgical excision.</p></title><aug><au><snm>Quirke</snm><fnm>P</fnm></au><au><snm>Durdey</snm><fnm>P</fnm></au><au><snm>Dixon</snm><fnm>MF</fnm></au><au><snm>Williams</snm><fnm>NS</fnm></au></aug><source>Lancet</source><pubdate>1986</pubdate><volume>2</volume><fpage>996</fpage><lpage>999</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0140-6736(86)92612-7</pubid><pubid idtype="pmpid" link="fulltext">2430152</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Missing value estimation methods for DNA microarrays.</p></title><aug><au><snm>Troyanskaya</snm><fnm>O</fnm></au><au><snm>Cantor</snm><fnm>M</fnm></au><au><snm>Sherlock</snm><fnm>G</fnm></au><au><snm>Brown</snm><fnm>P</fnm></au><au><snm>Hastie</snm><fnm>T</fnm></au><au><snm>Tibshirani</snm><fnm>R</fnm></au><au><snm>Botstein</snm><fnm>D</fnm></au><au><snm>Altman</snm><fnm>RB</fnm></au></aug><source>Bioinformatics</source><pubdate>2001</pubdate><volume>17</volume><fpage>520</fpage><lpage>525</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/17.6.520</pubid><pubid idtype="pmpid" link="fulltext">11395428</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Classification of prostatic carcinomas.</p></title><aug><au><snm>Gleason</snm><fnm>DF</fnm></au></aug><source>Cancer Chemother Rep</source><pubdate>1966</pubdate><volume>50</volume><fpage>125</fpage><lpage>128</lpage><xrefbib><pubid idtype="pmpid">5948714</pubid></xrefbib></bibl><bibl id="B40"><aug><au><snm>Scholkopf</snm><fnm>B</fnm></au><au><snm>Tsuda</snm><fnm>K</fnm></au><au><snm>Vert</snm><fnm>JP</fnm></au></aug><source>Kernel Methods in Computational Biology</source><publisher>Cambridge, MA: MIT Press</publisher><pubdate>2004</pubdate></bibl><bibl id="B41"><aug><au><snm>Vapnik</snm><fnm>V</fnm></au></aug><source>Statistical Learning Theory</source><publisher>New York: Wiley</publisher><pubdate>1998</pubdate></bibl><bibl id="B42"><title><p>Systematic benchmarking of microarray data classification: assessing the role of nonlinearity and dimensionality reduction.</p></title><aug><au><snm>Pochet</snm><fnm>N</fnm></au><au><snm>De Smet</snm><fnm>F</fnm></au><au><snm>Suykens</snm><fnm>J</fnm></au><au><snm>Moor</snm><fnm>BD</fnm></au></aug><source>Bioinformatics</source><pubdate>2004</pubdate><volume>20</volume><fpage>3185</fpage><lpage>3195</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bth383</pubid><pubid idtype="pmpid" link="fulltext">15231531</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>A comparison of univariate and multivariate gene selection techniques for classification of cancer datasets.</p></title><aug><au><snm>Lai</snm><fnm>C</fnm></au><au><snm>Reinders</snm><fnm>MJT</fnm></au><au><snm>van't Veer</snm><fnm>LJ</fnm></au><au><snm>Wessels</snm><fnm>LFA</fnm></au></aug><source>Bioinformatics</source><pubdate>2006</pubdate><volume>7</volume><fpage>235</fpage><lpage>244</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-7-235</pubid><pubid idtype="pmcid">1569875</pubid><pubid idtype="pmpid" link="fulltext">16670007</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Identifying differentially expressed genes from microarray experiments via statistic synthesis.</p></title><aug><au><snm>Yang</snm><fnm>YH</fnm></au><au><snm>Xiao</snm><fnm>Y</fnm></au><au><snm>Segal</snm><fnm>MR</fnm></au></aug><source>Bioinformatics</source><pubdate>2005</pubdate><volume>21</volume><fpage>1084</fpage><lpage>1093</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/bti108</pubid><pubid idtype="pmpid" link="fulltext">15513985</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>How many genes are needed for a discriminant microarray data analysis.</p></title><aug><au><snm>Li</snm><fnm>W</fnm></au><au><snm>Yang</snm><fnm>Y</fnm></au></aug><source>Methods of Microarray Data Analysis</source><publisher>Kluwer Academic</publisher><editor>Lin SM, Johnson KF</editor><pubdate>2002</pubdate><fpage>137</fpage><lpage>150</lpage></bibl><bibl id="B46"><title><p>A method of comparing the areas under receiver operating characteristics curves derived from the same cases.</p></title><aug><au><snm>Hanley</snm><fnm>JA</fnm></au><au><snm>McNeil</snm><fnm>BJ</fnm></au></aug><source>Radiology</source><pubdate>1983</pubdate><volume>148</volume><fpage>839</fpage><lpage>843</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">6878708</pubid></xrefbib></bibl><bibl id="B47"><title><p>Gene functional classification from heterogeneous data.</p></title><aug><au><snm>Pavlidis</snm><fnm>P</fnm></au><au><snm>Weston</snm><fnm>J</fnm></au><au><snm>Cai</snm><fnm>J</fnm></au><au><snm>Grundy</snm><fnm>WN</fnm></au></aug><source>Proceedings of the Fifth Annual International Conference on Computational Biology: April 22-25, 2001; Montreal, Quebec, Canada</source><publisher>New York, NY: ACM</publisher><pubdate>2001</pubdate><fpage>242</fpage><lpage>252</lpage></bibl><bibl id="B48"><title><p>Multiclass multiple kernel learning.</p></title><aug><au><snm>Zien</snm><fnm>A</fnm></au><au><snm>Ong</snm><fnm>CS</fnm></au></aug><source>Proceedings of the 24th International Conference on Machine Learning: June 20-24, 2007; Corvalis, Oregon</source><publisher>New York, NY: ACM</publisher><pubdate>2007</pubdate><fpage>1191</fpage><lpage>1198</lpage></bibl><bibl id="B49"><title><p>Epidermal growth factor receptor gene polymorphisms predict pelvic recurrence in patients with rectal cancer treated with chemoradiation.</p></title><aug><au><snm>Zhang</snm><fnm>W</fnm></au><au><snm>Park</snm><fnm>DJ</fnm></au><au><snm>Lu</snm><fnm>B</fnm></au><au><snm>Yang</snm><fnm>DY</fnm></au><au><snm>Gordon</snm><fnm>M</fnm></au><au><snm>Groshen</snm><fnm>S</fnm></au><au><snm>Yun</snm><fnm>J</fnm></au><au><snm>Press</snm><fnm>OA</fnm></au><au><snm>Vallbohmer</snm><fnm>D</fnm></au><au><snm>Rhodes</snm><fnm>K</fnm></au><au><snm>Lenz</snm><fnm>HJ</fnm></au></aug><source>Clin Cancer Res</source><pubdate>2005</pubdate><volume>11</volume><fpage>600</fpage><lpage>605</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15701846</pubid></xrefbib></bibl><bibl id="B50"><title><p>Expression of cyclooxygenase-2 parallels expression of interleukin-1beta, interleukin-6 and NF-kappaB in human colorectal cancer.</p></title><aug><au><snm>Maihofner</snm><fnm>C</fnm></au><au><snm>Charalambous</snm><fnm>MP</fnm></au><au><snm>Bhambra</snm><fnm>U</fnm></au><au><snm>Lightfoot</snm><fnm>T</fnm></au><au><snm>Geisslinger</snm><fnm>G</fnm></au><au><snm>Gooderham</snm><fnm>NJ</fnm></au><au><cnm>The Colorectal Cancer Group</cnm></au></aug><source>Carcinogenesis</source><pubdate>2003</pubdate><volume>24</volume><fpage>665</fpage><lpage>671</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/carcin/bgg006</pubid><pubid idtype="pmpid" link="fulltext">12727794</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Integrin <it>&#945;</it>2 and extracellular signal-regulated kinase are functionally linked in highly malignant autocrine transforming growth factor-<it>&#945;</it>-driven colon cancer cells.</p></title><aug><au><snm>Sawhney</snm><fnm>RS</fnm></au><au><snm>Sharma</snm><fnm>B</fnm></au><au><snm>Humphrey</snm><fnm>LE</fnm></au><au><snm>Brattain</snm><fnm>MG</fnm></au></aug><source>J Biol Chem</source><pubdate>2003</pubdate><volume>278</volume><fpage>19861</fpage><lpage>19869</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M213162200</pubid><pubid idtype="pmpid" link="fulltext">12657625</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Correlation of IL-8 with induction, progression and metastatic potential of colorectal cancer.</p></title><aug><au><snm>Rubie</snm><fnm>C</fnm></au><au><snm>Frick</snm><fnm>VO</fnm></au><au><snm>Pfeil</snm><fnm>S</fnm></au><au><snm>Wagner</snm><fnm>M</fnm></au><au><snm>Kollmar</snm><fnm>O</fnm></au><au><snm>Kopp</snm><fnm>B</fnm></au><au><snm>Graber</snm><fnm>S</fnm></au><au><snm>Rau</snm><fnm>BM</fnm></au><au><snm>Schilling</snm><fnm>MK</fnm></au></aug><source>World J Gastroenterol</source><pubdate>2007</pubdate><volume>13</volume><fpage>4996</fpage><lpage>5002</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17854143</pubid></xrefbib></bibl><bibl id="B53"><title><p>Serum HCG beta, CA 72-4 and CEA are independent prognostic factors in colorectal cancer.</p></title><aug><au><snm>Louhimo</snm><fnm>J</fnm></au><au><snm>Carpelan-Holmstrom</snm><fnm>M</fnm></au><au><snm>Alfthan</snm><fnm>H</fnm></au><au><snm>Stenman</snm><fnm>UH</fnm></au><au><snm>Jarvinen</snm><fnm>HJ</fnm></au><au><snm>Haglund</snm><fnm>C</fnm></au></aug><source>Int J Cancer</source><pubdate>2002</pubdate><volume>101</volume><fpage>545</fpage><lpage>548</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/ijc.90009</pubid><pubid idtype="pmpid" link="fulltext">12237895</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Subcellular localization and tumor-suppressive functions of 15-lipoxygenase 2 (15-LOX2) and its splice variants.</p></title><aug><au><snm>Bhatia</snm><fnm>B</fnm></au><au><snm>Maldonado</snm><fnm>CJ</fnm></au><au><snm>Tang</snm><fnm>S</fnm></au><au><snm>Chandra</snm><fnm>D</fnm></au><au><snm>Klein</snm><fnm>RD</fnm></au><au><snm>Chopra</snm><fnm>D</fnm></au><au><snm>Shappell</snm><fnm>SB</fnm></au><au><snm>Yang</snm><fnm>P</fnm></au><au><snm>Newman</snm><fnm>RA</fnm></au><au><snm>Tang</snm><fnm>DG</fnm></au></aug><source>J Biol Chem</source><pubdate>2003</pubdate><volume>278</volume><fpage>25091</fpage><lpage>25100</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1074/jbc.M301920200</pubid><pubid idtype="pmpid" link="fulltext">12704195</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Secreted frizzled-related protein 4 inhibits proliferation and metastatic potential in prostate cancer.</p></title><aug><au><snm>Horvath</snm><fnm>LG</fnm></au><au><snm>Lelliott</snm><fnm>JE</fnm></au><au><snm>Kench</snm><fnm>JG</fnm></au><au><snm>Lee</snm><fnm>CS</fnm></au><au><snm>Williams</snm><fnm>ED</fnm></au><au><snm>Saunders</snm><fnm>DN</fnm></au><au><snm>Grvgiel</snm><fnm>JJ</fnm></au><au><snm>Sutherland</snm><fnm>RL</fnm></au><au><snm>Henshall</snm><fnm>SM</fnm></au></aug><source>Prostate</source><pubdate>2007</pubdate><volume>67</volume><fpage>1081</fpage><lpage>1090</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/pros.20607</pubid><pubid idtype="pmpid" link="fulltext">17476687</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>Modulation of CXCL14 (BRAK) expression in prostate cancer.</p></title><aug><au><snm>Schwarze</snm><fnm>SR</fnm></au><au><snm>Luo</snm><fnm>J</fnm></au><au><snm>Isaacs</snm><fnm>WB</fnm></au><au><snm>Jarrard</snm><fnm>DF</fnm></au></aug><source>Prostate</source><pubdate>2005</pubdate><volume>64</volume><fpage>67</fpage><lpage>74</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/pros.20215</pubid><pubid idtype="pmpid" link="fulltext">15651028</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>Mapping of TMPRSS2-ERG fusions in the context of multi-focal prostate cancer.</p></title><aug><au><snm>Furusato</snm><fnm>B</fnm></au><au><snm>Gao</snm><fnm>CL</fnm></au><au><snm>Ravindranath</snm><fnm>L</fnm></au><au><snm>Chen</snm><fnm>Y</fnm></au><au><snm>Cullen</snm><fnm>J</fnm></au><au><snm>McLeod</snm><fnm>DG</fnm></au><au><snm>Dobi</snm><fnm>A</fnm></au><au><snm>Srivastava</snm><fnm>S</fnm></au><au><snm>Petrovics</snm><fnm>G</fnm></au><au><snm>Sesterhenn</snm><fnm>IA</fnm></au></aug><source>Mod Pathol</source><pubdate>2008</pubdate><volume>21</volume><fpage>67</fpage><lpage>75</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.modpathol.3801030</pubid><pubid idtype="pmpid" link="fulltext">18065961</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>Expression of the TMPRSS2:ERG fusion gene predicts cancer recurrence after surgery for localised prostate cancer.</p></title><aug><au><snm>Nam</snm><fnm>RK</fnm></au><au><snm>Sugar</snm><fnm>L</fnm></au><au><snm>Yang</snm><fnm>W</fnm></au><au><snm>Srivastava</snm><fnm>S</fnm></au><au><snm>Klotz</snm><fnm>LH</fnm></au><au><snm>Yang</snm><fnm>LY</fnm></au><au><snm>Stanimirovic</snm><fnm>A</fnm></au><au><snm>Encioiu</snm><fnm>E</fnm></au><au><snm>Neill</snm><fnm>M</fnm></au><au><snm>Loblaw</snm><fnm>DA</fnm></au><au><snm>Trachtenberg</snm><fnm>J</fnm></au><au><snm>Narod</snm><fnm>SA</fnm></au><au><snm>Seth</snm><fnm>A</fnm></au></aug><source>Br J Cancer</source><pubdate>2007</pubdate><volume>97</volume><fpage>1690</fpage><lpage>1695</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.bjc.6604054</pubid><pubid idtype="pmcid">2360284</pubid><pubid idtype="pmpid" link="fulltext">17971772</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Vav3 oncogene is overexpressed and regulates cell growth and androgen receptor activity in human prostate cancer.</p></title><aug><au><snm>Dong</snm><fnm>Z</fnm></au><au><snm>Liu</snm><fnm>Y</fnm></au><au><snm>Lu</snm><fnm>S</fnm></au><au><snm>Wang</snm><fnm>A</fnm></au><au><snm>Lee</snm><fnm>K</fnm></au><au><snm>Wang</snm><fnm>LH</fnm></au><au><snm>Revelo</snm><fnm>M</fnm></au><au><snm>Lu</snm><fnm>S</fnm></au></aug><source>Mol Endocrinol</source><pubdate>2006</pubdate><volume>20</volume><fpage>2315</fpage><lpage>2325</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1210/me.2006-0048</pubid><pubid idtype="pmpid" link="fulltext">16762975</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>Prognostic relevance of Tiam1 protein expression in prostate carcinomas.</p></title><aug><au><snm>Engers</snm><fnm>R</fnm></au><au><snm>Mueller</snm><fnm>M</fnm></au><au><snm>Walter</snm><fnm>A</fnm></au><au><snm>Collard</snm><fnm>JG</fnm></au><au><snm>Willers</snm><fnm>R</fnm></au><au><snm>Gabbert</snm><fnm>HE</fnm></au></aug><source>Br J Cancer</source><pubdate>2006</pubdate><volume>95</volume><fpage>1081</fpage><lpage>1086</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.bjc.6603385</pubid><pubid idtype="pmcid">2360703</pubid><pubid idtype="pmpid" link="fulltext">17003780</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>JAGGED1 expression is associated with prostate cancer metastasis and recurrence.</p></title><aug><au><snm>Santagata</snm><fnm>S</fnm></au><au><snm>Demichelis</snm><fnm>F</fnm></au><au><snm>Riva</snm><fnm>A</fnm></au><au><snm>Varambally</snm><fnm>S</fnm></au><au><snm>Hofer</snm><fnm>MD</fnm></au><au><snm>Kutok</snm><fnm>JL</fnm></au><au><snm>Kim</snm><fnm>R</fnm></au><au><snm>Tang</snm><fnm>J</fnm></au><au><snm>Montie</snm><fnm>JE</fnm></au><au><snm>Chinnaiyan</snm><fnm>AM</fnm></au><au><snm>Rubin</snm><fnm>MA</fnm></au><au><snm>Aster</snm><fnm>JC</fnm></au></aug><source>Cancer Res</source><pubdate>2004</pubdate><volume>64</volume><fpage>6854</fpage><lpage>6857</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-04-2500</pubid><pubid idtype="pmpid" link="fulltext">15466172</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Implications for RNase L in prostate cancer biology.</p></title><aug><au><snm>Silverman</snm><fnm>RH</fnm></au></aug><source>Biochemistry</source><pubdate>2003</pubdate><volume>42</volume><fpage>1805</fpage><lpage>1812</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1021/bi027147i</pubid><pubid idtype="pmpid" link="fulltext">12590567</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>What proportion of patients referred to secondary care with iron deficiency anemia have colon cancer?</p></title><aug><au><snm>Raje</snm><fnm>D</fnm></au><au><snm>Mukhtar</snm><fnm>H</fnm></au><au><snm>Oshowo</snm><fnm>A</fnm></au><au><snm>Clark</snm><fnm>CI</fnm></au></aug><source>Dis Colon Rectum</source><pubdate>2007</pubdate><volume>50</volume><fpage>1211</fpage><lpage>1214</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s10350-007-0249-y</pubid><pubid idtype="pmpid" link="fulltext">17587088</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Epidermal growth factor receptor (EGFR) as a target in cancer therapy: understanding the role of receptor expression and other molecular determinants that could influence the response to anti-EGFR drugs.</p></title><aug><au><snm>Ciardiello</snm><fnm>F</fnm></au><au><snm>Tortora</snm><fnm>G</fnm></au></aug><source>Eur J Cancer</source><pubdate>2003</pubdate><volume>39</volume><fpage>1348</fpage><lpage>1354</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0959-8049(03)00235-1</pubid><pubid idtype="pmpid" link="fulltext">12826036</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Activity and expression of urokinase-type plasminogen activator and matrix metalloproteinases in human colorectal cancer.</p></title><aug><au><snm>Kim</snm><fnm>TD</fnm></au><au><snm>Song</snm><fnm>KS</fnm></au><au><snm>Li</snm><fnm>G</fnm></au><au><snm>Choi</snm><fnm>H</fnm></au><au><snm>Park</snm><fnm>HD</fnm></au><au><snm>Lim</snm><fnm>K</fnm></au><au><snm>Hwang</snm><fnm>BD</fnm></au><au><snm>Yoon</snm><fnm>WH</fnm></au></aug><source>BMC Cancer</source><pubdate>2006</pubdate><volume>6</volume><fpage>211</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2407-6-211</pubid><pubid idtype="pmcid">1563482</pubid><pubid idtype="pmpid" link="fulltext">16916471</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Serum levels of soluble E-selectin in colorectal cancer.</p></title><aug><au><snm>Uner</snm><fnm>A</fnm></au><au><snm>Akcali</snm><fnm>Z</fnm></au><au><snm>Unsal</snm><fnm>D</fnm></au></aug><source>Neoplasma</source><pubdate>2004</pubdate><volume>51</volume><fpage>269</fpage><lpage>274</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15254658</pubid></xrefbib></bibl><bibl id="B67"><title><p>GM-CSF promotes differentiation of human dendritic cells and T lymphocytes toward a predominantly type 1 proinflammatory response.</p></title><aug><au><snm>Eksioglu</snm><fnm>EA</fnm></au><au><snm>Mahmood</snm><fnm>SS</fnm></au><au><snm>Chang</snm><fnm>M</fnm></au><au><snm>Reddy</snm><fnm>V</fnm></au></aug><source>Exp Hematol</source><pubdate>2007</pubdate><volume>35</volume><fpage>1163</fpage><lpage>1171</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.exphem.2007.05.001</pubid><pubid idtype="pmpid" link="fulltext">17562355</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>Prognostic significance of MMP-1 and MMP-3 functional promoter polymorphisms in colorectal cancer.</p></title><aug><au><snm>Zinzindohoue</snm><fnm>F</fnm></au><au><snm>Lecomte</snm><fnm>T</fnm></au><au><snm>Ferraz</snm><fnm>JM</fnm></au><au><snm>Houllier</snm><fnm>AM</fnm></au><au><snm>Cugnenc</snm><fnm>PH</fnm></au><au><snm>Berger</snm><fnm>A</fnm></au><au><snm>Blons</snm><fnm>H</fnm></au><au><snm>Laurent-Puig</snm><fnm>P</fnm></au></aug><source>Clin Cancer Res</source><pubdate>2005</pubdate><volume>11</volume><fpage>594</fpage><lpage>599</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">15701845</pubid></xrefbib></bibl><bibl id="B69"><title><p>Overexpression of Reg IV in colorectal adenoma.</p></title><aug><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Lai</snm><fnm>M</fnm></au><au><snm>Lv</snm><fnm>B</fnm></au><au><snm>Gu</snm><fnm>X</fnm></au><au><snm>Wang</snm><fnm>H</fnm></au><au><snm>Zhu</snm><fnm>Y</fnm></au><au><snm>Zhu</snm><fnm>Y</fnm></au><au><snm>Shao</snm><fnm>L</fnm></au><au><snm>Wang</snm><fnm>G</fnm></au></aug><source>Cancer Lett</source><pubdate>2003</pubdate><volume>200</volume><fpage>69</fpage><lpage>76</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0304-3835(03)00460-9</pubid><pubid idtype="pmpid" link="fulltext">14550954</pubid></pubidlist></xrefbib></bibl><bibl id="B70"><title><p>TNF-alpha activates MUC2 transcription via NF-kappaB but inhibits via JNK activation.</p></title><aug><au><snm>Ahn</snm><fnm>DH</fnm></au><au><snm>Crawley</snm><fnm>SC</fnm></au><au><snm>Hokari</snm><fnm>R</fnm></au><au><snm>Kato</snm><fnm>S</fnm></au><au><snm>Yang</snm><fnm>SC</fnm></au><au><snm>Li</snm><fnm>JD</fnm></au><au><snm>Kim</snm><fnm>YS</fnm></au></aug><source>Cell Physiol Biochem</source><pubdate>2005</pubdate><volume>15</volume><fpage>29</fpage><lpage>40</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000083636</pubid><pubid idtype="pmpid" link="fulltext">15665513</pubid></pubidlist></xrefbib></bibl><bibl id="B71"><title><p>Expression of a novel carbonic anhydrase, CA XIII, in normal and neoplastic colorectal mucosa.</p></title><aug><au><snm>Kummola</snm><fnm>L</fnm></au><au><snm>Hala</snm><fnm>J</fnm></au><au><snm>Kivelamainen</snm><fnm>JM</fnm></au><au><snm>Kivela</snm><fnm>AJ</fnm></au><au><snm>Saarnio</snm><fnm>J</fnm></au><au><snm>Karttunen</snm><fnm>T</fnm></au><au><snm>Parkkila</snm><fnm>S</fnm></au></aug><source>BMC Cancer</source><pubdate>2005</pubdate><volume>5</volume><fpage>41</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2407-5-41</pubid><pubid idtype="pmcid">1097719</pubid><pubid idtype="pmpid" link="fulltext">15836783</pubid></pubidlist></xrefbib></bibl><bibl id="B72"><title><p>Differential expression of genes encoding tight junction proteins in colorectal cancer: frequent dysregulation of claudin-1, -8 and -12.</p></title><aug><au><snm>Gropcke</snm><fnm>S</fnm></au><au><snm>Mannone</snm><fnm>J</fnm></au><au><snm>Weber</snm><fnm>B</fnm></au><au><snm>Staub</snm><fnm>E</fnm></au><au><snm>Heinze</snm><fnm>M</fnm></au><au><snm>Klaman</snm><fnm>I</fnm></au><au><snm>Pilarsky</snm><fnm>C</fnm></au><au><snm>Hermann</snm><fnm>K</fnm></au><au><snm>Castanos-Velez</snm><fnm>E</fnm></au><au><snm>Ropcke</snm><fnm>S</fnm></au><au><snm>Mann</snm><fnm>B</fnm></au><au><snm>Rosenthal</snm><fnm>A</fnm></au><au><snm>Buhr</snm><fnm>HJ</fnm></au></aug><source>Int J Colorectal Dis</source><pubdate>2007</pubdate><volume>22</volume><fpage>651</fpage><lpage>659</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00384-006-0197-3</pubid><pubid idtype="pmpid" link="fulltext">17047970</pubid></pubidlist></xrefbib></bibl><bibl id="B73"><title><p>Interleukin-1 receptor antagonist gene polymorphism a gs in human colorectal cancer.</p></title><aug><au><snm>Viet</snm><fnm>HT</fnm></au><au><snm>Wagsater</snm><fnm>D</fnm></au><au><snm>Hugander</snm><fnm>A</fnm></au><au><snm>Dimberg</snm><fnm>J</fnm></au></aug><source>Oncol Rep</source><pubdate>2005</pubdate><volume>14</volume><fpage>915</fpage><lpage>918</lpage><xrefbib><pubid idtype="pmpid">16142351</pubid></xrefbib></bibl><bibl id="B74"><title><p>Beta2-microglobulin mutations in microsatellite unstable colorectal tumors.</p></title><aug><au><snm>Kloor</snm><fnm>M</fnm></au><au><snm>Michel</snm><fnm>S</fnm></au><au><snm>Buckowitz</snm><fnm>B</fnm></au><au><snm>Ruschoff</snm><fnm>J</fnm></au><au><snm>Buttner</snm><fnm>R</fnm></au><au><snm>Holinski-Feder</snm><fnm>E</fnm></au><au><snm>Dippold</snm><fnm>W</fnm></au><au><snm>Wagner</snm><fnm>R</fnm></au><au><snm>Tariverdian</snm><fnm>M</fnm></au><au><snm>Benner</snm><fnm>A</fnm></au><au><snm>Schwitalle</snm><fnm>Y</fnm></au><au><snm>Kuchenbuch</snm><fnm>B</fnm></au><au><snm>von Knebel Doeberitz</snm><fnm>M</fnm></au></aug><source>Int J Cancer</source><pubdate>2007</pubdate><volume>121</volume><fpage>454</fpage><lpage>458</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/ijc.22691</pubid><pubid idtype="pmpid" link="fulltext">17373663</pubid></pubidlist></xrefbib></bibl><bibl id="B75"><title><p>Hypermethylation and silencing of the putative tumor suppressor <it>Tazarotene-induced gene </it>1 in human cancers.</p></title><aug><au><snm>Youssef</snm><fnm>EM</fnm></au><au><snm>Chen</snm><fnm>Xq</fnm></au><au><snm>Higuchi</snm><fnm>E</fnm></au><au><snm>Kondo</snm><fnm>Y</fnm></au><au><snm>Garcia-Manero</snm><fnm>G</fnm></au><au><snm>Lotan</snm><fnm>R</fnm></au><au><snm>Issa</snm><fnm>JPJ</fnm></au></aug><source>Cancer Res</source><pubdate>2004</pubdate><volume>64</volume><fpage>2411</fpage><lpage>2417</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-03-0164</pubid><pubid idtype="pmpid" link="fulltext">15059893</pubid></pubidlist></xrefbib></bibl><bibl id="B76"><title><p>Genetic disregulation of gene coding tumor necrosis factor alpha receptors (TNFalpha Rs) in colorectal cancer cells.</p></title><aug><au><snm>Muc-Wierzgon</snm><fnm>M</fnm></au><au><snm>Nowakowska-Zajdel</snm><fnm>E</fnm></au><au><snm>Kokot</snm><fnm>T</fnm></au><au><snm>Kozowicz</snm><fnm>A</fnm></au><au><snm>Zubelewicz</snm><fnm>B</fnm></au><au><snm>Klakla</snm><fnm>K</fnm></au><au><snm>Mazurek</snm><fnm>U</fnm></au><au><snm>Cholewa</snm><fnm>K</fnm></au><au><snm>Wilczok</snm><fnm>T</fnm></au><au><snm>Wierzgon</snm><fnm>J</fnm></au><au><snm>Sosada</snm><fnm>K</fnm></au></aug><source>J Exp Clin Cancer Res</source><pubdate>2004</pubdate><volume>23</volume><fpage>651</fpage><lpage>660</lpage><xrefbib><pubid idtype="pmpid">15743036</pubid></xrefbib></bibl><bibl id="B77"><title><p>Expression of intercellular adhesion molecule-1 and prognosis in colorectal cancer.</p></title><aug><au><snm>Maeda</snm><fnm>K</fnm></au><au><snm>Kang</snm><fnm>SM</fnm></au><au><snm>Sawada</snm><fnm>T</fnm></au><au><snm>Nishiguchi</snm><fnm>Y</fnm></au><au><snm>Yashiro</snm><fnm>M</fnm></au><au><snm>Ogawa</snm><fnm>Y</fnm></au><au><snm>Ohira</snm><fnm>M</fnm></au><au><snm>Ishikawa</snm><fnm>T</fnm></au><au><snm>Hirakawa</snm><fnm>YS</fnm></au><au><snm>Chung</snm><fnm>K</fnm></au></aug><source>Oncol Rep</source><pubdate>2002</pubdate><volume>9</volume><fpage>511</fpage><lpage>514</lpage><xrefbib><pubid idtype="pmpid">11956618</pubid></xrefbib></bibl><bibl id="B78"><title><p>Prognostic significance of adiponectin levels in non-metastatic colorectal cancer.</p></title><aug><au><snm>Ferroni</snm><fnm>P</fnm></au><au><snm>Palmirotta</snm><fnm>R</fnm></au><au><snm>Spila</snm><fnm>A</fnm></au><au><snm>Martini</snm><fnm>F</fnm></au><au><snm>Raparelli</snm><fnm>V</fnm></au><au><snm>Fossile</snm><fnm>E</fnm></au><au><snm>Mariotti</snm><fnm>S</fnm></au><au><snm>Del Monte</snm><fnm>G</fnm></au><au><snm>Buonomo</snm><fnm>O</fnm></au><au><snm>Roselli</snm><fnm>M</fnm></au><au><snm>Guadagni</snm><fnm>F</fnm></au></aug><source>Anticancer Res</source><pubdate>2007</pubdate><volume>27</volume><fpage>483</fpage><lpage>489</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17348431</pubid></xrefbib></bibl><bibl id="B79"><title><p>Expression and role of thrombospondin-1 in colorectal cancer.</p></title><aug><au><snm>Miyanaga</snm><fnm>K</fnm></au><au><snm>Kato</snm><fnm>Y</fnm></au><au><snm>Nakamura</snm><fnm>T</fnm></au><au><snm>Matsumura</snm><fnm>M</fnm></au><au><snm>Amaya</snm><fnm>H</fnm></au><au><snm>Horiuchi</snm><fnm>T</fnm></au><au><snm>Chiba</snm><fnm>Y</fnm></au><au><snm>Tanaka</snm><fnm>K</fnm></au></aug><source>Anticancer Res</source><pubdate>2002</pubdate><volume>22</volume><fpage>3941</fpage><lpage>3948</lpage><xrefbib><pubid idtype="pmpid">12553016</pubid></xrefbib></bibl><bibl id="B80"><title><p>Relationship between tissue factor expression and hepatic metastasis and prognosis in rectal cancer.</p></title><aug><au><snm>Wan</snm><fnm>Y</fnm></au><au><snm>Wu</snm><fnm>N</fnm></au><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Ju</snm><fnm>X</fnm></au><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Liu</snm><fnm>Y</fnm></au><au><snm>Tang</snm><fnm>J</fnm></au><au><snm>Huang</snm><fnm>Y</fnm></au></aug><source>Zhonghua Zhong Liu Za Zhi</source><pubdate>2002</pubdate><volume>24</volume><fpage>378</fpage><lpage>380</lpage><xrefbib><pubid idtype="pmpid">12408769</pubid></xrefbib></bibl><bibl id="B81"><title><p>Polymorphisms in the cytochrome P450 genes CYP1A2, CYP1B1, CYP3A4, CYP3A5, CYP11A1, CYP17A1, CYP19A1 and colorectal cancer risk.</p></title><aug><au><snm>Bethke</snm><fnm>L</fnm></au><au><snm>Webb</snm><fnm>E</fnm></au><au><snm>Sellick</snm><fnm>G</fnm></au><au><snm>Rudd</snm><fnm>M</fnm></au><au><snm>Penegar</snm><fnm>S</fnm></au><au><snm>Withey</snm><fnm>L</fnm></au><au><snm>Qureshi</snm><fnm>M</fnm></au><au><snm>Houlston</snm><fnm>R</fnm></au></aug><source>BMC Cancer</source><pubdate>2007</pubdate><volume>7</volume><fpage>123</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2407-7-123</pubid><pubid idtype="pmcid">1925111</pubid><pubid idtype="pmpid" link="fulltext">17615053</pubid></pubidlist></xrefbib></bibl><bibl id="B82"><title><p>The expression and regulation of ADAMTS-1, -4, -5, -9, and -15, and TIMP-3 by TGFbeta1 in prostate cells: relevance to the accumulation of versican.</p></title><aug><au><snm>Cross</snm><fnm>NA</fnm></au><au><snm>Chandrasekharan</snm><fnm>S</fnm></au><au><snm>Jokonya</snm><fnm>N</fnm></au><au><snm>Fowles</snm><fnm>A</fnm></au><au><snm>Hamdy</snm><fnm>FC</fnm></au><au><snm>Buttle</snm><fnm>DJ</fnm></au><au><snm>Eaton</snm><fnm>CL</fnm></au></aug><source>Prostate</source><pubdate>2005</pubdate><volume>63</volume><fpage>269</fpage><lpage>275</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/pros.20182</pubid><pubid idtype="pmpid" link="fulltext">15599946</pubid></pubidlist></xrefbib></bibl><bibl id="B83"><title><p>Immunohistochemical expression of tumor antigens MAGE-A1, MAGE-A3/4, and NY-ESO-1 in cancerous and benign prostatic tissue.</p></title><aug><au><snm>Hudolin</snm><fnm>T</fnm></au><au><snm>Juretic</snm><fnm>A</fnm></au><au><snm>Spagnoli</snm><fnm>GC</fnm></au><au><snm>Pasini</snm><fnm>J</fnm></au><au><snm>Bandic</snm><fnm>D</fnm></au><au><snm>Heberer</snm><fnm>M</fnm></au><au><snm>Kosicek</snm><fnm>M</fnm></au><au><snm>Cacic</snm><fnm>M</fnm></au></aug><source>Prostate</source><pubdate>2006</pubdate><volume>66</volume><fpage>13</fpage><lpage>18</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/pros.20312</pubid><pubid idtype="pmpid" link="fulltext">16114059</pubid></pubidlist></xrefbib></bibl><bibl id="B84"><title><p>Aminopeptidase N regulated by zinc in human prostate participates in tumor cell invasion.</p></title><aug><au><snm>Ishii</snm><fnm>K</fnm></au><au><snm>Usui</snm><fnm>S</fnm></au><au><snm>Sugimura</snm><fnm>Y</fnm></au><au><snm>Yoshida</snm><fnm>S</fnm></au><au><snm>Hioki</snm><fnm>T</fnm></au><au><snm>Tatematsu</snm><fnm>M</fnm></au><au><snm>Yamamoto</snm><fnm>H</fnm></au><au><snm>Hirano</snm><fnm>K</fnm></au></aug><source>Int J Cancer</source><pubdate>2001</pubdate><volume>92</volume><fpage>49</fpage><lpage>54</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/1097-0215(200102)9999:9999&lt;::AID-IJC1161&gt;3.0.CO;2-S</pubid><pubid idtype="pmpid" link="fulltext">11279605</pubid></pubidlist></xrefbib></bibl><bibl id="B85"><title><p>Brn-3a neuronal transcription factor functional expression in human prostate cancer.</p></title><aug><au><snm>Diss</snm><fnm>JK</fnm></au><au><snm>Faulkes</snm><fnm>DJ</fnm></au><au><snm>Walker</snm><fnm>MM</fnm></au><au><snm>Patel</snm><fnm>A</fnm></au><au><snm>Foster</snm><fnm>CS</fnm></au><au><snm>Budhram-Mahadeo</snm><fnm>V</fnm></au><au><snm>Djamgoz</snm><fnm>MB</fnm></au><au><snm>Latchman</snm><fnm>DS</fnm></au></aug><source>Prostate Cancer Prostatic Dis</source><pubdate>2006</pubdate><volume>9</volume><fpage>83</fpage><lpage>91</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/sj.pcan.4500837</pubid><pubid idtype="pmpid" link="fulltext">16276351</pubid></pubidlist></xrefbib></bibl><bibl id="B86"><title><p>Functional characterization of the GDEP promoter and three enhancer elements in retinoblastoma and prostate cell lines.</p></title><aug><au><snm>Cross</snm><fnm>DS</fnm></au><au><snm>Burmester</snm><fnm>JK</fnm></au></aug><source>Med Oncol</source><pubdate>2008</pubdate><volume>25</volume><fpage>40</fpage><lpage>49</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s12032-007-0038-4</pubid><pubid idtype="pmpid" link="fulltext">18188713</pubid></pubidlist></xrefbib></bibl><bibl id="B87"><title><p>T-cell receptor gamma chain alternate reading frame protein (TARP) expression in prostate cancer cells leads to an increased growth rate and induction of caveolins and amphiregulin.</p></title><aug><au><snm>Wolfgang</snm><fnm>CD</fnm></au><au><snm>Essand</snm><fnm>M</fnm></au><au><snm>Lee</snm><fnm>B</fnm></au><au><snm>Pastan</snm><fnm>I</fnm></au></aug><source>Cancer Res</source><pubdate>2001</pubdate><volume>61</volume><fpage>8122</fpage><lpage>8126</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11719440</pubid></xrefbib></bibl><bibl id="B88"><title><p>Characterization of ZAG protein expression in prostate cancer using a semi-automated microscope system.</p></title><aug><au><snm>Descazeaud</snm><fnm>A</fnm></au><au><snm>de la Taille</snm><fnm>A</fnm></au><au><snm>Allory</snm><fnm>Y</fnm></au><au><snm>Faucon</snm><fnm>H</fnm></au><au><snm>Salomon</snm><fnm>L</fnm></au><au><snm>Bismar</snm><fnm>T</fnm></au><au><snm>Kim</snm><fnm>R</fnm></au><au><snm>Hofer</snm><fnm>MD</fnm></au><au><snm>Chopin</snm><fnm>D</fnm></au><au><snm>Abbou</snm><fnm>CC</fnm></au><au><snm>Rubin</snm><fnm>MA</fnm></au></aug><source>Prostate</source><pubdate>2006</pubdate><volume>66</volume><fpage>1037</fpage><lpage>1043</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1002/pros.20405</pubid><pubid idtype="pmpid" link="fulltext">16598739</pubid></pubidlist></xrefbib></bibl><bibl id="B89"><title><p>Fibrinogen synthesized by cancer cells augments the proliferative effect of fibroblast growth factor-2 (FGF-2).</p></title><aug><au><snm>Sahni</snm><fnm>A</fnm></au><au><snm>Simpson-Haidaris</snm><fnm>PJ</fnm></au><au><snm>Sahni</snm><fnm>SK</fnm></au><au><snm>Vaday</snm><fnm>GG</fnm></au><au><snm>Francis</snm><fnm>CW</fnm></au></aug><source>J Thromb Haemost</source><pubdate>2008</pubdate><volume>6</volume><fpage>176</fpage><lpage>183</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17949478</pubid></xrefbib></bibl><bibl id="B90"><title><p>The tumor metastasis suppressor gene Drg-1 down-regulates the expression of activating transcription factor 3 in prostate cancer.</p></title><aug><au><snm>Bandyopadhyay</snm><fnm>S</fnm></au><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Zhan</snm><fnm>R</fnm></au><au><snm>Pai</snm><fnm>SK</fnm></au><au><snm>Watabe</snm><fnm>M</fnm></au><au><snm>Iiizumi</snm><fnm>M</fnm></au><au><snm>Furuta</snm><fnm>E</fnm></au><au><snm>Mohinta</snm><fnm>S</fnm></au><au><snm>Liu</snm><fnm>W</fnm></au><au><snm>Hirota</snm><fnm>S</fnm></au><au><snm>Hosobe</snm><fnm>S</fnm></au><au><snm>Tsukada</snm><fnm>T</fnm></au><au><snm>Miura</snm><fnm>K</fnm></au><au><snm>Takano</snm><fnm>Y</fnm></au><au><snm>Saito</snm><fnm>K</fnm></au><au><snm>Commes</snm><fnm>T</fnm></au><au><snm>Piquemal</snm><fnm>D</fnm></au><au><snm>Hai</snm><fnm>T</fnm></au><au><snm>Watabe</snm><fnm>K</fnm></au></aug><source>Cancer Res</source><pubdate>2006</pubdate><volume>66</volume><fpage>11983</fpage><lpage>11990</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1158/0008-5472.CAN-06-0943</pubid><pubid idtype="pmpid" link="fulltext">17178897</pubid></pubidlist></xrefbib></bibl></refgrp>
   </bm>
</art>