A module map showing conditional activity of expression modules in cancer

Segal, Eran; Friedman, Nir; Koller, Daphne; Regev, Aviv

doi:10.1038/ng1434

Letter
Published: 26 September 2004

A module map showing conditional activity of expression modules in cancer

Eran Segal¹^nAff4,
Nir Friedman²,
Daphne Koller¹ &
…
Aviv Regev³

Nature Genetics volume 36, pages 1090–1098 (2004)Cite this article

10k Accesses
537 Citations
13 Altmetric
Metrics details

Abstract

DNA microarrays are widely used to study changes in gene expression in tumors, but such studies are typically system-specific and do not address the commonalities and variations between different types of tumor. Here we present an integrated analysis of 1,975 published microarrays spanning 22 tumor types. We describe expression profiles in different tumors in terms of the behavior of modules, sets of genes that act in concert to carry out a specific function. Using a simple unified analysis, we extract modules and characterize gene-expression profiles in tumors as a combination of activated and deactivated modules. Activation of some modules is specific to particular types of tumor; for example, a growth-inhibitory module is specifically repressed in acute lymphoblastic leukemias and may underlie the deregulated proliferation in these cancers. Other modules are shared across a diverse set of clinical conditions, suggestive of common tumor progression mechanisms. For example, the bone osteoblastic module spans a variety of tumor types and includes both secreted growth factors and their receptors. Our findings suggest that there is a single mechanism for both primary tumor proliferation and metastasis to bone. Our analysis presents multiple research directions for diagnostic, prognostic and therapeutic studies.

You have full access to this article via your institution.

Download PDF

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Srinivas Niranj Chandrasekaran, Beth A. Cimini, … Anne E. Carpenter

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Qiuyue Yuan & Zhana Duren

Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis

Article Open access 25 March 2024

Wenpin Hou & Zhicheng Ji

Main

Cancer is a multifaceted phenomenon, originating in different tissues and involving disruptions of various cellular processes. Aberrations in regulation of key proliferation and survival pathways are common to all tumors, whereas alterations in other pathways may be specific to certain tumors. Understanding which mechanisms are general and which are specific has important therapeutic implications, but few studies^1,2,3,4 address this issue from a genome-wide perspective. Here, we used DNA microarray data in a comprehensive analysis aimed at identifying the shared and unique molecular 'modules' underlying human malignancies. Two recent studies^3,5 demonstrate the utility of similar approaches in the context of a single module. The result of our analysis is a global map showing the modules that are induced or repressed in a wide variety of clinical conditions.

We analyzed a 'cancer compendium' of expression profiles compiled from 26 studies (Supplementary Table 1 online), measuring the expression of 14,145 genes in 1,975 arrays spanning 17 categories (Fig. 1a). First, we organized genes into higher-level modules, and then we identified clinical conditions in which different modules are induced or repressed.

**Figure 1: Overview of the analysis procedure.**

We started by collecting 2,849 biologically meaningful gene sets, including clusters of coexpressed genes, genes expressed in specific tissue types⁶ and genes belonging to the same functional category or pathway^7,8,9 (Fig. 1b). We identified the arrays in which each gene set has a prominent expression signature by testing whether the expression of a statistically significant fraction of the genes in the set changed coordinately in the array (Fig. 1c,d). In our compendium, the change in expression of each gene in a given array is relative to the average expression of the gene across all arrays in the relevant data set.

Gene sets reflect biological modules only approximately. Only a subset of genes in a set may contribute to its expression signature, and different gene sets may have similar signatures across the arrays, owing to either an overlap between the gene sets or coregulation of nonoverlapping gene sets. When several gene sets (a cluster) have similar signatures, we extracted from this cluster a core module, which both refines the gene composition of each gene set and combines several related gene sets. This module more closely reflects the genes that participate in a specific biological process, as it consists of the genes whose expression profile corresponds to the signature of the cluster. Overall, we identified 456 statistically significant modules (Supplementary Note and Supplementary Fig. 1 online) that span various processes and functions, including metabolism, transcription, translation, degradation, cellular and neural signaling, growth, cell cycle, apoptosis and extracellular matrix and cytoskeleton components.

In the second step of our analysis, we used these modules to characterize clinical conditions according to the combination of modules that are activated and deactivated in them. Using information provided in the original studies, we annotated all the arrays with 263 biological and clinical conditions, including tissue and tumor type, diagnostic and prognostic information, and molecular markers. For each module and each condition, we tested whether the module was induced (or repressed) in a significant fraction of the arrays labeled with the condition. We distinguished between 'specific' and 'general' annotations: specific annotations are evaluated within each category, whereas general annotations are evaluated only relative to their lack of association with arrays from the other categories. We compiled the module-condition pairs into a global module map for cancer (Fig. 2).

Figure 2: The cancer module map: a matrix of modules (rows) versus array clinical conditions (columns), where a red (or green) entry indicates that the arrays in which the corresponding module was significantly induced (or repressed) contained more arrays with the given annotation than would be expected by chance.

The results must be interpreted with caution, because the biological interpretation of induction (or repression) of a module in a given condition depends on our choice of normalization (Supplementary Note online). In addition, interpretation may be confounded by combining diverse data sets, each normalized separately. To address this problem, we used annotations in a way that is strictly local to each category (Supplementary Note online) in the final analysis step, in which we paired modules with clinical annotations.

The module map shows that some modules (e.g., cell cycle; Fig. 3a) are shared across multiple tumor types and may be related to general tumorigenic processes, whereas others are more specific to the tissue origin or progression of particular tumors. For example, modules related to neural processes (e.g., #274 and #137) are repressed in a subset of brain tumors (relative to other central nervous system tumors), and an intermediate filament module (#357) is induced in squamous cell lung carcinomas and reduced in lung adenocarcinomas (both relative to other lung tumors), consistent with the idea that de-differentiation processes accompany tumorigenesis. Related modules, such as cell cycle modules (Fig. 3a), seem to form building blocks that are used together in different conditions. More specialized modules, such as signaling and growth regulatory modules (Fig. 3b,c), are used in distinct combinations by various tumors.

**Figure 3: Combinatorial signatures in the cancer module map.**

Conversely, the module map characterizes each condition by a particular combination of modules. For example, invasive hepatocellular carcinoma (HCC) is characterized by induction of cell cycle modules and repression of modules related to metabolism, detoxification, the extracellular matrix and signaling (relative to hepatitis-infected liver tissue and noninvasive HCC). Estrogen receptor–positive breast cancer is characterized by repression of modules containing keratins and other intermediate filaments (relative to other breast adenocarcinomas and human mammary epithelial cells). The map indicates that related conditions involve related modules, albeit in distinct ways (Fig. 3d,e). For example, various tumors of hematologic origin (Fig. 3d) involve similar immune, inflammation, growth regulation and signaling modules. The pattern of involvement separates different tumor types and subtypes.

Characterizing conditions in terms of modules provides important insights into the mechanisms underlying specific malignancies. For example, the growth inhibitory module (Fig. 4) consists primarily of growth suppressors (11 of 16) whose expression is coordinately repressed in a subset of acute leukemia arrays (relative to the leukemia category; 40 arrays; P < 4 × 10⁻²⁹). Some of these genes are direct (DUSP2 (ref. 10), DUSP4 (ref. 11), DUSP6 (ref. 12)) or indirect (RGS3 (ref. 13), RGS4 (ref. 14)) repressors of ERK1, an activator of cell proliferation (Fig. 4b) known to be constitutively active in acute leukemia¹⁰. Others (MAP3K7IP1 (also called TAB1; ref. 15) and GADD45G (ref. 16)) are activators of the apoptosis repressor p38 (Fig. 4b). Thus, the concerted downregulation of these growth suppressors may allow ERK1 and p38 to escape regulation, leading to uncontrolled proliferation and reduced cell death. DUSP2 has been implicated in acute leukemia¹⁰; the other genes may offer new therapeutic targets.

**Figure 4: Growth inhibitory module (#173), a module that responds significantly to one specific condition: acute leukemia.**

The steroid catabolism module (Fig. 5) primarily contains steroid hormone enzymes (8 of 13) whose expression is repressed in a subset of HCC and hepatic cell lines (relative to hepatitis-infected liver tissue and HCC; 31 arrays; P < 4 × 10⁻⁸). This may indicate more than a general reduction in metabolic processes. Expression of an additional module (#404), consisting of steroid hormone receptors (6 of 25 module genes) and binding proteins (15 of 25), is repressed in a subset of HCC and hepatic cell lines (relative to hepatitis-infected liver tissue and HCC; 24 arrays; P < 2.5 × 10⁻⁶). This reduction of steroid hormone catabolism in HCC is consistent with the fact that HCC is significantly more prevalent in men and postmenopausal women¹⁷ and that elevated levels of serum testosterone predict an increased HCC risk. Overall, these results suggest that an imbalance in the generation of steroid hormones and in receiving steroid hormone signals may have a role in hepatitis and HCC.

**Figure 5: Steroid catabolism module (#505), a module that responds significantly to one specific condition: liver tissue and tumor samples.**

Other modules provide insight into a variety of tumors. For example, the bone osteoblastic module (Fig. 6) consists of genes associated with proliferation and differentiation of bone-building cells. These genes are induced in 172 arrays, including a subset of breast cancer samples (relative to other breast cancer and human mammary epithelial cells; 37 arrays; P < 5.6 × 10⁻¹⁴) and a subset of nontumor hepatitis-infected liver (relative to other hepatitis-infected liver tissue and HCC; 47 arrays; P < 10⁻¹⁰). Expression of these genes is repressed in 361 arrays, including subsets of HCC (relative to other hepatitis-infected liver tissue and HCC; 48 arrays; P < 2 × 10⁻⁹), a subset of ALL1 acute lymphoblastic leukemia (relative to other acute lymphoblastic leukemia and acute myeloid leukemia; 10 arrays; P < 9 × 10⁻⁶) and a subset of lung cancer samples (relative to other lung cancers; 120 arrays; P < 10⁻³³).

**Figure 6: Bone osteoblastic module (#234), a module that responds significantly to multiple conditions, including breast cancer, lung cancer, HCC and ALL.**

Bone-related clinical conditions have been associated with all of these malignancies. In particular, bone metastasis is a key phenomenon in breast cancer, and some breast metastases are known to be osteoblastic¹⁸. Not all primary breast tumors activate the osteoblastic module, consistent with the fact that many breast metastases to bone are not osteoblastic¹⁸ and probably use different mechanisms¹⁹. Bone metastasis is also common in lung cancer¹⁸ and was recently implicated in HCC²⁰. Finally, ALL has been associated with reduced bone-mass density in a subpopulation of individuals²¹. The bone osteoblastic module reflects these diverse phenomena and may partially explain them. Although osteoblastic metastasis is also common in prostate cancer¹⁸, the module was not substantially expressed in the prostate cancer samples in our compendium. As several genes in the module that are known to be transcriptionally induced in prostate cancer (MGP, IGF2, IL6 and GHR) are not induced in this data set, we suspect that these arrays are uninformative about osteoblastic metastasis.

The induction of the bone osteoblastic module in breast cancer is particularly interesting. Previous studies suggested that breast tumors preferentially metastasize to bone owing to a cycle of positive feedback through reciprocal secretion of growth factors between the tumor and bone cells¹⁸. It was previously unclear, however, whether the molecular mechanisms necessary to initiate this cycle are present in the primary tumor¹⁹. We found that both the secreted growth factors and the intracellular proteins required to receive their signal were induced in primary breast cancer tumors, suggesting that the primary tumor uses the osteoblastic mechanism for its own paracrine proliferation. One might suspect that the module is induced in the surrounding stroma rather than in the tumor itself. Previous immunohistochemical and in situ hybridization experiments (Fig. 6d) indicate that 19 of the 32 module genes are expressed in epithelial cells in tumors and some also in metastasis of breast cancer to bone (e.g., IGF2 (ref. 18), BMP4 (ref. 18), IL6 (ref. 18), FRZB²² and activin A²³). Only 4 of 32 genes, all of which encode secreted proteins, are expressed solely in the stroma, indicative of possible paracrine signaling between tumor and breast stroma. This process may be subsequently substituted by signaling between the metastasized tumor and bone stroma. Thus, this borrowed module may both be innately useful to the primary tumor and provide a mechanism for effective osteoblastic bone metastasis. This hypothesis is consistent with recent findings on the metastatic potential of primary tumors^24,25 and identifies several new targets for further research.

The downregulation of the bone osteoblastic module in HCC, ALL and lung cancer is also notable. There is no clear explanation for this downregulation in lung and HCC tumors, but repression of this growth-inducing module in the ALL bone marrow samples provides a potential explanation for the reduced bone mass density in ALL. Dlx3 and Dlx5, two ALL-1 targets that are crucial to osteoblast proliferation and differentiation²⁶, are part of the module.

In conclusion, our method provides a global view of cancer and shows that tumors can be characterized by combinations of a relatively small number of modules. Several other methods have been proposed for global analysis of microarray data^27,28,29. Notably, our work, which is the first to apply such global analysis to human data, uses existing biological knowledge directly, in the form of gene sets and clinical annotations. Furthermore, unlike recent meta-analysis⁴ of a large compendium of cancer expression profiles, our approach focuses on identifying modules of genes and is independent of predefined queries (Supplementary Note online).

The results of our analysis are publicly available on a data-mining website; the automated tool that we used to generate the analysis is also available. This tool allows researchers to construct a module map from any collection of gene sets and expression data in any organism and to study new data in the context of a large compendium. Although the quality of current annotations and normalization procedures may limit the map's accuracy, our examples indicate that many phenomena are sufficiently robust to be detected using our approach. Thus, our approach provides a valuable tool for understanding the molecular basis of cancer, both for specific tumors and for tumorigenic processes in general.

Methods

DNA microarray data set.

We downloaded data available for 1,975 human DNA microarrays from the Stanford Microarray Database and the Center for Genomic Research at the Whitehead Institute (Supplementary Table 1 online). We normalized the expression of each gene g in every data set separately. For data sets generated using Affymetrix chips, we first determined the log (base 2) of the expression value of gene g in each array (truncating to 10 expression values that are below 10). For data sets generated using spotted cDNA chips, we used the log-ratio (base 2) between the measured sample and the control sample. In both types of data sets, we then normalized the (log-space) expression value of gene g in each array relative to its average expression in all the arrays in the same data set, by subtracting its average in that data set from each of its expression measurements. After this normalization, the mean value of a gene, in each data set, is zero.

Gene sets.

We compiled 2,849 gene sets, obtained as follows: 1,281 from the Gene Ontology⁸ hierarchy (downloaded on July 2003, version 1.320); 114 from the Kyoto Encyclopedia of Genes and Genomes⁷ (downloaded on May 2003); 53 from the Gene MicroArray Pathway Profiler⁹ (downloaded on July 2003); 101 tissue-specific expressed gene sets⁶ (one gene set was defined for each array by taking all genes above absolute expression of 400; we removed genes whose absolute expression was >400 in >50 of the 101 arrays); and 1,300 gene sets obtained by clustering each of the data sets of Supplementary Table 1 online using a published clustering method (the P-cluster algorithm²⁷) and taking clusters of coexpressed genes.

Identifying arrays in which the expression of gene sets changes significantly.

To identify the arrays in which each gene set was significantly induced (or repressed), we defined the induced (or repressed) genes in each array to be those genes whose change in expression was greater (or less) than twofold. For each gene set and each array, we calculated the fraction of genes from that gene set that were induced (or repressed) in that array and used the hypergeometric distribution to calculate a P value for this fraction (compared with the null hypothesis of choosing the same number genes at random). We corrected for multiple tests using the false discovery rate correction with 5% false rate.

Statistical significance of array–gene set pairs.

We evaluated the number of array–gene set pairs in which the gene set was significantly induced (or repressed) in the array (as described above). Overall, we found 299,233 such pairs; only 14,962 would be expected by chance (P < 0.05), suggesting that the selected gene sets are informative for the cancer compendium (Supplementary Fig. 2 online).

Automatic identification of gene set clusters.

We carried out (bottom-up) hierarchical clustering of the gene sets in the matrix of all significant array–gene set pairs³⁰. This resulted in a tree in which each leaf node, corresponding to some gene set G, is associated with a vector (indexed by arrays) that is zero everywhere except for entries that correspond to arrays in which set G was significantly induced (or repressed), in which case the entry contains the fraction (or negative fraction) of genes from set G that are induced (or repressed) in an array a. Each internal node is associated with a vector representing the average of all of the gene set vectors at its descendant leaves. We annotated each interior node with the Pearson correlation between the vectors associated with its two children in the hierarchy. We defined as a cluster each interior node whose Pearson correlation differed by more than 0.05 from the Pearson correlation of its parent node in the hierarchy, resulting in 577 clusters of gene sets. Such interior nodes represent points in the tree with a large gap between the similarities in expression of the node's children and the similarity in expression of the node and its sibling.

Testing consistency of a gene with expression of a gene set.

Given a gene set G and a gene g, we tested whether the expression of g was consistent with the significant changes in the expression of G. We first identified the subsets of arrays I and R in which G was significantly induced and repressed, respectively. We then measured the extent to which the expression of g changed by more (or less) than twofold in arrays in I (or R) with the score

,

where p_a is the fraction of genes in array a that are induced (or repressed) by more than twofold for arrays in I (or in R). This score assigns more weight to induction in arrays where there are fewer induced genes (and respectively for repression).

We evaluated the significance of the score for gene g with respect to the null hypothesis where the genes in each array are randomly permuted. Under this null hypothesis, the score for gene g is the sum of independent binary random variables, one for each array in I and R. The random variable corresponding to array a attains the value −log(p_a) with probability p_a and the value of 0 with probability 1 − p_a. Because the score for gene g in this model is a sum of independent random variables, its mean μ and variance σ² are the sum of the means and variances, respectively, of the these variables and can be computed analytically:

.

Moreover, by the central limit theorem, the distribution of the score for gene g under the null hypothesis can be closely approximated by a Gaussian distribution with mean μ and variance σ². We used standard methods for computing the tail probability of a Gaussian distribution to compute the probability of attaining a score as large as the observed score under the null hypothesis.

Deriving modules from clusters of gene sets.

For each cluster of gene sets, we defined G to be the union of the gene sets in the cluster. We then tested each gene in G for consistency (as described above). The resulting module consists of genes whose expression is significantly consistent with the expression of the gene set (after false discovery rate correction for multiple hypotheses using 5% false rate). Leave-one-out cross-validation analysis (Supplementary Note and Supplementary Fig. 1 online) showed that 456 of the 577 gene-set clusters were significant at P < 0.01. All further analysis was carried out only for the 456 modules derived from these 456 gene set clusters.

Enrichment of clinical annotations.

To characterize conditions as a combination of activated and deactivated modules, we associated each array with the annotations it represents, from a total of 263 clinical annotations that we compiled based on published studies (see our project website for the complete set of clinical annotations). We distinguished between 185 specific annotations (present in <70% of the arrays in a given category; Fig. 1a and project website) and 78 general annotations (present in 70% or more of the arrays in a category). For example, 'Stage T2' is a specific annotation in the 'lung cancer' category (12.6% of samples in this category), whereas 'lung cancer' is a general annotation (86% of the samples in the 'lung cancer' category). For each module and each annotation, we calculated the fraction of arrays associated with that annotation of the total number of arrays in which the module is significantly induced (or repressed) and used the hypergeometric distribution to calculate a P value for this fraction. For specific annotations, we only considered arrays in the same category when computing the P value. For general annotations, we considered all other arrays in the compendium as background (i.e., the other arrays were marked as not having the general annotation). In both cases, all annotations were strictly local (e.g., the lung cancer annotation in the lung cancer category is distinct from the lung cancer annotation in the 'various tumors' category and is reported separately). We carried out a false discovery rate correction for multiple hypotheses and took P < 0.05 to be significant in Figure 2.

GeneXPress.

We carried out all analysis and visualizations in GeneXPress. This tool can identify the arrays in which gene sets are significantly expressed, and the clinical annotations enriched in these significant arrays, and can be used for any input expression data and gene sets in any organism. GeneXPress is freely available for academic use.

URLs.

More detailed results, including the expression compendium, clinical annotations that we compiled and all the significant gene set–array pairs, viewable in GeneXPress, can be found on our project website (http://dags.stanford.edu/cancer). The website also contains detailed views of all 456 modules in the format of Figures 4,5,6, which can be searched and browsed in various ways. GeneXPress is freely available for academic use at http://GeneXPress.stanford.edu/. All expression data used is available from the Stanford Microarray Database (http://genome-www5.stanford.edu/Microarray/SMD/) and the Center for Genomic Research at the Whitehead Institute (http://www-genome.wi.mit.edu/cgi-bin/cancer/datasets.cgi).

References

Ramaswamy, S., Ross, K.N., Lander, E.S. & Golub, T.R. A molecular signature of metastasis in primary solid tumors. Nat. Genet. 33, 49–54 (2003).
Article CAS Google Scholar
Ramaswamy, S. et al. Multiclass cancer diagnosis using tumor gene expression signatures. Proc. Natl. Acad. Sci. USA 98, 15149–15154 (2001).
Article CAS Google Scholar
Lamb, J. et al. A mechanism of cyclin D1 action encoded in the patterns of gene expression in human cancer. Cell 114, 323–334 (2003).
Article CAS Google Scholar
Rhodes, D.R. et al. Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. Proc. Natl. Acad. Sci. USA 101, 9309–9314 (2004).
Article CAS Google Scholar
Mootha, V.K. et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat. Genet. 34, 267–723 (2003).
Article CAS Google Scholar
Su, A.I. et al. Large-scale analysis of the human and mouse transcriptomes. Proc. Natl. Acad. Sci. USA 99, 4465–4470 (2002).
Article CAS Google Scholar
Kanehisa, M., Goto, S., Kawashima, S. & Nakaya, A. The KEGG databases at GenomeNet. Nucleic Acids Res. 30, 42–46 (2002).
Article CAS Google Scholar
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29 (2000).
Article CAS Google Scholar
Dahlquist, K.D., Salomonis, N., Vranizan, K., Lawlor, S.C. & Conklin, B.R. GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways. Nat. Genet. 31, 19–20 (2002).
Article CAS Google Scholar
Kim, S.C. et al. Constitutive activation of extracellular signal-regulated kinase in human acute leukemias: combined role of activation of MEK, hyperexpression of extracellular signal-regulated kinase, and downregulation of a phosphatase, PAC1. Blood 93, 3893–3899 (1999).
CAS PubMed Google Scholar
Chu, Y., Solski, P.A., Khosravi-Far, R., Der, C.J. & Kelly, K. The mitogen-activated protein kinase phosphatases PAC1, MKP-1, and MKP-2 have unique substrate specificities and reduced activity in vivo toward the ERK2 sevenmaker mutation. J. Biol. Chem. 271, 6497–6501 (1996).
Article CAS Google Scholar
Furukawa, T., Sunamura, M., Motoi, F., Matsuno, S. & Horii, A. Potential tumor suppressive pathway involving DUSP6/MKP-3 in pancreatic cancer. Am. J. Pathol. 162, 1807–1815 (2003).
Article CAS Google Scholar
Leone, A.M., Errico, M., Lin, S.L. & Cowen, D.S. Activation of extracellular signal-regulated kinase (ERK) and Akt by human serotonin 5-HT(1B) receptors in transfected BE(2)-C neuroblastoma cells is inhibited by RGS4. J. Neurochem. 75, 934–938 (2000).
Article CAS Google Scholar
Shi, C.S. et al. Regulator of G-protein signaling 3 (RGS3) inhibits Gbeta1gamma 2-induced inositol phosphate production, mitogen-activated protein kinase activation, and Akt activation. J. Biol. Chem. 276, 24293–24300 (2001).
Article CAS Google Scholar
Ge, B. et al. TAB1beta (transforming growth factor-beta-activated protein kinase 1-binding protein 1beta), a novel splicing variant of TAB1 that interacts with p38alpha but not TAK1. J. Biol. Chem. 278, 2286–2293 (2003).
Article CAS Google Scholar
Mita, H., Tsutsui, J., Takekawa, M., Witten, E.A. & Saito, H. Regulation of MTK1/MEKK4 kinase activity by its N-terminal autoinhibitory domain and GADD45 binding. Mol. Cell. Biol. 22, 4544–4555 (2002).
Article CAS Google Scholar
Granata, O.M. et al. Altered androgen metabolism eventually leads hepatocellular carcinoma to an impaired hormone responsiveness. Mol. Cell. Endocrinol. 193, 51–58 (2002).
Article CAS Google Scholar
Mundy, G.R. Metastasis to bone: causes, consequences and therapeutic opportunities. Nat. Rev. Cancer 2, 584–593 (2002).
Article CAS Google Scholar
Kang, Y. et al. A multigenic program mediating breast cancer metastasis to bone. Cancer Cell 3, 537–459 (2003).
Article CAS Google Scholar
Iguchi, H. et al. A possible role of VEGF in osteolytic bone metastasis of hepatocellular carcinoma. J. Exp. Clin. Cancer Res. 21, 309–313 (2002).
CAS PubMed Google Scholar
Boot, A.M., van den Heuvel-Eibrink, M.M., Hahlen, K., Krenning, E.P. & de Muinck Keizer-Schrama, S.M. Bone mineral density in children with acute lymphoblastic leukaemia. Eur. J. Cancer 35, 1693–1697 (1999).
Article CAS Google Scholar
Ugolini, F. et al. Differential expression assay of chromosome arm 8p genes identifies Frizzled-related (FRP1/FRZB) and Fibroblast Growth Factor Receptor 1 (FGFR1) as candidate breast cancer genes. Oncogene 18, 1903–1910 (1999).
Article CAS Google Scholar
Reinholz, M.M., Iturria, S.J., Ingle, J.N. & Roche, P.C. Differential gene expression of TGF-beta family members and osteopontin in breast tumor tissue: analysis by real-time quantitative PCR. Breast Cancer Res. Treat. 74, 255–269 (2002).
Article CAS Google Scholar
Bernards, R. & Weinberg, R.A. A progression puzzle. Nature 418, 823 (2002).
Article CAS Google Scholar
Hynes, R.O. Metastatic potential: generic predisposition of the primary tumor or rare, metastatic variants-or both? Cell 113, 821–823 (2003).
Article CAS Google Scholar
Ferrari, N. et al. DLX genes as targets of ALL-1: DLX 2,3,4 down-regulation in t(4;11) acute lymphoblastic leukemias. J. Leukoc. Biol. 74, 302–305 (2003).
Article CAS Google Scholar
Segal, E. et al. Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat. Genet. 34, 166–176 (2003).
Article CAS Google Scholar
Ihmels, J. et al. Revealing modular organization in the yeast transcriptional network. Nat. Genet. 31, 370–377 (2002).
Article CAS Google Scholar
Tanay, A., Sharan, R., Kupiec, M. & Shamir, R. Revealing modularity and organization in the yeast molecular network by integrated analysis of highly heterogeneous genomewide data. Proc. Natl. Acad. Sci. USA 101, 2981–2986 (2004).
Article CAS Google Scholar
Eisen, M.B., Spellman, P.T., Brown, P.O. & Botstein, D. Cluster analysis and display of genome-wide expression patterns. Proc. Natl. Acad. Sci. USA 95, 14863–14868 (1998).
Article CAS Google Scholar

Download references

Acknowledgements

We thank J. Effrat, T. Fojo, Y. Friedman, A. Kaushal, W. Lu, T. Pham, M. Tong, and R. Yelensky for technical help with software and visualization and I. Ben-Porath, Y. Dor, L. Garwin, N. Kaminski, D. Pe'er, O. Rando and T. Raveh for comments on previous versions of this manuscript. E.S., N.F. and D.K. were supported by a National Science Foundation grant under the Information Technology Research program. E.S. was also supported by a Stanford Graduate Fellowship. N.F. was also supported by an Alon Fellowship, by the Harry & Abe Sherman Senior Lectureship in Computer Science and by the United States-Israel Bi-National Science Foundation grant. N.F. and A.R. were supported by a Center of Excellence Grant from the National Institute of General Medical Sciences. A.R. was also supported by the Bauer Center for Genomics Research.

Author information

Eran Segal
Present address: Center for Studies in Physics and Biology, The Rockefeller University, New York, New York, 10021, USA

Authors and Affiliations

Computer Science Department, Stanford University, Stanford, 94305, California, USA
Eran Segal & Daphne Koller
School of Computer Science and Engineering, Hebrew University, Jerusalem, 91904, Israel
Nir Friedman
Bauer Center for Genomics Research, Harvard University, Cambridge, 02138, Massachusetts, USA
Aviv Regev

Authors

Eran Segal
View author publications
You can also search for this author in PubMed Google Scholar
Nir Friedman
View author publications
You can also search for this author in PubMed Google Scholar
Daphne Koller
View author publications
You can also search for this author in PubMed Google Scholar
Aviv Regev
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Daphne Koller or Aviv Regev.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Segal, E., Friedman, N., Koller, D. et al. A module map showing conditional activity of expression modules in cancer. Nat Genet 36, 1090–1098 (2004). https://doi.org/10.1038/ng1434

Download citation

Received: 16 March 2004
Accepted: 25 August 2004
Published: 26 September 2004
Issue Date: 01 October 2004
DOI: https://doi.org/10.1038/ng1434

This article is cited by

Robust normalization and transformation techniques for constructing gene coexpression networks from RNA-seq data
- Kayla A. Johnson
- Arjun Krishnan
Genome Biology (2022)
ProFuMCell and ProModb: Web services for analyzing interaction-based functionally localized protein modules in a cell
- Barnali Das
- Pralay Mitra
Journal of Molecular Modeling (2022)
Tumor relevant protein functional interactions identified using bipartite graph analyses
- Divya Lakshmi Venkatraman
- Deepshika Pulimamidi
- Shubhada R. Hegde
Scientific Reports (2021)
Integration of machine learning and genome-scale metabolic modeling identifies multi-omics biomarkers for radiation resistance
- Joshua E. Lewis
- Melissa L. Kemp
Nature Communications (2021)
Integrating HSICBFO and FWSMOTE algorithm-prediction through risk factors in cervical cancer
- S. Geeitha
- M. Thangamani
Journal of Ambient Intelligence and Humanized Computing (2021)