Original Article

A Five-Gene Signature and Clinical Outcome in Non–Small-Cell Lung Cancer

List of authors.
  • Hsuan-Yu Chen, M.Sc.,
  • Sung-Liang Yu, Ph.D.,
  • Chun-Houh Chen, Ph.D.,
  • Gee-Chen Chang, M.D., Ph.D.,
  • Chih-Yi Chen, M.D.,
  • Ang Yuan, M.D., Ph.D.,
  • Chiou-Ling Cheng, M.Sc.,
  • Chien-Hsun Wang, M.Sc.,
  • Harn-Jing Terng, Ph.D.,
  • Shu-Fang Kao, M.Sc.,
  • Wing-Kai Chan, M.D.,
  • Han-Ni Li, M.Sc.,
  • Chun-Chi Liu, M.Sc.,
  • Sher Singh, Ph.D.,
  • Wei J. Chen, M.D., Sc.D.,
  • Jeremy J.W. Chen, Ph.D.,
  • and Pan-Chyr Yang, M.D., Ph.D.

Abstract

Background

Current staging methods are inadequate for predicting the outcome of treatment of non–small-cell lung cancer (NSCLC). We developed a five-gene signature that is closely associated with survival of patients with NSCLC.

Methods

We used computer-generated random numbers to assign 185 frozen specimens for microarray analysis, real-time reverse-transcriptase polymerase chain reaction (RT-PCR) analysis, or both. We studied gene expression in frozen specimens of lung-cancer tissue from 125 randomly selected patients who had undergone surgical resection of NSCLC and evaluated the association between the level of expression and survival. We used risk scores and decision-tree analysis to develop a gene-expression model for the prediction of the outcome of treatment of NSCLC. For validation, we used randomly assigned specimens from 60 other patients.

Results

Sixteen genes that correlated with survival among patients with NSCLC were identified by analyzing microarray data and risk scores. We selected five genes (DUSP6, MMD, STAT1, ERBB3, and LCK) for RT-PCR and decision-tree analysis. The five-gene signature was an independent predictor of relapse-free and overall survival. We validated the model with data from an independent cohort of 60 patients with NSCLC and with a set of published microarray data from 86 patients with NSCLC.

Conclusions

Our five-gene signature is closely associated with relapse-free and overall survival among patients with NSCLC.

Introduction

Lung cancer — predominantly non–small-cell lung cancer (NSCLC) — is the most common cause of death from cancer worldwide.1 The relapse rate among patients with early-stage NSCLC is 40% within 5 years after potentially curative treatment.2 The current staging system for NSCLC is inadequate for predicting the outcome of treatment.

Gene-expression profiling (see Glossary) by means of microarrays3,4 and reverse-transcriptase polymerase chain reaction (RT-PCR)5,6 is useful for classifying tumors and formulating a prognosis for patients with various types of cancer,7-9 including lung cancer.10-16 The use of microarrays in clinical practice is limited, however, by the large number of genes used in gene profiling,17 the need for complicated methods, and the lack of both reproducibility and independent validation. The genes selected for profiling in studies of lung cancer have varied considerably; only a few genes have been consistently included.10-13 Moreover, gene-expression profiles can vary according to the microarray platform and the analytic strategy used.6

The RT-PCR method can be applied to paraffin-embedded pathological specimens and is reproducible and applicable in clinical practice. However, RT-PCR can be used to analyze only a small number of genes.17 In a previous study, our group performed microarray analysis of cell lines derived from specimens of invasive NSCLC and identified 672 genes associated with invasive activity.18 We also identified genes (CRMP-1 and HLJ1) that are associated with clinical outcome of patients with NSCLC.19,20 A recent study showed that the results of RT-PCR analysis of eight genes correlated with the outcomes of patients with adenocarcinoma of the lung.5

In the current study, we examined gene expression in 125 surgical specimens of NSCLC, using microarrays and real-time RT-PCR in order to identify a gene signature that is correlated with the clinical outcome.

Methods

Patients and Tissue Specimens

We used computer-generated random numbers to assign specimens from 185 consecutive patients for microarray analysis. We studied frozen specimens of lung-cancer tissue from 125 randomly selected patients who underwent surgical resection of NSCLC at the Taichung Veterans General Hospital between December 1999 and December 2003. Of these 125 specimens, 60 were adenocarcinomas, 52 were squamous-cell carcinomas, and 13 were other types of cancer. We validated the five-gene risk-prediction model using an independent cohort of 60 randomly selected patients who underwent surgical resection of NSCLC at the Taichung Veterans General Hospital between November 1999 and December 2003. The patients had not received adjuvant chemotherapy. The study was approved by the institutional review board of the hospital. Written informed consent was obtained from all patients.

Microarray Analysis of Complementary DNA

The 672 genes associated with invasive activity, identified in a previous study by our group,18 were rearrayed in duplicate on a nylon membrane. We isolated 4 μg of total RNA from each specimen, amplified it using an amplification kit (Ambion), and labeled it with digoxigenin during reverse transcription.21 The details of target preparation, hybridization, color development, image analysis, and spot quantification have been described previously.18,21,22

RT-PCR Analysis

To validate the levels of expression of genes found on microarray analysis, RT-PCR was performed on 16 genes and a control gene for TATA-box–binding protein (TBP), with the use of specific TaqMan probes and primer sets; the transcripts were amplified with reagent (TaqMan One-Step RT-PCR Master Mix Reagent, Applied Biosystems) and a sequence detection system (ABI Prism 7900HT, Applied Biosystems). Gene expression was quantified in relation to the expression of TBP with the use of sequence detector software and the relative quantification method (Applied Biosystems) (for details, see the Methods section of the Supplementary Appendix, available with the full text of this article at www.nejm.org). We chose TBP as the internal control for real-time RT-PCR because it is invariant in clinical cancer specimens.23

Statistical Analysis

The 125 specimens were randomly assigned to either the training set or the testing set (see Table 1 of the Supplementary Appendix). The average intensity for each gene in the microarray was assessed. To reduce variation among microarrays, the intensity values for samples in each microarray were rescaled by means of a quantile normalization method.24 To reduce background noise, background intensity values of less than 3000 were assigned the value of 3000.22 Each intensity value was then log-transformed to a base-2 scale. Genes with coefficients of variation of less than 3% were excluded from further analyses. Finally, the gene-expression intensity values were transformed to ordinal coding values, according to the ranking of the level of gene expression among the 485 genes in 125 patients (60,625 observations). The intensity value was coded as 1 for expression levels ranked as at or below the 25th percentile of the total gene expression, 2 for levels above the 25th and at or below the 50th percentiles, 3 for levels above the 50th and at or below the 75th percentiles, and 4 for levels above the 75th percentile.

Hazard ratios from univariate Cox regression analysis were used to determine which genes were associated with death from any cause or recurrence of cancer. Protective genes were defined as those associated with a hazard ratio for death of less than 1; risk genes were defined as those associated with a hazard ratio for death of more than 1. We used univariate Cox proportional-hazards regression analysis to evaluate the association between survival and the level of expression of each gene from microarray analysis.25 For genes that were significantly correlated with survival, we used a linear combination of the gene-expression coding values weighted by the regression coefficients to calculate a risk score for each patient.6,10

16-Gene Signature

Risk scores were calculated for 16 genes. A patient's risk score was calculated as the sum of the levels of expression of each gene, as measured by microarray analysis, multiplied by the corresponding regression coefficients (see the Methods section of the Supplementary Appendix). Patients were classified as having a high-risk gene signature or a low-risk gene signature, with the 50th percentile (median) of the risk score as the threshold value (median, 4.9; range, 1.3 to 21.9). The median risk score was chosen as the threshold value to reflect the fact that almost half of patients with early-stage NSCLC relapse within 5 years after potentially curative surgery2 and also in order to eliminate the effect of extreme values in the training cohort by ensuring that there were equal numbers of patients in the high-risk and low-risk groups. The risk scores and the threshold value derived from the training cohort were not reestimated but were applied directly to the testing cohort.

Five-Gene Signature

The levels of expression of the 16 genes were confirmed by RT-PCR and indexed by Spearman's rank-correlation test.26 From these 16 genes, we further identified five genes that were significantly associated with survival. The levels of expression of these five genes, as measured by RT-PCR, were used to construct the recursive-partitioning decision tree.27,28 Avadis software29,30 (Strand Genomic) was then used to classify patients as having a high-risk gene signature or a low-risk gene signature on the basis of the decision tree.

Our rationale for using a decision tree based on RT-PCR rather than on microarray analysis was practicality. RT-PCR uses a small number of genes to capture the relevant covariate structure, especially the complex interaction and nonlinearity of levels of gene expression.28 In our univariate-splitting tree, only one of the five genes was used to make a splitting decision at each intermediate node. To avoid overfitting, we used a pruning method called minimum error (see the Methods section and Figure 1 of the Supplementary Appendix).

The Kaplan–Meier method was used to estimate overall survival and relapse-free survival. Differences in survival between the high-risk group and the low-risk group were analyzed with the log-rank test. Multivariate Cox proportional-hazards regression analysis with stepwise selection was used to evaluate independent prognostic factors associated with survival, and the five-gene signature, age, sex, tumor stage, and histologic characteristics were used as covariates. A P value of less than 0.05 was considered to indicate statistical significance, and all tests were two-tailed.

We also studied an independent cohort of 60 patients who underwent surgical resection of NSCLC at the Taichung Veterans General Hospital between November 1999 and December 2003. This cohort was used to validate our five-gene risk-prediction model.

To further validate our model, we applied it to microarray data from 86 patients with NSCLC, reported by Beer et al.10 (available at http://dot.ped.med.umich.edu:2000/ourimage/pub/Lung/index.html). The five genes (and their corresponding Affymetrix probe sets) were DUSP6 (X93920_at), MMD (X85750_at), STAT1 (M97936_at), ERBB3 (S61953_at), and LCK (M26692_s_at); the control gene was TBP (X54993_s_at). To make the levels of gene expression from the microarrays and from RT-PCR comparable, we log-transformed the microarray data to a base-2 scale after assigning a value of 1.1 to intensity values of less than 1.1. After log transformation, the levels of expression of the five genes were divided by the level of expression of the control gene TBP in order to calculate the relative level of expression. We applied the decision-tree model to these relative levels of expression, using the data from 86 patients with NSCLC.10 Because the maximum follow-up time for the survival analysis in our study was 62 months, we used the 5-year survival data for the 86 patients.

Results

The 16-Gene Signature and Survival

Table 1. Table 1. Hazard Ratios for Death from Any Cause for the 125 Patients with NSCLC and Results of Validation of the 16-Gene Signature.

On microarray analysis of tumors from the 125 patients, 485 of 672 genes had a coefficient of variation greater than 3% and were thus included in the analyses. Hazard ratios from the univariate Cox regression analysis showed that the levels of expression of 16 genes correlated with death from any cause: 4 were protective genes (associated with a hazard ratio of less than 1) and 12 were risk genes (associated with a hazard ratio of more than 1 (Table 1).

Figure 1. Figure 1. The 16-Gene Signature and Survival of 125 Patients with NSCLC.

Panel A shows the gene-expression profiles of the tumor specimens (according to the color scale shown); each column represents an individual patient. The magnitude of the corresponding risk scores is represented by the slope of the red triangle. Also shown are Kaplan–Meier estimates of overall and relapse-free survival according to the 16-gene microarray signature in the training cohort (Panel B) and the testing cohort (Panel C).

Table 1 of the Supplementary Appendix lists the characteristics of the 125 patients in the first analysis. Among the 63 patients in the training cohort, tumors with high risk scores expressed risk genes, whereas tumors with low risk scores expressed protective genes (Figure 1A). Patients with a high-risk 16-gene signature had a lower median overall survival than those with a low-risk 16-gene signature (20 months vs. not reached) (Figure 1B). Tumors with a high-risk gene signature were associated with a lower median relapse-free survival than tumors with a low-risk gene signature (12 months vs. not reached) (Figure 1B). The median duration of follow-up in the training cohort was 20 months.

Results in the testing cohort were similar to those in the training cohort. Among the 62 patients, tumors with high risk scores expressed risk genes, whereas tumors with low risk scores expressed protective genes (Figure 1A). Patients with a high-risk 16-gene signature had a lower median overall survival than those with a low-risk gene signature (Figure 1C). Tumors with a high-risk gene signature were associated with a lower median relapse-free survival than tumors with a low-risk gene signature (18 months vs. not reached) (Figure 1C). The median duration of follow-up in the testing cohort was 18 months. Our entire microarray data set is available online (www.ncbi.nlm.nih.gov/projects/geo/) under the data series accession number GSE4882.

The Five-Gene Signature and Survival

There was a significant correlation between the results of microarray and RT-PCR analyses for the gene-expression data for 5 of the 16 genes in 101 of the 125 tumor specimens (Table 1). These five genes were for dual-specificity phosphatase 6 (DUSP6), monocyte-to-macrophage differentiation-associated protein (MMD), signal transducer and activator of transcription 1 (STAT1), v-erb-b2 avian erythroblastic leukemia viral oncogene homolog 3 (ERBB3), and lymphocyte-specific protein tyrosine kinase (LCK).

Table 2. Table 2. Clinical Characteristics of the Original and Validation Cohorts.

We identified 59 patients with high-risk gene signatures and 42 with low-risk gene signatures, according to gene expression as measured with RT-PCR and decision-tree analysis (see Figure 1 of the Supplementary Appendix). The structure of the decision tree was based on the threshold of expression of each of the five genes, as automatically determined according to a recursive-partition algorithm. The use of this algorithm resulted in the most accurate separation of patients with a high-risk signature from those with a low-risk signature. Table 2 summarizes the clinical characteristics of the 101 patients, hereafter called the original cohort, according to their five-gene signatures. The five-gene signature was strongly associated with overall survival (sensitivity, 98%; specificity, 93%; positive predictive value, 95%; negative predictive value, 98%; and overall accuracy, 96%).

Figure 2. Figure 2. Kaplan–Meier Estimates of Survival of Patients with NSCLC According to the Five-Gene Signatures as Measured by RT-PCR.

Overall survival and relapse-free survival are shown for the 101 patients with NSCLC (Panel A and Panel B, respectively) and for the 59 patients with stage I or II disease (Panel C and Panel D, respectively). Overall survival is also shown for the independent cohort of 60 patients (Panel E), for the 42 patients in this cohort who had stage I or II disease (Panel F), and for the 86 patients described in an independent set of published NSCLC microarray data10(Panel G).

The median follow-up of the 101 patients was 20 months. The patients with a high-risk gene signature had a shorter median overall survival than the patients with a low-risk gene signature (20 months vs. 40 months, P<0.001 by the log-rank test) (Figure 2A). The high-risk gene signature was associated with a median relapse-free survival of 13 months, whereas the low-risk gene signature was associated with a median relapse-free survival of 29 months (P=0.002 by the log-rank test) (Figure 2B).

Table 3. Table 3. Hazard Ratios for Death from Any Cause Among Patients with NSCLC, According to Multivariate Cox Regression Analysis.

According to Cox multivariate regression analysis, the high-risk five-gene signature, tumor stage III, and older age were significantly associated with death from any cause among the 101 patients (Table 3), and the high-risk five-gene signature and tumor stage III were significantly associated with recurrence of cancer as well (hazard ratio for the high-risk signature vs. the low-risk signature, 1.92; 95% confidence interval [CI], 1.06 to 3.46; P=0.03; hazard ratio for stage III vs. stage I or II disease, 2.28; 95% CI, 1.33 to 3.91; P=0.003). In a subgroup analysis of 59 patients with stage I or II disease, those with a high-risk gene signature had a shorter overall survival and a shorter relapse-free survival than those with a low-risk gene signature (Figure 2C and 2D, respectively).

Validation of the Five-Gene Signature

The clinical characteristics of the 60 patients in the validation cohort are listed in Table 2. The median duration of follow-up was 17 months. Patients with a high-risk gene signature had a shorter median overall survival than those with a low-risk gene signature (21 months vs. not reached) (Figure 2E). According to Cox multivariate regression analysis, the five-gene signature was significantly associated with overall survival (Table 3).

We analyzed the five-gene signatures in tumor specimens obtained from patients in the validation cohort with stage I or stage II disease both together and separately. Among patients with stage I or II disease combined, those with a high-risk gene signature had a shorter overall survival than those with a low-risk gene signature (Figure 2F). Among patients with stage I disease, low-risk gene signatures were associated with a longer overall survival than were high-risk gene signatures (P=0.02 by the log-rank test). Among patients with stage II disease, overall survival did not differ significantly between those with high-risk and those with low-risk gene signatures, probably owing to the small number of patients.

We also validated the five-gene signature in an independent set of microarray data from 86 patients from a Western population with NSCLC.10 Table 2 of the Supplementary Appendix lists the clinical characteristics of these 86 patients according to their five-gene signatures. The patients with high-risk gene signatures had a shorter overall survival than did those with low-risk gene signatures (Figure 2G) (P=0.06 by the log-rank test). According to Cox multivariate regression analysis, the high-risk five-gene signature and tumor stage III were significantly associated with death from any cause (Table 3).

Discussion

NSCLC is a heterogeneous disease. Even in patients with similar clinical and pathological features, the outcome varies: some are cured, whereas in others, the cancer recurs. Staging systems for lung cancer that are based on clinical and pathological findings may have reached their limit of usefulness for predicting outcomes, but molecular methods add value. Gene-expression profiling with the use of microarrays3,4 or PCR5,6 has been shown to estimate the prognosis for patients with lung cancer accurately.10-16 However, the use of microarrays in clinical practice is limited by the large number of genes in the analysis,17 complicated methods, lack of reproducibility and independent validation of the results, and the need for fresh-frozen tissue.17 RT-PCR involving a small number of genes may be a more clinically useful method. It allows for accurate and reproducible quantification of results for RNA obtained from small amounts of paraffin-embedded specimens.17,31 The results of RT-PCR performed on 8 genes, selected from a total of 45, have recently been shown to correlate with the outcomes of lung adenocarcinoma.5

We identified an RT-PCR–based five-gene signature (including DUSP6, MMD, STAT1, ERBB3, and LCK) using risk scores based on microarray and decision-tree analyses of 125 frozen tumor specimens from patients with NSCLC. The specimens were randomly divided into a training set (63 specimens) and a testing set (62 specimens). The presence of a high-risk five-gene signature in the NSCLC tumors was associated with an increased risk of recurrence and decreased overall survival.

Our selection of genes in the microarray training set was validated in the microarray testing set, and the patterns of gene expression found on microarray analysis were validated by RT-PCR. Our results were also validated in an independent cohort of 60 patients who were treated at the Taichung Veterans General Hospital. These results in our Chinese patients were also validated with the use of a set of published NSCLC microarray data from patients from a Western population with NSCLC. Thus, we believe that the data we obtained using the five-gene signature are reliable.

The identification of five genes that are closely associated with the outcomes in patients with NSCLC has clinical implications. Cisplatin-based adjuvant chemotherapy is effective in some patients with NSCLC.32 We propose that patients who have tumors with a high-risk gene signature could benefit from this type of adjuvant therapy, whereas those with a low-risk gene signature could be spared what may be unnecessary treatment. Prospective, large-scale, multicenter studies are necessary to test this idea.

The identification of five genes that can predict the clinical outcome in patients with NSCLC may reveal targets for the development of therapy for lung cancer. STAT1 causes arrested growth and apoptosis in many types of cancer cells by inducing the expression of p21WAF1 and caspase.33,34 MMD is preferentially expressed in mature macrophages.35 Our group has shown that macrophage activation promotes cancer metastasis,22 although the function of the MMD protein is unknown. DUSP6 inactivates extracellular signal-regulated kinase 2 (ERK2) (also known as mitogen-activated protein kinase 1 [MAPK1]), resulting in tumor suppression and apoptosis.36 ERBB3, a member of the epidermal growth factor receptor family of tyrosine kinases, can shorten cell survival.37 LCK, a member of the Src family of protein tyrosine kinases, is expressed mainly in T cells and is one of the first signaling molecules downstream of the T-cell receptor. It plays a key role not only in the differentiation and activation of T cells but also in the induction of apoptosis.38 In addition, LCK is expressed in many cancers and regulates the mobility of cancer cells.39,40

In conclusion, the five-gene expression signature we identified is closely associated with the clinical outcome in patients with surgically resected NSCLC. This signature could be useful in stratifying patients according to risk in trials of adjuvant treatment of the disease.

Funding and Disclosures

Supported by grants from the National Research Program for Genomic Medicine of the National Science Council of the Republic of China (NSC94-3112-B002-013-Y) and from Advpharma.

Dr. Terng reports being an employee of Advpharma. No other potential conflict of interest relevant to this article was reported.

Drs. W.J. Chen, J.J.W. Chen, and P.C. Yang contributed equally to this article.

Author Affiliations

From National Taiwan University College of Public Health (H.-Y.C., W.J.C.), National Taiwan University College of Medicine (H.-Y.C., S.-L.Y., C.-L.C., C.-H.W., S.-F.K., H.-N.L., S.S., W.J.C., J.J.W.C., P.-C.Y.), Academia Sinica (C.-H.C, P.-C.Y.), National Taiwan University Hospital (A.Y., W.-K.C., P.-C.Y.), and Advpharma (H.-J.T.) — all in Taipei, Taiwan; and Taichung Veterans General Hospital (G.-C.C., C.-Y.C.) and National Chung-Hsing University (G.-C.C., C.-C.L., J.J.W.C.) — both in Taichung, Taiwan.

Address reprint requests to Dr. Yang at the Department of Internal Medicine, National Taiwan University Hospital, No. 7, Chung-Shan S. Rd., Taipei, Taiwan 100, or at .

Supplementary Material

Glossary

Decision tree
A statistical tool for predicting which patient belongs to which specific class (e.g., good or poor clinical outcome) on the basis of feature information (gene-expression levels), with the use of a recursive-partitioning process and tree-based classification rules.

Gene-expression profiling
Determination of the level of expression of thousands of genes simultaneously by DNA microarray or real-time RT-PCR.

High-risk gene signature
Aberrant expression of a panel of genes in tissue that signifies a high risk of an adverse outcome (relapse or death in patients with cancer).

Independent cohort
An independent group of patients having clinical characteristics similar to those of an original group of patients in a study. The independent cohort is used to confirm the findings of the original study.

Risk gene
A gene for which altered expression in the tissue of interest is associated with an increased risk of an adverse clinical outcome (relapse or death in patients with cancer).

Risk score
A score that predicts the likelihood of an individual patient's survival on the basis of statistical analysis of risk factors (the expression levels of risk genes) associated with survival.

References (40)

  1. 1. Jemal A, Siegel R, Ward E, et al. Cancer statistics, 2006. CA Cancer J Clin 2006;56:106-130

  2. 2. Hoffman PC, Mauer AM, Vokes EE. Lung cancer. Lancet 2000;355:479-485[Erratum, Lancet 2000;355:1280.]

  3. 3. Hoheisel JD. Microarray technology: beyond transcript profiling and genotype analysis. Nat Rev Genet 2006;7:200-210

  4. 4. Schena M, Shalon D, Davis RW, Brown PO. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 1995;270:467-470

  5. 5. Endoh H, Tomida S, Yatabe Y, et al. Prognostic model of pulmonary adenocarcinoma by expression profiling of eight genes as determined by quantitative real-time reverse transcriptase polymerase chain reaction. J Clin Oncol 2004;22:811-819

  6. 6. Lossos IS, Czerwinski DK, Alizadeh AA, et al. Prediction of survival in diffuse large-B-cell lymphoma based on the expression of six genes. N Engl J Med 2004;350:1828-1837

  7. 7. Ramaswamy S, Ross KN, Lander ES, Golub TR. A molecular signature of metastasis in primary solid tumors. Nat Genet 2003;33:49-54

  8. 8. Golub TR, Slonim DK, Tamayo P, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 1999;286:531-537

  9. 9. Ludwig JA, Weinstein JN. Biomarkers in cancer staging, prognosis and treatment selection. Nat Rev Cancer 2005;5:845-856

  10. 10. Beer DG, Kardia SL, Huang CC, et al. Gene-expression profiles predict survival of patients with lung adenocarcinoma. Nat Med 2002;8:816-824

  11. 11. Bhattacharjee A, Richards WG, Staunton J, et al. Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. Proc Natl Acad Sci U S A 2001;98:13790-13795

  12. 12. Garber ME, Troyanskaya OG, Schluens K, et al. Diversity of gene expression in adenocarcinoma of the lung. Proc Natl Acad Sci U S A 2001;98:13784-13789[Erratum, Proc Natl Acad Sci U S A 2002;99:1098.]

  13. 13. Wigle DA, Jurisica I, Radulovich N, et al. Molecular profiling of non-small cell lung cancer and correlation with disease-free survival. Cancer Res 2002;62:3005-3008

  14. 14. Gordon GJ, Jensen RV, Hsiao LL, et al. Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer Res 2002;62:4963-4967

  15. 15. Granville CA, Dennis PA. An overview of lung cancer genomics and proteomics. Am J Respir Cell Mol Biol 2005;32:169-176

  16. 16. Tsao MS, Sakurada A, Cutz JC, et al. Erlotinib in lung cancer -- molecular and clinical predictors of outcome. N Engl J Med 2005;353:133-144

  17. 17. Ramaswamy S. Translating cancer genomics into clinical oncology. N Engl J Med 2004;350:1814-1816

  18. 18. Chen JJ, Peck K, Hong TM, et al. Global analysis of gene expression in invasion by a lung cancer model. Cancer Res 2001;61:5223-5230

  19. 19. Shih JY, Yang SC, Hong TM, et al. Collapsin response mediator protein-1 and the invasion and metastasis of cancer cells. J Natl Cancer Inst 2001;93:1392-1400

  20. 20. Wang CC, Tsai MF, Hong TM, et al. The transcriptional factor YY1 upregulates the novel invasion suppressor HLJ1 expression and inhibits cancer cell invasion. Oncogene 2005;24:4081-4093

  21. 21. Chen JJ, Wu R, Yang PC, et al. Profiling expression patterns and isolating differentially expressed genes by cDNA microarray system with colorimetry detection. Genomics 1998;51:313-324

  22. 22. Chen JJ, Lin YC, Yao PL, et al. Tumor-associated macrophages: the double-edged sword in cancer progression. J Clin Oncol 2005;23:953-964

  23. 23. Bieche I, Onody P, Laurendeau I, et al. Real-time reverse transcription-PCR assay for future management of ERBB2-based clinical applications. Clin Chem 1999;45:1148-1156

  24. 24. Bolstad BM, Irizarry RA, Astrand M, Speed TP. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 2003;19:185-193

  25. 25. Cox DR. Regression models and life-tables. J R Stat Soc [B] 1972;34:187-220

  26. 26. Rosner R. Fundamentals of biostatistics. 5th ed. Pacific Grove, CA: Duxbury, 2000.

  27. 27. Zhang H, Yu CY, Singer B, Xiong M. Recursive partitioning for tumor classification with gene expression microarray data. Proc Natl Acad Sci U S A 2001;98:6730-6735

  28. 28. Koziol JA, Zhang JY, Casiano CA, et al. Recursive partitioning as an approach to selection of immune markers for tumor diagnosis. Clin Cancer Res 2003;9:5120-5126

  29. 29. Shafer JC, Agrawal R, Mehta M. SPRINT: a scalable parallel classifier for data mining. In: Proceedings of the 22nd International Conference on Very Large Databases, Bombay, India, September 3–6, 1996.

  30. 30. Avadis user manual. Bangalore, India: Strand Genomic Pvt. Ltd., 2004.

  31. 31. Bast RC Jr, Hortobagyi GN. Individualized care for patients with cancer -- a work in progress. N Engl J Med 2004;351:2865-2867

  32. 32. Arriagada R, Bergman B, Dunant A, et al. Cisplatin-based adjuvant chemotherapy in patients with completely resected non-small-cell lung cancer. N Engl J Med 2004;350:351-360

  33. 33. Yu H, Jove R. The STATs of cancer -- new molecular targets come of age. Nat Rev Cancer 2004;4:97-105

  34. 34. Kumar A, Commane M, Flickinger TW, Horvath CM, Stark GR. Defective TNF-alpha-induced apoptosis in STAT1-null cells due to low constitutive levels of caspases. Science 1997;278:1630-1632

  35. 35. Rehli M, Krause SW, Schwarzfischer L, Kreutz M, Andreesen R. Molecular cloning of a novel macrophage maturation-associated transcript encoding a protein with several potential transmembrane domains. Biochem Biophys Res Commun 1995;217:661-667

  36. 36. Furukawa T, Sunamura M, Motoi F, Matsuno S, Horii A. Potential tumor suppressive pathway involving DUSP6/MKP-3 in pancreatic cancer. Am J Pathol 2003;162:1807-1815

  37. 37. Muller-Tidow C, Diederichs S, Bulk E, et al. Identification of metastasis-associated receptor tyrosine kinases in non-small cell lung cancer. Cancer Res 2005;65:1778-1782

  38. 38. Zamoyska R, Basson A, Filby A, Legname G, Lovatt M, Seddon B. The influence of the src-family kinases, Lck and Fyn, on T cell differentiation, survival and activation. Immunol Rev 2003;191:107-118

  39. 39. Mahabeleshwar GH, Das R, Kundu GC. Tyrosine kinase, p56lck-induced cell motility, and urokinase-type plasminogen activator secretion involve activation of epidermal growth factor receptor/extracellular signal regulated kinase pathways. J Biol Chem 2004;279:9733-9742

  40. 40. Mahabeleshwar GH, Kundu GC. Tyrosine kinase p56lck regulates cell motility and nuclear factor kappaB-mediated secretion of urokinase type plasminogen activator through tyrosine phosphorylation of IkappaBalpha following hypoxia/reoxygenation. J Biol Chem 2003;278:52598-52612

Citing Articles (590)

    Letters

    Figures/Media

    1. Table 1. Hazard Ratios for Death from Any Cause for the 125 Patients with NSCLC and Results of Validation of the 16-Gene Signature.
      Table 1. Hazard Ratios for Death from Any Cause for the 125 Patients with NSCLC and Results of Validation of the 16-Gene Signature.
    2. Figure 1. The 16-Gene Signature and Survival of 125 Patients with NSCLC.
      Figure 1. The 16-Gene Signature and Survival of 125 Patients with NSCLC.

      Panel A shows the gene-expression profiles of the tumor specimens (according to the color scale shown); each column represents an individual patient. The magnitude of the corresponding risk scores is represented by the slope of the red triangle. Also shown are Kaplan–Meier estimates of overall and relapse-free survival according to the 16-gene microarray signature in the training cohort (Panel B) and the testing cohort (Panel C).

    3. Table 2. Clinical Characteristics of the Original and Validation Cohorts.
      Table 2. Clinical Characteristics of the Original and Validation Cohorts.
    4. Figure 2. Kaplan–Meier Estimates of Survival of Patients with NSCLC According to the Five-Gene Signatures as Measured by RT-PCR.
      Figure 2. Kaplan–Meier Estimates of Survival of Patients with NSCLC According to the Five-Gene Signatures as Measured by RT-PCR.

      Overall survival and relapse-free survival are shown for the 101 patients with NSCLC (Panel A and Panel B, respectively) and for the 59 patients with stage I or II disease (Panel C and Panel D, respectively). Overall survival is also shown for the independent cohort of 60 patients (Panel E), for the 42 patients in this cohort who had stage I or II disease (Panel F), and for the 86 patients described in an independent set of published NSCLC microarray data10(Panel G).

    5. Table 3. Hazard Ratios for Death from Any Cause Among Patients with NSCLC, According to Multivariate Cox Regression Analysis.
      Table 3. Hazard Ratios for Death from Any Cause Among Patients with NSCLC, According to Multivariate Cox Regression Analysis.