lung dataset

Lung cancer data set.

Lung cancer data set.

Gene expression data for lung cancer classification between two classes: adenocarcinoma (ADCA); malignant pleural mesothe-lioma (MPM). The lung data set contains 181 tissue samples (150 ADCA and 31 MPM). Each sample is described by 12533 genes. data

data(lung)

Format

A matrix with 12534 rows (12533 rows show the gene expressions for 181 tissue samples, reported in columns, while the last row reports the corresponding sample's class label). The samples class's label coded as follows:

  • 1: adenocarcinoma sample (ADCA).
  • 2: malignant pleural mesothe-lioma sample (MPM).

Source

http://cilab.ujn.edu.cn/datasets.htm

References

Gordon GJ, Jensen RV, Hsiao L-L, Gullans SR, Blumenstock JE, Ramaswamy S, Richards WG, Sugarbaker DJ, Bueno R. (2002) Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer research: 62(17), 4963-4967.

Examples

data(lung) str(lung)
  • Maintainer: Osama Mahmoud
  • License: GPL (>= 2)
  • Last published: 2014-09-15

About the dataset

  • Number of rows: 12534
  • Number of columns: 181
  • Class: matrix, array

Column names and types (First 10)

  • sample 1:numeric
  • sample 2:numeric
  • sample 3:numeric
  • sample 4:numeric
  • sample 5:numeric
  • sample 6:numeric
  • sample 7:numeric
  • sample 8:numeric
  • sample 9:numeric
  • sample 10:numeric