Breast_A {fabia} | R Documentation |
Microarray data from Broad Institute “Cancer Program Data Sets” which was produced by van't Veer et al. 2002. Array S54 was removed because it is an outlier.
Goal was to find a gene signature to predict the outcome of a cancer therapy that is to predict whether metastasis will occur. A 70 gene signature has been discovered.
Here we want to find subclasses in the data set.
Hoshida et al. 2007 found 3 subclasses and verified that 50/61 cases from class 1 and 2 were ER positive and only in 3/36 from class 3.
bA
is the data set with
97 samples and 1213 genes,
bAc1
are the
three subclasses from Hoshida et al. 2007.
Breast_A
Matrix XBreast
: 97 samples and 1213 probe sets,
Vector CBreast
:
three subclasses from Hoshida
Broad Institute “Cancer Program Data Sets”: http://www.broadinstitute.org/cgi-bin/cancer/datasets.cgi
Hoshida Y, Brunet J-P, Tamayo P, Golub TR, Mesirov JP, ‘Subclass Mapping: Identifying Common Subtypes in Independent Disease Data Sets’, PLoS ONE 2(11): e1195, 2007.
van't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, et al. ‘Gene expression profiling predicts clinical outcome of breast cancer’, Nature 415:530-536, 2002.