M value

M value: why I decided to split the probes into "red" and "green"

SVD on the entire 27k by 511 patients M value matrix, plot 1st eigenarray (u matrix) and "color" the points according to the dye with with each CpG was labeled:

M value: analysis of red probes, identification of adjustment variables

Correlation of the red probes, first four eigengenes with the batch effect:

kruskal.test(mredsvd$v[,2],as.factor(batch))

        Kruskal-Wallis rank sum test

data:  mredsvd$v[, 2] and as.factor(batch)
Kruskal-Wallis chi-squared = 10.8903, df = 12, p-value = 0.5383

kruskal.test(mredsvd$v[,3],as.factor(batch))

        Kruskal-Wallis rank sum test

data:  mredsvd$v[, 3] and as.factor(batch)
Kruskal-Wallis chi-squared = 21.7447, df = 12, p-value = 0.04048

kruskal.test(mredsvd$v[,4],as.factor(batch))

        Kruskal-Wallis rank sum test

data:  mredsvd$v[, 4] and as.factor(batch)
Kruskal-Wallis chi-squared = 35.8388, df = 12, p-value = 0.0003439

After removing the batch effect p-value for the association of the first principal component with the batch is 0.7748.