Important update (January 20th, 2011): the data below have been corrected for the BCR batch which is not necessarily the processing batch. The dataset needs to be reanalyzed.
Batch vs clinical traits
Number of clinical traits: 57, number of theoretical DNA methylation batches: 20
Correlation of batch with the center
Correlation with clinical traits (complete table is here)
UCEC,DataType,NumberOfNAs,Test,Pvalue
histological_type,factor,69,Pearson's Chi-squared test,9.48E-27
year_of_initial_pathologic_diagnosis,integer,68,Kruskal-Wallis rank sum test,1.18E-17
days_to_form_completion,integer,68,Kruskal-Wallis rank sum test,3.78E-15
prosp_tissue_coll,factor,68,Pearson's Chi-squared test,9.22E-15
retro_tissue_coll,factor,68,Pearson's Chi-squared test,6.46E-14
tumor_grade,factor,73,Pearson's Chi-squared test,9.17E-11
days_to_last_followup,integer,84,Kruskal-Wallis rank sum test,1.15E-10
surgical_approach,factor,72,Pearson's Chi-squared test,5.31E-04
followup_met_assessment_outcome_success_margin_status,factor,126,Pearson's Chi-squared test,1.07E-03
total_pelv_lnr,integer,80,Kruskal-Wallis rank sum test,1.21E-03
first_pathologic_diagnosis_biospecimen_acquisition_method_type,factor,70,Pearson's Chi-squared test,1.53E-03
peritoneal_wash,factor,85,Pearson's Chi-squared test,1.11E-02
vital_status,factor,68,Pearson's Chi-squared test,2.98E-02
days_to_birth,integer,68,Kruskal-Wallis rank sum test,6.14E-02
age_at_initial_pathologic_diagnosis,integer,68,Kruskal-Wallis rank sum test,6.22E-02
person_neoplasm_cancer_status,factor,97,Pearson's Chi-squared test,6.44E-02
total_aor.lnp,integer,137,Kruskal-Wallis rank sum test,6.49E-02
total_aor_lnr,integer,83,Kruskal-Wallis rank sum test,8.01E-02
weight,integer,72,Kruskal-Wallis rank sum test,8.12E-02
Batch vs survival
DNA methylation
27k, M value, didn't split into red and green. Had to remove two arrays that had NA value for unmethylated or methylated probe intensities (TCGA-A5-A0VQ-01A-11D-A10Q-05,TCGA-BS-A0UF-01A-11D-A10Q-05). Ended up with 115 arrays total.