...
- Compute correlation coefficient matrix.
- Determine optimal value for the scale free exponent, beta, and collect regression statistics.
- Compute the topological overlap matrix (TOM).
- Perform hierarchical clustering of genes, based on TOM.
- Detect and label modules in TOM, using "Dynamic Tree Cutting".
- Merge modules based on hierarchical clustering of representative genes.
- Cluster samples hierarchically.
- Compute intra/inter-module network statistics, per gene.
- Produce diagnostic plots (dendrograms, heat maps, statistical scatter plots).
- Produce tabular output of module membership, network statistics, and scale-free regression statistics.
...
- leverage the UCLA-WGCNA package for the "common" steps, 1->5, gaining significant performance
- provide the user a parameter choice at step 5, to do "tree cutting" in the manner of the Sage algorithm, or in that of the UCLA-WGCNA algorithm
- provide two algorithms for step 6 (module merging), allowing a user to choose the Sage or UCLA-WGCNA algorithm
- leverage the UCLA-WGCNA dendrogram/module plotting algorithm in step 9
- maintain the Sage algorithms for the Sage-specific post-processing, i.e. step 7, statistics in step 8 , the and the heat maps in step 9.
External dependencies
...
module merging, by analyzing most-highly-connected genes in each module
fixed-cluster-number tree cutting for sample module definition
computation of within- and between- module per-gene connectivity statistics
...
Dataset | # Probes | # Samples | Sage Time | Sage Space | Package Time | Package Space | Sage beta | Package beta | Gene trees same? | Module difference (%) |
---|---|---|---|---|---|---|---|---|---|---|
Female mouse liver | 3600 | 135 | --- | --- | --- | --- | 6.5 | 6.5 | TRUE | 3.7% |
Cranio | 2534 | 249 | --- | --- | --- | --- | 4.0 | 4.5 | FALSE / TRUE* | 44% / 0.9%* |
Methylation, top 5K genes | 5000 | 555 | --- | --- | --- | --- | 8.5 | 8.5 | TRUE | 0 |
Colon cancer, top 5K genes | 5000 | 322 | --- | --- | --- | --- | 3 | 3.5 | __ / __* | 11% / 0.5%* |
Human liver cohort, top 5K genes | 5000 | 427 | --- | --- | --- | --- | 6.5 | 5.5 | __ / __* | 33% / 0* |
Methylation (full set)** | 27,578 | 555 | 24h:45m | 180GB | 13h:20m | 196GB | 8 | 11.5 | NO FALSE / ___* | 14% / __%* |
Colon cancer, top 40K genes** | 40,000 | 322 | Out of memory*** | --- | Out of memory*** | --- | --- | --- | --- | --- |
Human liver cohort** | 40,102 | 427 | Out of memory*** | --- | Out of memory*** | --- | --- | --- | --- | --- |
...