Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This data set fails to complete, for both the Sage coexpression code and the UCLA-WGCNA code.  In the latter case the error is:

"Error in matrix(0, nBlockGenes, nBlockGenes) :   too many elements specified"

According to

http://stat.ethz.ch/R-manual/R-devel/library/base/html/Memory-limits.html

"On all builds of R, the maximum length (number of elements) of a vector is 2^31 - 1 ~ 2*10^9, as lengths are stored as signed integers.".

If an algorithm requires a matrix of size #probes x #probes (e.g. a correlation or TOM 'distance matrix' for clustering), then # probes <46340.

The code does run to completion for the UCLA-WGCNA code when using a preprocessing option to break up the genes into 'blocks' using K-means clustering.  Requesting a block size of 10,000 resulted in 6 blocks and 2h:47m of run time.