Meeting objectives:
- What data can we access?
- How do we access it?
- What formats is it in?
- How is the data organized, including metadata and sample annotations, and how can we interact with it?
- Is it possible to drill down to something akin to a schema description of how the data are organized and begin exploring the database?
Notes on available sequencing data
TCGA samples sequenced: 11/1/2011
There are 4883 sequencing files, but an individual may have multiple data types (exome, genome, miRNA, or RNAseq) and occasionally more than one sequence file per data type. There are 1610 subjects with at least one type of data. I have attached the complete file, which also shows cancer type.
1610 Number of unique individuals with any type of sequencing data
1338 Number of unique individuals with exome data
99 Number of unique individuals with genome data
552 Number of unique individuals with miRNA data
306 Number of unique individuals with RNAseq data
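Note that the per-type counts sum to 2295 (1338 + 99 + 552 + 306), which is more than 1610 because an individual can appear under several data types. As a rough sketch of how counts like these could be recomputed from the attached file: the column layout of that file is not described in these notes, so the (subject ID, data type, file name) record shape, the function name, and the example rows below are all hypothetical placeholders, not real TCGA entries.

```python
from collections import defaultdict


def count_unique_subjects(records):
    """Tally files, unique subjects overall, and unique subjects per data type.

    records: iterable of (subject_id, data_type, file_name) tuples.
    This record shape is an assumption, not the actual file layout.
    """
    subjects_by_type = defaultdict(set)
    all_subjects = set()
    n_files = 0
    for subject_id, data_type, _file_name in records:
        # A set ignores repeats, so a subject with two exome files counts once.
        subjects_by_type[data_type].add(subject_id)
        all_subjects.add(subject_id)
        n_files += 1
    per_type = {t: len(s) for t, s in subjects_by_type.items()}
    return n_files, len(all_subjects), per_type


# Made-up rows for illustration only.
example_records = [
    ("subject-A", "exome", "a_exome_1.bam"),
    ("subject-A", "exome", "a_exome_2.bam"),
    ("subject-A", "RNAseq", "a_rnaseq.bam"),
    ("subject-B", "exome", "b_exome.bam"),
    ("subject-C", "miRNA", "c_mirna.bam"),
]

n_files, n_subjects, per_type = count_unique_subjects(example_records)
print(n_files, n_subjects, per_type)
```

In the made-up rows, subject-A has two exome files and one RNAseq file, so the file count exceeds the subject count and the per-type counts add up to more than the number of subjects, the same pattern as in the totals above.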