Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

File view driving this table: The following is a brief description of the relevant columns in the folder view below (by column header):

https://www.synapse.org/#!Synapse:syn9630847syn18488466/tables/

The following is a brief description of the relevant columns in the table below (by column header):

...

NOTE: Studies are selected in this view with is.dataset == TRUE 

NOTE: Please only display cards for which featured == TRUE

data portal name: name as it should appear in data portal

current synapse file view column: corresponding name (e.g., for SQL query) in Synapse table view

eventual synapse file view column: name (e.g., for SQL query) in Synapse table view that we will eventually migrate to

difference between current and eventual columns: as we migrate to GDC, we will put new annotation keys in "eventual" column names. for now, use "current."priority: 1 (i.e., top), 2 (i.e., bottom) key/annotation, or 3 (i.e., more ...).

facet: true if column_name should be faceted in data portal.

card location: primary (i.e., top) or secondary (i.e., bottom) key/annotation

The following annotations do not exist on file: 

The following annotations need to be "ported" to GDC:  

The following have been added to the synapse table, but are blank. They need to be filled in:


current
column namedata portal namesynapse file view columneventual synapse file view columnpriorityfacetshow on card–no,  primary, secondaryconceptexampleGDC equivalentfaceted on GDCfacet on CSBCGDC referencesizerestricted valuescommentsin AMP-AD portalin NF portal
SpeciesDataset namespeciesnamespecies1yesno










primaryidnoneyesScientific ThemeNAthemeyesprimarytumor-heterogeneitynoneyesData Categoryassaydata_categoryyesprimaryBroad categorization of the contents of the data file.
  • Transcriptome Profiling
data_categoryyesyesCSBC will need to add values to those in GDC (which only cover sequencing)Data TypeNAdata_typeno?noSpecific content type of the data file.
  • Exon Expression Quantification
  • Gene Expression Quantification
  • Isoform Expression Quantification
  • Splice Junction Quantification
data_typeyesyesCSBC will need to add values to those in GDC (which only cover sequencing)Data FormatfileFormatdata_formatno?noFormat of the data files.
  • CSV
  • HDF5
  • TSV
  • TXT
  • SRA XML
  • MAGE-TAB
  • SDRF
  • IDF
  • ADF
data_formatyesyesExperiment Strategyassayexperimental_strategyyesprimaryidname and download button should link to thisno










featuredfeaturedshould the card be displayed?no










Grant TypegrantTypeNAno
U54 or U01








GrantcenterNameNAno










ProgramconsortiumNAno
CSBC or PSON








Speciesspecies2yes

none
yes





ThemeTheme2yes
tumor-heterogeneitynone
yes





AssayexperimentalStrategy2yesThe sequencing strategy used to generate the data file.  REMOVE "sequencing" for CSBC.
  • RNA-Seq
  • Total RNA-Seq
experimental_strategyyesyes


CSBC will need to add values to those in GDC (which only cover sequencing)

file_namePlatformnoplatformprimaryThe name (or part of a name) of a file (of any type).file_namenonofile_sizenonoThe size of the data file (object) in bytes.file_sizenonomd5sumnonoThe 128-bit hash value expressed as a 32 digit hexadecimal number (in lower case) used as a file's digital fingerprint.md5sumnonoplatformyesno2yes

platformyesyesworkflow_typeNAnonoGeneric name for the workflow used to analyze a data set.
  • BWA
  • BWA with BQSR
  • BWA-aln
  • BWA-mem
  • BWA with Mark Duplicates and BQSR
yes??





Disease Typedisease_typeyesprimaryDisease TypetumorType2yesThe text term used to describe the type of malignant disease, as categorized by the World Health Organization's (WHO) International Classification of Diseases for Oncology (ICD-O).
  • Acinar Cell Neoplasms
  • Adenomas and Adenocarcinomas
  • Adnexal and Skin Appendage Neoplasms
  • Basal Cell Neoplasms
  • Blood Vessel Tumors
case/disease_typeyesyeshttps://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=caseTissuetissue_or_organ_of_originyesprimaryThe text term used to describe the anatomic site of origin, of the patient's malignant disease, as described by the World Health Organization's (WHO) International Classification of Diseases for Oncology (ICD-O).
  • Abdomen, NOS
  • Abdominal esophagus
  • Accessory sinus, NOS
  • Acoustic nerve
  • Adrenal gland, NOS
diagnosis/tissue_or_organ_of_originyesyeshttps://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=diagnosistissue_typenonoText term that represents a description of the kind of tissue collected with respect to disease status or proximity to tumor tissue.
  • Tumor
  • Normal
  • Abnormal
  • Peritumoral
  • Unknown
sample/tissue_typenoyeshttps://docs.gdc.cancer.gov/Data_Dictionary/viewer/#?view=table-definition-view&id=sample





GDC Data Dictionary viewer: https://docs.gdc.cancer.gov/Data_Dictionary/viewer/

...