Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Experimental data is often shared in a variety of formats. Carefully choosing a data format is a great way to extend the impact of your research, by ensuring others can use it in the future (see Raw Data vs Results for a deeper discussion of this concept). In the table below, we’ve provided a quick reference .

On this page, we begin by defining the difference between raw data and results, and then we provide a quick reference reference table for different types of data that you might be sharing: sharing.

Raw data vs. results

From a reusability perspective, data is the most useful to future users. Both results and data can be shared, but data is more important for reproducibility and reuse.

We consider data to be raw or partially processed information from a single sample, depending on the type of experiment.

Results are generally post-analysis information from an aggregate of samples or manuscript figures.

For example, if you are sharing gene expression information, raw data would be the raw, zipped, fastq.gz files, while differential expression analysis and volcano plots would be considered results. This distinction is well defined for many types of data, but for assays we encounter less often this may be less clear. "Results" might also be acceptable for assays that do not lend themselves to re-analysis, such as western blotting. We can work with you to help figure this out.

Data reference table

Assay

Preferred file formats

RNA-seq

.fastq.gz, .bam, .cram

microarray

platform dependent  (e.g. .cel, .idat)

whole-genome sequencing

.fastq.gz, .bam, .cram

whole-exome sequencing

.fastq.gz, .bam, .cram (.vcf, .maf acceptable with justification)

methylation data

Platform dependent: .idat for Illumina arrays, .fastq.gz for sequencing

western blotting

Individual images or final figure (e.g. samp1.png or samp1.tif)

microscopy

Microscope’s native imaging format (e.g. nd2, abi), OME-TIFF, lossless image files (e.g. .tif, .png) separated by channel 

In vitro drug screening data

A .csv/.tsv file following this template (see instructions)

In vivo tumor growth experiments

A .csv/.tsv file following this template (see instructions)

PK/PD data

.csv/.tsv file - no template exists

proteomics

platform dependent