Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Data Types in the scope of processing:

  • Whole exome sequencing dataExome Sequencing

  • Whole genome sequencing dataGenome Sequencing

  • Bulk RNA sequencing dataSequencing

  • Single cell RNA sequencing data

Eligibility criteria for data files to be staged for processing:

...

Raw data files present in the study must have file format annotated as “fastq” or “bam” or “cram”

Required annotations for RNA Sequencing

Annotation term

Additional Details

1

fileFormat

Accepted file formats include "fastq", "bam", and "cram". If provided raw data are “bam” or “cram” format, only files that have not undergone any additional filtering (i.e. retains unmapped reads, have not been trimmed, etc) will be eligible for processing.

...

Data files must originate from samples that were “flash frozen”. Data from FFPE samples are not eligible for processing due to possible degradation of genomic samples due to the preservation process.

...

Data files must have the required minimal annotations according to the NF-OSI data model (as suggested in the NF Data Curator App). These annotations include among others:

  • individualID

  • specimenID

  • assay

  • libraryPreparationMethod

  • Platform

  • isStranded

  • readPairOrientation

  • readPair

  • specimenPreparationMethod

  • isCellLine

  • isXenograft

  • species

...

Only raw data files uploaded to Synapse projects are eligible to be staged for processing at this time.

Ineligibility criteria for specific processing methods:

...

WES files are not eligible for structural variant calling or copy number variant calling

2

individualID

Individual IDs are necessary to create the sample sheets.

3

specimenID

specimen IDs are necessary to interpret the analysis.

4

Assay

Choose between Bulk RNA Seq or Single Cell RNA Seq.

5

Species

The corresponding genome requires knowledge of the species.

6

libraryPreparationMethod

This refers to the name of the library preparation, such as KAPA Hyper PCR 3.

7

Platform

This refers to the name of the platform used, for example, illumina.

8

readPair

Specify whether the read pair is 1 or 2.

9

specimenPreparationMethod

Minimize RNA degradation with methods such as flash freezing or RNALater. FFPE is not recommended.

10

tumorType

If the tissue is normal, indicate "not applicable." Otherwise, specify the tumor type.

11

isStranded*

This answer should be either "yes" or "no."

12

readPairOrientation*

Indicate the read pair orientation, such as forward or reverse.

* optional but recommended

...

Required annotations for Whole Genome Sequencing

Annotation term

Additional Details

1

fileFormat

Accepted file formats include "fastq", "bam", and "cram". If provided raw data are “bam” or “cram” format, only files that have not undergone any additional filtering (i.e. retains unmapped reads, have not been trimmed, etc) will be eligible for processing.

2

individualID

Individual IDs are necessary to create the sample sheets.

3

specimenID

specimen IDs are necessary to interpret the analysis.

4

Assay

Choose between Bulk RNA Seq or Single Cell RNA Seq.

5

Species

The corresponding genome requires knowledge of the species.

6

libraryPreparationMethod

This refers to the name of the library preparation, such as KAPA Hyper PCR 3.

7

Platform

This refers to the name of the platform used, for example, illumina.

8

readPair

Specify whether the read pair is 1 or 2.

9

specimenPreparationMethod

Minimize RNA degradation with methods such as flash freezing or RNALater. FFPE is not recommended.

10

tumorType

If the tissue is normal, indicate "not applicable." Otherwise, specify the tumor type. NOTE: Files from samples lacking tumor-normal pairs will not be eligible for Somatic variant calls or for microsatellite instability processing.

11

isStranded*

This answer should be either "yes" or "no."

12

readPairOrientation*

Indicate the read pair orientation, such as forward or reverse.

* optional but recommended

...

Required annotations for Whole Exome Sequencing

Annotation term

Additional Details

1

fileFormat

Accepted file formats include "fastq", "bam", and "cram". If provided raw data are “bam” or “cram” format, only files that have not undergone any additional filtering (i.e. retains unmapped reads, have not been trimmed, etc) will be eligible for processing. Note: WES files are not eligible for variant calling if BED file is not available

2

individualID

Individual IDs are necessary to create the sample sheets.

3

specimenID

specimen IDs are necessary to interpret the analysis.

4

Assay

Choose between Bulk RNA Seq or Single Cell RNA Seq.

5

Species

The corresponding genome requires knowledge of the species.

6

libraryPreparationMethod

This refers to the name of the library preparation, such as KAPA Hyper PCR 3.

7

Platform

This refers to the name of the platform used, for example, illumina.

8

readPair

Specify whether the read pair is 1 or 2.

9

specimenPreparationMethod

Minimize RNA degradation with methods such as flash freezing or RNALater. FFPE is not recommended.

10

tumorType

If the tissue is normal, indicate "not applicable." Otherwise, specify the tumor type. NOTE: Files from samples lacking tumor-normal pairs will not be eligible for Somatic variant calls

...

or for microsatellite instability processing.

11

isStranded*

This answer should be either "yes" or "no."

12

readPairOrientation*

Indicate the read pair orientation, such as forward or reverse.

* optional but recommended

Availability of processing workflows:

...