Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Table of Contents
minLevel3
maxLevel3
outlinefalse
typelist
printablefalse
With the support of the Neurofibromatosis Therapeutic Acceleration Program (NTAP), we are excited to introduce a pilot initiative aimed at uniformly processing of genomic and transcriptomic data on the NF Data Portal. This initiative encompasses all projects that have received funding from NTAP between 2018 and 2023. The processing of high-dimensional genomic and transcriptomic data will be carried out through the use of standardized data processing pipelines. By ensuring uniformity in data processing, we will be able to share the processed data on the NF Data Portal and facilitate its utilization in other data analysis and exploration platforms. If you have a dataset shared on the NF Data Portal and would like to utilize the NF-OSI Processing pipelines, please reach out to us at nf-osi@sagebionetworks.org. We would be happy to assist you in a case-by-case basis.

Datatypes eligible for processing

Expand
titleWhole Exome Sequencing

A technique that focuses on sequencing only the exons, which are the coding regions of the genome, of an individual's DNA. WES is used to identify genetic variations in these regions that may be associated with certain diseases or traits.

Expand
titleWhole Genome Sequencing

A technique that involves sequencing the entire genome of an individual. This includes both the coding regions and the non-coding regions of DNA. WGS is used to identify genetic variations throughout the entire genome that may be associated with certain diseases or traits.

Expand
titleBulk RNA Sequencing

A technique that involves sequencing the transcriptome of a bulk population of cells. This technique provides information on the gene expression levels of all genes in the sample and is commonly used to compare gene expression profiles between different samples or conditions.

Expand
titleSingle cell RNA sequencing

A technique that involves sequencing the transcriptome of individual cells. This technique provides information on the gene expression levels of each cell and is commonly used to study cell heterogeneity and identify rare cell populations.

Required annotations for data files to be staged for processing

...

For each of the following assays, data files must be annotated with the terms listed below.

RNA Sequencing

Annotation term
Additional Details
1
fileFormat
Accepted file formats include "fastq", "bam", and "cram". If provided raw data are “bam” or “cram” format, only files that have not undergone any additional filtering (i.e. retains unmapped reads, have not been trimmed, etc) will be eligible for processing.
2
individualID
Individual IDs are necessary to create the sample sheets.
3
specimenID
specimen IDs are necessary to interpret the analysis.
4
Assay
Choose between Bulk RNA Seq or Single Cell RNA Seq.
5
Species
The corresponding genome requires knowledge of the species.
6
libraryPreparationMethod
This refers to the name of the library preparation, such as KAPA Hyper PCR 3.
7
Platform
This refers to the name of the platform used, for example, illumina.
8
readPair
Specify whether the read pair is 1 or 2.
9
specimenPreparationMethod
Minimize RNA degradation with methods such as flash freezing or RNALater. FFPE is not recommended.
10
tumorType
If the tissue is normal, indicate "not applicable." Otherwise, specify the tumor type.
11
isStranded*
This answer should be either "yes" or "no."
12
readPairOrientation*
Indicate the read pair orientation, such as forward or reverse.

* optional but recommended

...

Whole Genome Sequencing

Annotation term
Additional Details
1
fileFormat
Accepted file formats include "fastq", "bam", and "cram". If provided raw data are “bam” or “cram” format, only files that have not undergone any additional filtering (i.e. retains unmapped reads, have not been trimmed, etc) will be eligible for processing.
2
individualID
Individual IDs are necessary to create the sample sheets.
3
specimenID
specimen IDs are necessary to interpret the analysis.
4
Assay
Choose between Bulk RNA Seq or Single Cell RNA Seq.
5
Species
The corresponding genome requires knowledge of the species.
6
libraryPreparationMethod
This refers to the name of the library preparation, such as KAPA Hyper PCR 3.
7
Platform
This refers to the name of the platform used, for example, illumina.
8
readPair
Specify whether the read pair is 1 or 2.
9
specimenPreparationMethod
Minimize RNA degradation with methods such as flash freezing or RNALater. FFPE is not recommended.
10
tumorType
If the tissue is normal, indicate "not applicable." Otherwise, specify the tumor type. NOTE: Files from samples lacking tumor-normal pairs will not be eligible for Somatic variant calls or for microsatellite instability processing.
11
isStranded*
This answer should be either "yes" or "no."
12
readPairOrientation*
Indicate the read pair orientation, such as forward or reverse.

* optional but recommended

...

Whole Exome Sequencing

Annotation term
Additional Details
1
fileFormat
Accepted file formats include "fastq", "bam", and "cram". If provided raw data are “bam” or “cram” format, only files that have not undergone any additional filtering (i.e. retains unmapped reads, have not been trimmed, etc) will be eligible for processing. Note: WES files are not eligible for variant calling if BED file is not available
2
individualID
Individual IDs are necessary to create the sample sheets.
3
specimenID
specimen IDs are necessary to interpret the analysis.
4
Assay
Choose between Bulk RNA Seq or Single Cell RNA Seq.
5
Species
The corresponding genome requires knowledge of the species.
6
libraryPreparationMethod
This refers to the name of the library preparation, such as KAPA Hyper PCR 3.
7
Platform
This refers to the name of the platform used, for example, illumina.
8
readPair
Specify whether the read pair is 1 or 2.
9
specimenPreparationMethod
Minimize RNA degradation with methods such as flash freezing or RNALater. FFPE is not recommended.
10
tumorType
If the tissue is normal, indicate "not applicable." Otherwise, specify the tumor type. NOTE: Files from samples lacking tumor-normal pairs will not be eligible for Somatic variant calls or for microsatellite instability processing.
11
isStranded*
This answer should be either "yes" or "no."
12
readPairOrientation*
Indicate the read pair orientation, such as forward or reverse.

* optional but recommended

Assay-specific workflow availability

...