Browse the glossary to learn more about terms and definitions commonly used throughout Synapse.

ACT

Abbreviation for the Synapse Access and Compliance Team, a group of people who are responsible for setting, maintaining, and controlling governance throughout Sage and its platforms. The ACT recommends appropriate safeguards depending on the type of data and who can access it, and can apply certain limitations or conditions for data access based on how the data will be shared.

...

Learn how to install the Synapse command line client here.

Is this enough?

...

Controlled access

This is a type of data access setting. Data set as controlled access is available to registered, certified, or validated users who fulfil specific requirements for data access.

...

This is essentially a repository for metadata used throughout the Synapse site.

Need more here…

...

Data science

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data, and apply knowledge and actionable insights from data across a broad range of application domains.

Source: Wikipedia. Is there a better / more Sage-focused explanation?

...

Data model

An abstract model that organizes elements of data and standardizes how they relate to one another and to the properties of real-world entities. A data model explicitly determines the structure of data.

Source: Wikipedia. Is there a better / more Sage-focused explanation?

...

Digital object identifier (DOI)

...

Any distinct object in Synapse, including a file, folder, project… what else?

...

Experimental mode

A mode in Synapse that exposes new features and feature updates that are still in development, before they are ready to be pushed live. Anyone can test out this mode using the Experiment Mode link at the bottom right of Synapse.

...

Abbreviation for JavaScript Object Notation. JSON is a text-based data-interchange format built on two structures: an object (a collection of key-value pairs) and an array (an ordered list of values).

This may need tweaking… I tried to make it “approachable” but I may have missed key points or got something wrong.
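
As a rough illustration of the two JSON structures, the sketch below parses a small JSON document with Python's standard json module. The field names and values are made up for the example and are not taken from Synapse.

```python
import json

# A JSON object (key-value pairs) that contains a JSON array (ordered values).
# The field names and values here are hypothetical.
text = '{"species": "mouse", "assay": "RNAseq", "fileFormats": ["bam", "fastq"]}'

record = json.loads(text)        # JSON object becomes a Python dict
formats = record["fileFormats"]  # JSON array becomes a Python list

print(type(record).__name__, type(formats).__name__)
print(json.dumps(record, indent=2))  # serialize back to JSON text
```

Parsing and then serializing again should give back equivalent data, which is what makes JSON useful for exchanging data between programs.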

...

JSON schema

A specific JSON-based format that defines the structure of JSON data for validation, documentation, and interaction control. It provides a contract for the JSON data required by a given application, and how that data can be modified.

Source: Wikipedia. Please check for accuracy/relevance
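
To make the idea of a "contract" more concrete, here is a minimal sketch. The keywords "type", "properties", and "required" are standard JSON Schema keywords, but the field names are hypothetical and the check function is a toy written for this example. It handles only a tiny subset of the specification and is not a substitute for a real JSON Schema validator.

```python
import json

# A small JSON Schema document. The field names are hypothetical.
schema = json.loads("""
{
  "type": "object",
  "properties": {
    "species": {"type": "string"},
    "readLength": {"type": "integer"}
  },
  "required": ["species"]
}
""")

# JSON Schema type names mapped to Python types (subset only).
TYPES = {"string": str, "integer": int, "object": dict}


def check(instance, schema):
    """Toy validator: return a list of problems found (empty if none)."""
    if not isinstance(instance, TYPES[schema["type"]]):
        return [f"expected {schema['type']}"]
    problems = []
    # Every key listed under "required" must be present.
    for key in schema.get("required", []):
        if key not in instance:
            problems.append(f"missing required key: {key}")
    # Keys that are present must have the declared type.
    for key, sub in schema.get("properties", {}).items():
        if key in instance and not isinstance(instance[key], TYPES[sub["type"]]):
            problems.append(f"{key}: expected {sub['type']}")
    return problems


print(check({"species": "mouse", "readLength": 100}, schema))
print(check({"readLength": "long"}, schema))
```

The first call passes the contract, while the second is missing a required key and has a value of the wrong type.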

...

Key-value pairs

Key-value pairs are used in annotations, where the key defines a particular aspect of your data (for example, species, assay, file format) and the value defines a variable that belongs to that category (mouse, RNAseq, .bam).

Is this enough explanation? Are key-value pairs used outside of annotations?
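
The example keys and values above can be sketched as a Python dictionary. This is only an illustration of the key-value idea. The file names are hypothetical, and the sketch does not show how Synapse stores annotations internally.

```python
# Annotations as key-value pairs, using the examples from the definition.
annotations = {
    "species": "mouse",
    "assay": "RNAseq",
    "fileFormat": "bam",
}

# Two hypothetical files; the second overrides the "species" value.
files = [
    {"name": "sample_a.bam", "annotations": annotations},
    {"name": "sample_b.bam", "annotations": {**annotations, "species": "human"}},
]

# Because each value is attached to a named key, files can be filtered by it.
mouse_files = [f["name"] for f in files if f["annotations"]["species"] == "mouse"]
print(mouse_files)
```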

...

Manifest

This is a file that is uploaded alongside data and specifies information about the data files being uploaded. It also contains annotations that will be associated with each file in Synapse. It tells the computer the current location of each file to be uploaded (via path) and the Synapse ID of the folder where the files will be uploaded (via parent). The manifest can also describe the provenance of each file, indicating how it was generated. This is optional but helpful.
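
A minimal sketch of what such a file might contain, built with Python's standard csv module. Only the path and parent columns come from the description above. The tab-separated layout, the annotation columns, the file paths, and the Synapse ID syn123 are assumptions made for illustration.

```python
import csv
import io

# One row per file to upload. "path" and "parent" are described above;
# the remaining columns are hypothetical annotations.
rows = [
    {"path": "data/sample_a.bam", "parent": "syn123", "species": "mouse", "assay": "RNAseq"},
    {"path": "data/sample_b.bam", "parent": "syn123", "species": "mouse", "assay": "RNAseq"},
]

# Write to an in-memory buffer so the example leaves no files behind.
buffer = io.StringIO()
writer = csv.DictWriter(
    buffer,
    fieldnames=["path", "parent", "species", "assay"],
    delimiter="\t",
    lineterminator="\n",
)
writer.writeheader()
writer.writerows(rows)

manifest_text = buffer.getvalue()
print(manifest_text)
```

Each row pairs one local file with its destination folder, and every extra column becomes an annotation key for that file.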

...

In the context of data science, an ontology is essentially the system in place for naming and classifying entities and the relationships between them, as they exist in a particular data model. For example, the ontology of a research study would specify an appropriate naming convention for terms used throughout the study.

I made this up but not sure if it’s accurate / if I captured everything. Probably needs tweaking.

...

Open access

This is a type of data access setting. Data set as open access is available to all registered Synapse users, without use limitations.

...

Learn how to install the Synapse python client here.

Is this enough?

...

Registered User

This is one of four user account types in Synapse, which determine what actions a user can perform. Registered users can create projects and wikis, collaborate with other registered users, create Synapse teams, download publicly available data, and access controlled data (if they fulfil the conditions for use).

...

Find more information on user account types here.

...

REST (or REST API)

Need explanation

...

RNA-Seq

A sequencing technique which uses next-generation sequencing to reveal the presence and quantity of RNA in a biological sample at a given moment, analyzing the continuously changing cellular transcriptome (the set of all RNA transcripts in an individual or population of cells).

Source: Wikipedia. Is there a better / more Sage-focused explanation?

...

Schema

A snapshot of all the objects contained in a database and the relationships between them. Essentially, it is the structure of your data. In Synapse entities such as views and tables, the schema defines the column names, as well as the values or types of data allowed in each column.

Please review for accuracy.
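
One way to picture a schema that defines column names and the values or types allowed in each column is the sketch below. The Column class, the column names, and the allowed values are all hypothetical. This is not how Synapse represents table or view schemas.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Column:
    name: str
    py_type: type
    allowed: Optional[frozenset] = None  # None means any value of py_type


# A hypothetical two-column schema.
schema = [
    Column("species", str, frozenset({"mouse", "human"})),
    Column("readLength", int),
]


def row_errors(row):
    """Return a list of ways the row violates the schema (empty if valid)."""
    errors = []
    for col in schema:
        value = row.get(col.name)
        if not isinstance(value, col.py_type):
            errors.append(f"{col.name}: expected {col.py_type.__name__}")
        elif col.allowed is not None and value not in col.allowed:
            errors.append(f"{col.name}: {value!r} not allowed")
    return errors


print(row_errors({"species": "mouse", "readLength": 100}))
print(row_errors({"species": "rat", "readLength": "x"}))
```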

...

Sharing Settings

Settings that determine who can access content in Synapse and what permissions those users have with respect to a dataset.

...

Every object in Synapse (file, folder, project, table, view, user, etc.) is designated a unique Synapse ID (also known as synID) that is readable by programmatic clients.

Is this accurate?
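
Because the ID is meant to be readable by programs, a script can pick IDs out of free text. The sketch below assumes that a Synapse ID is written as the prefix syn followed by digits (for example, syn123). That pattern and the function name are assumptions for illustration and are not stated in this glossary.

```python
import re

# Assumed pattern: the literal prefix "syn" followed by one or more digits.
SYN_ID = re.compile(r"\bsyn\d+\b")


def find_syn_ids(text):
    """Return every substring of text that matches the assumed synID pattern."""
    return SYN_ID.findall(text)


print(find_syn_ids("Upload to syn123, then link syn4567."))
```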

...

Validated User

This is one of four user account types in Synapse, which determine what actions a user can perform. Validated users are certified users who have applied to have their user profile validated. This validation makes you eligible to request access to mHealth data.

...