...

  1. A data contributor first makes a call to a Metadata Service, asking: "Given this dataset, what metadata should I specify?"
    The Metadata Service responds with a manifest schema (the list of annotations).
  2. The data contributor fills in the manifest file with explicit details for each file in the dataset and submits the manifest file to the Metadata Service.
    The Metadata Service validates the submitted manifest file and responds with either "the manifest file is complete" or "changes need to be made to the manifest".
    This step may be repeated several times before the manifest file is complete.
  3. The data contributor makes a call to the Metadata Service, asking: "Where should this dataset be uploaded?"
    The Metadata Service responds with a Synapse Project, an AWS S3 bucket, or a Google Cloud Storage bucket.
  4. a. The data contributor uploads the dataset to Synapse.
    b. The data contributor uploads the dataset to an AWS S3 bucket or a Google Cloud Storage bucket that Synapse does not have write permissions to. The data contributor then creates external URL File Handles in Synapse that point to the uploaded data.
  5. The data contributor annotates the dataset/Links using the manifest file.
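
The first three steps of this workflow can be sketched roughly as follows. This is a minimal in-memory illustration, not the actual Metadata Service API: the function names (`get_manifest_schema`, `validate_manifest`, `get_upload_destination`), the `rnaseq` dataset type, the annotation keys, and the bucket location are all hypothetical and chosen only to show the request/response shape and the validate-and-resubmit loop.

```python
from typing import Dict, List

# Hypothetical registry: dataset type -> annotation keys the manifest must fill in.
SCHEMAS: Dict[str, List[str]] = {
    "rnaseq": ["fileName", "assay", "species"],
}

# Hypothetical registry: dataset type -> where the data should be uploaded.
DESTINATIONS: Dict[str, Dict[str, str]] = {
    "rnaseq": {"type": "s3", "location": "s3://example-bucket/rnaseq"},
}


def get_manifest_schema(dataset_type: str) -> List[str]:
    """Step 1: return the list of annotations the contributor must specify."""
    return list(SCHEMAS[dataset_type])


def validate_manifest(schema: List[str], manifest: List[Dict[str, str]]) -> List[str]:
    """Step 2: return a list of problems; an empty list means the manifest is complete."""
    errors = []
    for row_number, row in enumerate(manifest, start=1):
        for key in schema:
            if not row.get(key):
                errors.append(f"row {row_number}: missing value for '{key}'")
    return errors


def get_upload_destination(dataset_type: str) -> Dict[str, str]:
    """Step 3: return the storage location assigned to this dataset type."""
    return dict(DESTINATIONS[dataset_type])


# Step 1: ask which annotations are required.
schema = get_manifest_schema("rnaseq")

# Step 2: submit a manifest with one row per file; the second row is incomplete.
manifest = [
    {"fileName": "sample1.fastq", "assay": "rnaSeq", "species": "Human"},
    {"fileName": "sample2.fastq", "assay": "rnaSeq", "species": ""},
]
errors = validate_manifest(schema, manifest)

# Fix the reported problem and resubmit; this may take several rounds in practice.
manifest[1]["species"] = "Human"
errors_after_fix = validate_manifest(schema, manifest)

# Step 3: once the manifest is complete, ask where to upload.
destination = get_upload_destination("rnaseq")
```

Steps 4 and 5 (uploading the data, creating File Handles, and applying the annotations) depend on the storage location returned in step 3 and on the Synapse client in use, so they are left out of this sketch.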

...