Annotating your data is the process of labeling files so that users can: get a general understanding of what information is contained in the file, determine what type of experiment was performed, determine what type of biospecimen was used, and assess whether the data is the type of data that they are looking for. Annotation is not a replacement for sharing experimental methods via a publication method section, in your Synapse project, or via a protocol repository like http://protocols.io.
To annotate your data, first:
If you are not yet logged in: log in to Synapse.org.
If you have not yet uploaded your data that you would like to annotate:
Navigate to your Synapse Project that was assigned to you at the time of project on-boarding.
Within the “Raw Data” or “Data” folder, create a new folder to house your current data upload.
Upload the data to this folder.
Navigate to the NF Data Curator app.
Follow the prompts to select the project, folder that you'd like to annotate and the type of assay you are annotating.
Click "Get Metadata Template" -> "Click to generate google sheets template" to generate a template for your new data, and wait for the app to provide a google sheet link. This may take a few moments, particularly if you have many files or lots of existing annotations on your files.
Note: you may download the manifest as an excel file (File → Download → Microsoft Excel), but we have not extensively tested functionality in Excel. If you encounter any issues filling in the manifest in Excel, please contact us at nf-osi@sagebionetworks.org.
Fill in the manifest.
For each column, hover over the column title to get a better idea of what type of information is needed in each column. For example, you can hover over the “individualID” column header to see the definition: “A unique identifier (non-PII) that represents the individual from which the data came. This could be a patient or animal ID.”
Use the dropdown selector for each cell to pick a value. For example, if you are annotating a western blot image, under assay, you’d select “westernBlot”.
If no terms fit, please type in a term that you feel best fits your file.
Our full dictionary, including definitions, is available in our Schema spreadsheet.
For columns without a dropdown, we do not have controlled vocabulary (these are generally project-specific IDs, such as an anonymous patient ID (called `individualID`).
Continue filling in the columns until all required (green columns) and relevant optional columns are filled in.
Submit your manifest.
If your manifest did not require any custom terms (as described in step 6c), then you can submit using the NF Data Curator app “Submit and Validate Metadata” tab.
Download the completed manifest as a csv file from Google Sheets (or, if using Excel, save as csv).
Upload the csv into the data curator app, following the on-screen instructions to validate your manifest. Follow the on-screen instructions if validation errors are present.
Once your manifest passes validation, click “Submit to Synapse” to annotate your files. Congratulations, you’re done annotating!
If your manifest did require custom terms (as described in step 6c), please email the completed google sheet or excel file to nf-osi@sagebionetworks.org so that we can add your new terms to our dictionary. We’ll then work with you to get the annotations applied to your data.
0 Comments