Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: remove confusing feature comparison row

...

Datasets

File Views

Underlying object type

Dataset

EntityView

Method for adding files

Select specific versions of individual files

Select projects and folders, including the latest version of all contained files, tables, or datasets

View and query file annotations

Edit file annotations in bulk

Versioning/snapshot functionality

Annotation functionality

DOI functionality

Limit on number of files

10,000

35,000

How to create a dataset

Create a dataset

  1. Navigate to the project that you want to create the dataset in

  2. Click the Datasets tab

  3. Click Add Dataset

  4. In the Create Dataset window, enter a name for the dataset

  5. Click Finish

You will now be directed to the new dataset that you just created (it will be empty at this point).

Note

Notice the banner indicating that this is a draft version of the dataset. A draft dataset should not be distributed externally until it is finalized by creating a stable version. See the section Create a Stable Version below for more information.

Add files to the dataset

  1. In the dataset, click Add Items

  2. In the Add Files to Dataset window, browse for the file(s) you want to add

    1. Click on the name of a project to see all folders, files, and tables contained within that project. Note that only files can be selected and added to the dataset

    2. If you want to see the contents of an individual folders, click the dropdown arrow next to a project name, or next to a folder, to reveal all of its contents. This will allow you to select in individual files contained within

    3. You can also search for individual files using the Search for Files tool (Note that you cannot use this tool to search for folders or projects, only individual files)

  3. Click the checkbox next to any of the file(s) that you want to add. If you want to add all files from within a folder, you can click the general checkbox at the top of the list to add all contents. You can also select which version of the file you want to appear in your dataset.

    (plus) Find more information at /wiki/spaces/DOCS/pages/2667675758.

    Your selections will appear in the Selected box at the bottom. They will remain in this “selected” status, even as you navigate through other folders and files. You can remove individual selections from here if necessary.

  4. Once you have selected all of the files you want, click Add Files. All files from your Selected box will be added to the dataset. At this point, before saving the dataset, you can still add or remove files from the dataset, or change the version of any files (see screenshot below)

  5. Click Save to save your current selection and return to the draft dataset

In the screenshot below, notice how there are several actions you can take after adding new files, before saving this version. You can use the checkmarks to select any of the files and remove them, or use the Version dropdowns to change the version selection of any file. You can also add new files. Note that you can also edit the draft dataset after saving, but if you create a stable version then it will reflect the selection that you included at the time of creation. See the section Create a Stable Version below for more information.

...

Here’s how to create a stable version:

  1. Click Dataset Tools and select Create a Stable Dataset Version from the dropdown menu

  2. In the Create Stable Version window, add an appropriate label for the version, and a comment if necessary. Note that you do not need to add a version number, since is is already added for you

You will now see the new version, as well as the full version history. From here, you can go back to your draft.

...

To add annotations to a dataset:

  1. Once in the dataset, click Dataset Tools and select Annotations from the dropdown list

  2. In the My Dataset window, click Edit

  3. As the on-screen instructions state, click the Add icon to begin adding annotations

  4. Complete your annotations using the fields provided

    • Use the + button to the right of each row to add a new value for any Key

    • Use the x button to the right of each row to delete that row

    • Click the Add icon to add another Key

  5. Click Save

Edit the dataset schema

You can customize the visible columns of a dataset. These columns will be auto-populated based on the annotation values of the underlying files in the dataset. Here’s how to customize these columns:

  1. In the dataset, click Dataset Tools, and select Show Dataset Schema from the dropdown menu

  2. Click Edit Schema at the bottom of the table

  3. In the resulting Edit Columns window, you can add columns to your dataset schema using any combination of these three options:

    • Click Add Column to manually add individual columns one by one. If these columns exist as annotations on one or more of the files in the Dataset, the values will be displayed in the Dataset. You cannot use a Dataset to bulk annotate files, so do not add columns that do not already exist as annotations, since this will not serve any purpose.

    • Click Add Default Dataset Columns to add the default columns used in Datasets

      • You can then customize this list by removing any of the default columns you don’t want to be included—to do so, click the checkbox next to any column(s), followed by the trash can icon at the top.

    • Click Import columns to import columns from another table in Synapse. Again, only columns which already exist as underlying annotations will be relevant.

  4. Once you have added all columns of interest, you can:

    • Use the arrows at the top to reorder the columns

    • Enter any values as needed in the Restrict Values column

    • Select/change any column facet using the Facet dropdown

  5. Click Save

Mint a DOI (digital object identifier)

...