Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

You can find the code for this script here clinicalVariableDescriptionsLoader.py

How to run the Data Loader (Deprecated)

We should be migrating data and maintaining it between version upgrades now. 

Once environments are running, you can populate the system with a set of starting data.  On one of the local servers, goto /work/platform/DatasetMetadataLoader and execute the following:# Make sure you have the latest version
svn up

  1. Execute the loader
  2. Replace <repo_instance> and <auth_instance> by the repository and authentication instances.
  3. Either make sure that <platform_admin_email> is a Synapse administrator on crowd, or replace it by a Synapse administrator account
    python datasetCsvLoader.py -e http://<repo_instance>/repo/v1
    -a http://<auth_instance>/auth/v1 -u <platform_admin_email> -p <platform_admin_pw>
    This will create a publicly-accessible project called Sage BioCuration, and populate it with curated data from Sage's repository data team.

If you need to repopulate the data in S3, pass the -3 argument to the data loader. It upload the data in serial right now so it takes an hour or two. We really should only need to do this if we've messed up our S3 bucket.