Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

How To

Upload a dataset to S3

TODO brian.holt: talk about your process, any scripts you used, and any gotchas to watch out for

The idea is that if you are on vacation, we should have enough info here to correctly follow your design for layout and encoding.For the initial upload, a GUI tool called BucketExplorer (http://www.bucketexplorer.com/) is used. Uploads are done from the internal host fremont.fhcrc.org using the local access account 'platform', with the same password as the platform@sagebase.org account. The most efficient way to connect is to use an NX protocol client (http://www.nomachine.com/download.php) to get a virtual desktop as the user platform. Once connected the preconfigured BucketExplorer can be found in the application menu in the lower left corner of the screen.

The initial datasets are stored in /work/platform/. This entire collection is mirrored exactly and can transfered by dragging and dropping into the data01.sagebase.org s3 bucket. BucketExplorer is very efficient, and will do hash comparisons and only transfer what files have changed. One can also get a visual comparison of what files have changed using the 'Comparer' button. During the transfer, BucketExplorer will parallelize the transfer into 20 streams for very efficient use of outgoing bandwidth to the cloud. 

Create a new IAM group

TODO deflaux: talk about Sage-specific stuff such as where we want to check in our policy files to SVN for each group

...