Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 4.0

...

Use Cases (In order of importance)

  1. make sure all values exist in user-defined enumeration
  2. set units + desc for a col
  3. Make sure all values in a col match an existing ontology
  4. standardized clinical variable names across studies (column)
  5. complete ontology for sage use => EFO partial soln (Brig)
  6. clean up misspellings, synonyms, capitalization => google refine
  7. generate script to curate data, apply same transformations to new/updated dataset => google refine
  8. Show description of term
  9. some sort of record of what was changed, and from what to what
  10. unit conversion
  11. linking across studies by id - cell lines - same patients multiple studies

Server Design

It appears that to satisfy the most important use cases we can use nearly all existing features within the repository service. This includes

...