...
Use Cases (in order of importance)
- Make sure all values in a column exist in a user-defined enumeration
- Set units and a description for a column
- Make sure all values in a column match an existing ontology
- Standardize clinical variable names across studies (column)
- Complete ontology for Sage use => EFO (Experimental Factor Ontology) is a partial solution (Brig)
- Clean up misspellings, synonyms, capitalization => Google Refine
- Generate a script to curate data, then apply the same transformations to a new/updated dataset => Google Refine
- Show the description of a term
- Keep a record of what was changed, and from what to what
- Unit conversion
- Link across studies by ID, e.g. cell lines, or the same patients in multiple studies
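
To make a few of the use cases above concrete (checking values against a user-defined enumeration, cleaning up capitalization and synonyms, and recording what was changed), here is a minimal Python sketch. It is only an illustration under stated assumptions: the names curate_column and Change, the synonym map, and the sample values are all hypothetical and are not part of the repository service or of Google Refine.

```python
from dataclasses import dataclass


@dataclass
class Change:
    """One recorded edit: which row changed, and from what to what."""
    row: int
    old: str
    new: str


def curate_column(values, allowed, synonyms=None):
    """Normalize a column against a user-defined enumeration.

    Hypothetical helper: returns (cleaned values, change log, invalid entries).
    Matching ignores case and surrounding whitespace.
    """
    # Look-up table from lower-cased spelling to the canonical term.
    canonical = {term.lower(): term for term in allowed}
    for alias, term in (synonyms or {}).items():
        canonical[alias.lower()] = term

    cleaned, changes, invalid = [], [], []
    for row, raw in enumerate(values):
        term = canonical.get(raw.strip().lower())
        if term is None:
            # Not in the enumeration: leave the value as-is and flag it.
            invalid.append((row, raw))
            cleaned.append(raw)
            continue
        if term != raw:
            changes.append(Change(row, raw, term))
        cleaned.append(term)
    return cleaned, changes, invalid


if __name__ == "__main__":
    cleaned, changes, invalid = curate_column(
        ["Male", "female ", "F", "unknwn"],
        allowed=["male", "female"],
        synonyms={"f": "female", "m": "male"},
    )
    print(cleaned)
    print(changes)
    print(invalid)
```

Because the enumeration and synonym map are plain data, the same call can be re-run on a new or updated dataset, which is the "apply same transformations" idea in miniature. Values that match nothing are reported rather than silently altered.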
Server Design
It appears that we can satisfy the most important use cases almost entirely with existing features of the repository service. This includes
...