OpenCGA uses a hierarchical structure to organize datasets. Briefly, Projects, Studies and Cohorts are used to organize HGVA metadata:

  • Projects are entities which contain one or more Studies.

  • A Study, in turn, represents a particular data set, with or without sample metadata or cohorts, together with its genomic variation data. For example, The 1000 Genomes Project is defined as a study in OpenCGA. Likewise, the Genome of the Netherlands and the Exome Aggregation Consortium are two further, separate studies, and so on.
  • Finally, a cohort is simply a set of samples defined within a study. For example, populations and super-populations within The 1000 Genomes Project are defined as cohorts. Thus, EUR, AMR or GBR are examples of cohorts.
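
The hierarchy described above can be pictured with a small data model. This is only an illustrative sketch, not the OpenCGA API: the class names `Project`, `Study` and `Cohort`, their fields, the project name `example_project` and the sample identifiers are all assumptions made for this example. Only the study name and the cohort names EUR and GBR come from the text.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class Cohort:
    """A named set of sample identifiers defined within a study."""
    name: str
    samples: Set[str] = field(default_factory=set)


@dataclass
class Study:
    """A data set that may define cohorts over its samples."""
    name: str
    cohorts: Dict[str, Cohort] = field(default_factory=dict)

    def add_cohort(self, cohort: Cohort) -> None:
        # Cohorts are indexed by name within their study.
        self.cohorts[cohort.name] = cohort


@dataclass
class Project:
    """An entity that contains one or more studies."""
    name: str
    studies: Dict[str, Study] = field(default_factory=dict)

    def add_study(self, study: Study) -> None:
        # Studies are indexed by name within their project.
        self.studies[study.name] = study


# Hypothetical project name and sample identifiers, for illustration only.
project = Project("example_project")
study = Study("1000 Genomes Project")
study.add_cohort(Cohort("EUR", {"sample_1", "sample_2"}))
study.add_cohort(Cohort("GBR", {"sample_1"}))
project.add_study(study)

# List the cohorts defined in one study of the project.
cohort_names = sorted(project.studies["1000 Genomes Project"].cohorts)
print(cohort_names)  # ['EUR', 'GBR']
```

In this sketch a sample may belong to several cohorts of the same study, which mirrors how a population such as GBR sits inside the super-population EUR.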

Please visit http://bioinfo.hpc.cam.ac.uk/hgva-1.0/... for a full list of the datasets (studies) currently available in HGVA and how they are organized into projects.