OpenCGA uses a hierarchical structure to organize datasets. Briefly, Projects, Studies and Cohorts are used to organize HGVA metadata:
Here you can get more info about OpenCGA Catalog data models. Please, see below the full list of currently available datasets (loaded as studies) in HGVA and how they are organised in different projects.
Project name (alias) | Studies | Version/Date | |
---|---|---|---|
HGVA v1 (Dec. 2016) | HGVA v2 (Jul. 2017) | ||
Reference GRCh37 (reference_grch37) | 1000 genomes project GRCh37 | Phase 3 2016-05 | To be decided |
Exome Sequencing Project | 2016-05 | To be decided | |
Exome Aggregation Consortium | 0.3.1 2016-05 | To be decided | |
Genome of the Netherlands | Release 5 2016-05 | To be decided | |
UK10K project | 2016-05 | To be decided | |
Spanish Medical Genome Project | 2016-12 | To be decided | |
Reference GRCh38 (reference_grch38) | 1000 genomes project GRCh38 | Phase 3 2016-10 | To be decided |
Cancer GRCh37 (cancer_grch37) | QIMR Berghofer Melanoma | 2016-12 | To be decided |
Chronic Myeloid Leukemia - Russian Academy of Medical Sciences | 2016-12 | To be decided | |
Platinum (platinum) | Illumina Platinum | 2015-08 | To be decided |
Variant annotation was carried out by the CellBase project. Please, check CellBase documentation for details on additional data sources: Data sources and species
Table of Contents: