A genomic data analysis platform need to keep track of different resources such as metadata of files, sample annotations or jobs. OpenCGA Catalog aims to collect and integrate all the information needed for executing genomic analysis. This information is organized in nine main entities: users, studies, files, samples, datasets, cohorts, individuals, disease panels and jobs.
The main tasks of Catalog are to provide:
This section describes the most relevant entities. For more detailed information about the data models such as Java source code, examples or the JSON Schemas you can visit OpenCGA Catalog Data Models page. You can see an overview of the data model in this picture:
The most relevant entities in OpenCGA Catalog are:
All this information can be stored and retrieved using our Java and RESTful web services API.
Table of Contents: