A genomic data analysis platform need to keep track of different resources such as metadata of files, sample annotations or jobs. OpenCGA Catalog aims to collect and integrate all the information needed for executing genomic analysis. This information is organized in nine main entities: users, studies, files, samples, datasets, cohorts, individuals, disease panels and jobs.
Main Features
The main tasks of Catalog are to provide:
Authentication and authorization to the different resources.
A collaborative environment.
File audit to keep track of files and metadata.
Analysis and Jobs.
Sample, individual and cohort annotation.
Security
RESTful web services
All this information can be stored and retrieved using our Java and RESTful web services API.