Tip | ||
---|---|---|
| ||
Confluence spaces are great for sharing content and news with your team. This is your home page. Right now it shows recent space activity, but you can customize this page in anyway you like. |
Complete these tasks to get started
- Edit this home page - Click Edit in the top right of this screen to customize your Space home page
- Create your first page - Click the Create button in the header to get started
- Brand your Space - Click Configure Sidebar in the left panel to update space details and logo
- Set permissions - Click Space Tools in the left sidebar to update permissions and give others access
Metadata and Security
Metadata Database
OpenCGA Catalog implements a high-performance metadata database to track all files metadata, samples, families, ...
Security
OpenCGA implements authentication to control what data can be seen by users. Data such as Files, Samples, Families, .. can be shared in different way.
Variant and Alignment Storage
Variant Database
OpenCGA implements a high-performance and scalable variant NoSQL database to store and index thousands of whole genome VCF files. Performance observed show more than 2,000 whole genomes indexed a day.
Many variant operations have been implemented such as variant aggregation, stats calculation, variant annotation, export, ...
We have implemented the most advanced query engine and aggregation framework to query variants.
Alignment Storage
Indexing BAM files and calculating coverage is supported. You can efficiently query all these data through REST web services.
Easy to Use
REST API and Clients
We have implemented a comprehensive REST API to work with Catalog and query Variants and Alignment data in a secure way. To facilitate using REST we have developed four client libraries developed in Java, Python, R and Javascript.
Command Line Interface
OpenCGA implements two different command lines, one for the users and one for the admin. Users can fully operate OpenCGA from the command line.
Analysis Framework
Native Analysis and Plugins
OpenCGA implkements most common analysis such as stats or GWAS among many other ones. We will keep adding more common analysis in each version.
Users can implement their own native analysis for OpenCGA by developing a plugin. These plugins can easily be installed and executed in OpenCGA.
Wrapped Analysis
OpenCGA can also execute any other external binary (C++, Python R, ...) by creating a simple wrapper that connect OpenCGA storage engine with the binary. We also provide some official external binaries supported such as Plink
Clinical Analysis
Clinical Data and Disease Panels
You can store all you clinical data in our free data model solution in Catalog. You can define your clinical variables and annotate files, samples, individuals, families or cohort. Clinical Data is indexed automatically to provide a real-time queries and aggregations analysis.
Disease Panels are fully supported and versioned.
Clinical Interpretation Analysis
You can define different types of Clinical Analysis. We have implemented some automatic clinical interpretation algorithms for Rare Diseases (families) and Cancer. A Decision Support System has also been implemented in IVA.
Big Data Analysis
Rich Data Models
OpenCGA takes advantage of the rich data models developed in OpenCB. We make an extensive use of Variant and Variant Annotation data models.
Spark Analysis
OpenCGA implements several analysis top of the Variant storage. These analysis can use different programming models – such as MapReduce – or different technologies such as Spark.
A Spark-based library has developed to provide extra analysis capabilities.
Cloud
Cloud Architecture
OpenCGA architecture was designed to be fully compatible with modern cloud architectures, this makes of OpenCGA extremely efficient and performance in cloud environments.
Microsoft Azure
OpenCGA and Microsoft collaborated to test and validate HDInsight security and analysis performance.
Visualisation
Source Code
Web based on IVA project at https://github.com/opencb/iva/tree/app/hgva
Server based on OpenCGA at https://github.com/opencb/opencga
Contributing
IVA is a collaborative project that aims to integrate as many reference human studies as possible, you can contact us for feature request. If you want to contribute to the code you are more than welcome to contribute to IVA and OpenCGA
Zetta Genomics
Start-up
A University of Cambridge start-up is being launched during 2019. Zetta will provide official support and a number of different services.
This is will be officially announced in later 2019, if you want to know more about this please contact im411@cam.ac.uk
Development
Contributors
Ignacio Medina (HPCS, University of Cambridge)
Source Code
Web based on IVA project at https://github.com/opencb/iva/tree/app/hgva
Server based on OpenCGA at https://github.com/opencb/opencga
Contributing
IVA is a collaborative project that aims to integrate as many reference human studies as possible, you can contact us for feature request. If you want to contribute to the code you are more than welcome to contribute to IVA and OpenCGA
Recent space activity
Recently Updated | ||||||||
---|---|---|---|---|---|---|---|---|
|
Space contributors
Contributors | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
|