An integrated suite of high-performance (big data) applications for the management and analysis of population-scale genomic data.

OpenCB Summary

By replacing datafiles with databases our software promotes:

  • Scalability: the ability to run queries in real time across hundreds of thousands of genomes.
  • Accessibility: the ability to analyse data from anywhere on the web without needing access to the local filesystem.
  • Integration: the ability to link data together using flexible data models covering reference data, variant data and clinical metadata.
  • Security: the ability to protect data using federated (e.g. SSO) authentication and role-based authorisation schemes.

OpenCB solutions are typically used as:

  • The storage target of secondary analysis pipelines.
  • The data source for tertiary analysis workflows.

The open-source OpenCB software is developed and maintained by researchers from multiple organisations and made freely available at https://github.com/opencb


Flagship Projects


CellBase

The "unified reference"

CellBase aggregates over 10 TB of reference data from over 20 data sources (and counting). Data are exposed via a single, consistent API. Users can use the public instance hosted by the University of Cambridge or install their own copy. Those who install their own copy can use CellBase to manage their own reference collection.

OpenCGA

The "VCF database"

OpenCGA is software for the storage and retrieval of genotype data and the associated clinical and operational metadata. Its integration with CellBase enables variant annotation. It provides extensive web services, APIs for R and Python, and its own command-line interface.

IVA

The "web application" 

The Interactive Variant Analysis (IVA) web application makes it easy to work with the variant information stored in OpenCGA and annotated by CellBase. It has tools for browsing, filtering, analysis and interpretation that are tailored for studies of population genomics and genomic medicine.



Welcome to OpenCB!

The Computational Biology (OpenCB) open-source software initiative develops several high-performance, scalable projects for the analysis of high-throughput genomic data.

OpenCB projects are developed and maintained by researchers from several universities and large projects, such as the University of Cambridge and Genomics England. The software is open-source and freely available at https://github.com/opencb

Overview

OpenCB provides a high-performance and scalable platform for the analysis of high-throughput genomic data.

Contact

Ignacio Medina





Projects

CellBase

OpenCGA

IVA

Genome Maps

...


Main Contributors

Joaquin Dopazo

Augusto Rendon

Stefan Gräf

University of Cambridge

Developers

