OpenCGA is an open-source project that aims to provide a Big Data storage engine and analysis framework for genomic-scale data analysis of hundreds of terabytes.

Main Features

  • High-performance and scalable variant storage and indexing to load and merge VCF/gVCF files
  • Annotate and calculate statistics for all the variants
  • Clinical interpretation analysis of samples and families
  • Client libraries developed in Java, Python, R and JavaScript
  • Integrated Catalog keeps track of users, files, jobs, clinical data...
  • Interactive web-based data mining tool based on IVA

Contact


Latest news:



Variant Storage and Analysis


Clinical Genomics


Developers

Source Code

The web application is based on the IVA project at https://github.com/opencb/iva/tree/app/hgva

The server is based on OpenCGA at https://github.com/opencb/opencga

Contributing

IVA is a collaborative project that aims to integrate as many reference human studies as possible. You can contact us with feature requests. If you want to contribute code, you are more than welcome to contribute to IVA and OpenCGA.



Contributors

Ignacio Medina (HPCS, University of Cambridge)

Dr. Augusto Rendon (Genomics England)

Dr. Stefan Gräf (Clinical School, University of Cambridge)

Dr. Joaquin Dopazo (CIPF)
