Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


Tip
iconfalse
titleWelcome to HGVA project!

Human Genomic Variation Archive (HGVA) is an open-access genomic variation resource that integrates variants from the main reference projects. HGVA also adds valuable information such as variant annotation: consequence types, population frequencies, protein effect predictions, variant-associated phenotype, etc.



HGVA aims to provide a high-performance and scalable resource to store, query and visualise variants from main human datasets. We have put special emphasis on making HGVA very responsive even with complex queries and to make the data available to researchers and bioinformaticians in three different ways: a rich web interface based on OpenCB IVA, client libs (Python, Java and JavaScript) and through a command-line. Finally, all data is is normalised and annotated using OpenCB CellBase.

HGVA does not intend to replace or provide the same archiving services of NCBI dbSNP or EMBL-EBI EVA. These archives sites provide an excellent submission and accessioning services and play a crucial role allowing scientist to submit variation data for many different species. On the contrary, we have selected - and we will keep integrating - high-profile studies which are the most widely used in genomic variation analysis pipelines. Big genomic variation repositories such as EVA or dbSNP are multi-species services with a more general archiving aimHGVA is focused on human data only and selects the most relevant and high-quality datasets implementing different rich user interfaces to allow researchers to query all these datasets ar use this information in genomic pipelines.

HGVA is developed and maintained by researchers at University of Cambridge and Genomics England and it is available at http://hgva.opencb.org

Main Features

  • Most important high-quality variant studies normalised and integrated in just one server database
  • Rich variant annotation performed, including HPO terms, consequence types, substitution effect prediction scores, Gene Ontology terms, etc.
  • Population frequencies calculated, including populations and super-populations
  • Interactive web-based data mining tool based on IVA
  • Clients in Python, Java, JavaScript for fast programmatic access

Contact

  • Javier Lopez (javier.lopez@genomicsengland.co.uk)
  • Ignacio Medina (im411@cam.ac.uk)


Latest news:

Blog Posts
max3
spaceshgva
sortcreation
reversetrue
contentexcerpts



Current Studies

Refernce Studies GRCh37

  • 1000 Genomes Phase 3
  • Exome Aggregation Consortium (ExAC)
  • Exome Sequencing Project (ESP6500)
  • Genome of the Netherlands(GoNL)
  • UK10K project
  • Spanish Medical Genome Project

Refernce Studies GRCh38

  • 1000 Genomes Phase 3

Cancer GRCh37

  • QIMR Berghofer Melanoma
  • Chronic Myeloid Leukemia - Russian Academy of Medical Sciences

Statistics

More than 250M variants reported and about 120M of unique variants


Chart
imageFormatpng
dataOrientationvertical
titleVariant Studies GCh37


StudyNum. variants
1kG Phase385170328
ExAC

63000000

ESP65001997952
GoNL20708427
UK10K ALSPAC46618311
UK10K TWINSUK46618311
MGP711005



Developers

Source Code

Web based on IVA project at  https://github.com/opencb/iva/tree/app/hgva

Server based on OpenCGA at  https://github.com/opencb/opencga

Contributing

HGVA is a collaborative project that aims to integrate as many reference human studies as possible, you can contact us for feature request. If you want to contribute to the code you are more than welcome to contribute to IVA and OpenCGA



Recent space activity

Recently Updated
typespage, comment, blogpost
max5
hideHeadingtrue
themesocial