Page tree
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 37 Next »

In this section you can find only the main top-level features scheduled for next releases. For a more detailed list you can go to GitHub Issues at https://github.com/opencb/opencga/issues

OpenCGA 2.x Planned Releases

2.0.0 (March 2018)

Still work in progress

General

  • Add Reactive Programming (RxJava) and Events, this will allow to be easily integrated into other custom Java-based applications
  • Upgrade to JUnit 5
  • Clean ups and remove deprecated code and APIs
  • IVA 1.1.0 will display biological networks and will allow to add Watchers to queries

Catalog

  • Audit improvements
  • Add Federation, this allows to connect several OpenCGA instances
  • Add new CreateResult, UpdateResult, ... to REST web services

Storage

  • Add a Watcher API to automatically run queries and notify results
  • Annotate from public/private internal Clinical Variant database
  • New Gene Expression database, this will include a Gene Annotation based on CellBase
  • Add new big data variant analysis such as PCA, this is being implemented in a different OpenCB repository 


OpenCGA 1.x Planned Releases

1.4.0 (January 2018)

General

  • Implement the new HTSGET 1.0 protocol
  • IVA 1.0.0 will implement a full clinical analysis support among many other features
  • Add many more functional tests
  • Documentation improvements with new diagrams and tutorials

Catalog

  • Complete all delete operations and implement cascade propagation when needed, with this the REST API can be considered complete
  • New Notification API, this will allow for example to implement user notifications
  • New PermissionRule feature, you can predefine rules for assigning permissions automatically to new data, e.g. set VIEW permission to USER to all samples where HOSPITAL = 'X'
  • Complete ClinicalAnalysis, ClinicalInterpretation and ClinicalReport data models and functionality

Variant Storage

  • Support the last pending structural variant Translocation, with this all structural variants are properly represented and stored
  • Implement variant imprecise queries and some specific structural variant queries
  • Improve variant stats and add simple variant analysis such as association or Hardy-Weinberg test, this will be stored and indexed in the new VariantScore object
  • Add INDEL left-alignment normalisation to VariantNormaliser
  • Variant Benchmark suite to study scalability and performance


1.3.0 (November 2017)

General

  • CLI autocompletion implemented
  • New CLI for automatic migrations
  • New and fully functional R client library for REST web services, with this the four client libraries are completed
  • New IVA 0.9.0 is developed coordinately to exploit all the new features, they will be released together
  • Many more functional tests added to test all new functionality described below
  • Review and improve Swagger documentation and descriptions
  • Documentation improvements with new diagrams and tutorials

Catalog

  • New Family data model finished, now it is production ready, this completes and integrates three related data models: Sample, Individual and Family
  • New Versioning feature implemented for Sample, Individual and Family. Now you can track any change in those data models, users can query o review any version of those documents
  • New Export functionality implemented, this allows to export a Project as it was at any specific release, this can then imported in a new OpenCGA server
  • New Study administrative group called admins, all users in this group will be granted some special permissions at Study level such as create groups or share data, this will make Study administration much easier
  • New Confidential permission for Variable Sets, now you can make some clinical data private for some users
  • New ClinicalAnalysis data model added, this allows to define and stored different clinical interpretation analysis, this is still experimental and it should not be used in production
  • Improvements in Group By queries, now you can pass a count parameter and aggregations only use data you can view, this can be useful for summarising data. Also, this has been added to Individual and Family
  • Ensure that all query GET REST web services accept comma-separated list of IDs, at the moment only few of them accept ID lists, this will reduce the number of REST calls needed improving the performance
  • New REST web service to execute remote scripts for Catalog, for instance "move samples from Study"
  • Performance improvements when checking permissions (ACL) in create and update methods, now on average 50% less database queries are needed

Variant Storage

  • Improve support for Structural Variants, in this release we will fully support Insertion, Deletion and Copy Number variants
  • New VariantMetadata implemented, this is exported together with the variant data to be further analysed with other OpenCB projects using Spark
  • New VariantScore object added to Variant data model, this will allow to store variant scores from cohort-related analysis such as association or Hardy-Weinberg tests in the next release
  • Implement some HBase physical schema improvements and a better integration with Solr
  • Support Amazon EMR Hadoop cluster
  • Performance improvements when querying variants from samples, this will have a big impact in clinical interpretation analysis

Alignment Storage

  • Major improvements in BAM query engine. New server-side filters added, this is an efficient implementation size the data converted and sent through the network is reduced. The available filters now are: region, minMapQ, maxNumberMismatches, maxNumberHits,  properlyPaired,  maxInsertSize,  unmmapped and duplicated.
  • New coverage calculator using BigWig. Now coverage is calculated and stored in BigWig format, the windowSize is configurable. Also, coverage can now be queried for a region and windowSize, the server aggregates and averages in windowSizes.
  • New REST and gRPC APIs implemented for the new query filters and coverage functionality. When using REST a JSON string using GA4GH data model is returned. When gRPC is used a binary stream is obtained, notice this is also filter in the server.

Unscheduled features

The following features have been accepted but no release version has been assigned:

  • Add test for the CLI
  • Support Slurm

You can find detailed information for some of them at https://github.com/opencb/opencga/milestone/10

Table of Contents:


  • No labels