...
...
- Different NoSQL databases for storage. Users can choose which database fits best current infrastructure and data size
- Apache Hadoop for big data processing and storage
- High-performance Computing (HPC) for computation-intensive analysis
- HTML5 and RESTful web services for information retrieval and data visualization
...
Platform Overview
The image below shows a global view of the infrastructure used by OpenCGA. When a file is uploaded to the system, it is stored in:
- A filesystem for archiving purposes. This filesystem could be UNIX or Hadoop-based
- A database for interactive queries. We plan to support MongoDB and HBase databases
...
Technical Documentation Overview
At this section you can find some useful links and information for researchers and software developers who are planning to deploy and/or integrate OpenCGA services with their software applications and tools. These are working documents:
- Data models : Describes data models for representing Variant and alignment data
- Architecture : Describes the technologies and architecture of OpenCGA and some other implementation details
- Storage implementation : Describes how the data models are mapped to the different database backends (Mongo and HBase)
- Releases and Roadmap : Do you want to know what's coming next?
- Download and install : Please have a look at the [README file](https://github.com/opencb/opencga/blob/develop/README.md) in the repository
Getting Involved
...