The first step in creating a CellBase instance is to download the data files. This can be done through the CellBase CLI:
cellbase/build/bin$ ./cellbase.sh download --data genome,gene
The `--data` argument is required and takes a comma-separated list of data types to download. See the table below for the full list.
Type | Data sources |
---|---|
genome | |
gene | |
variation ** | |
variation_functional_score | |
regulation | |
protein | |
conservation ** | |
clinical_variants ** | |
repeats | |
svs | |
all ** | Downloads all of the above |
See Download Sources for details on versions and available organisms.
** Please note that many files are very large and can take several hours to download.
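Because some downloads take hours, it can help to catch a mistyped data type before starting one. The following is a minimal sketch, not part of CellBase: `validate_data_types` and `VALID_TYPES` are hypothetical names, and the list of accepted values is simply copied from the table above.

```shell
# Hypothetical helper (not part of CellBase): checks that every entry in a
# comma-separated list is one of the data types named in the table above.
VALID_TYPES="genome gene variation variation_functional_score regulation protein conservation clinical_variants repeats svs all"

# The body runs in a subshell, so the IFS and globbing changes below
# do not leak into the calling shell.
validate_data_types() (
    # $1: comma-separated list, e.g. "genome,gene"
    [ -n "$1" ] || { echo "no data types given" >&2; exit 1; }

    IFS=','     # split the argument on commas
    set -f      # disable filename globbing while splitting
    set -- $1
    IFS=' '     # VALID_TYPES is space-separated

    for item in "$@"; do
        found=0
        for valid in $VALID_TYPES; do
            if [ "$item" = "$valid" ]; then
                found=1
                break
            fi
        done
        if [ "$found" -eq 0 ]; then
            echo "unknown data type: '$item'" >&2
            exit 1
        fi
    done
    exit 0
)

# Example: only launch the download when the list is valid.
# validate_data_types "genome,gene" && ./cellbase.sh download --data genome,gene
```

The check is purely local: it does not contact any data source and says nothing about which organisms or versions are available for a given type.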
For example, to download all human (GRCh37) data from all sources and save it into the `/tmp/data/cellbase/v4/` directory, run:
cellbase/build/bin$ ./cellbase-admin.sh download -a GRCh37 --common /tmp/data/cellbase/v4/common/ -d all -o /tmp/data/cellbase/v4/ -s hsapiens
If the download was successful, you can proceed to building the JSON objects that will be loaded into the corresponding database: Building the CellBase database
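One way to decide whether a download succeeded before moving on is to check both the command's exit status and the contents of the output directory. The sketch below rests on two assumptions that the text above does not confirm: that the CLI exits with a non-zero status on failure, and that a successful download leaves at least one file under the output directory. `run_download` is a hypothetical wrapper, not a CellBase command.

```shell
# Hypothetical wrapper (not part of CellBase).
# Usage: run_download OUTPUT_DIR COMMAND [ARGS...]
run_download() {
    out_dir=$1
    shift

    # Assumption: the command exits non-zero when the download fails.
    "$@" || { echo "download command failed" >&2; return 1; }

    # Assumption: a successful download leaves at least one regular file
    # somewhere under the output directory.
    if [ -z "$(find "$out_dir" -type f 2>/dev/null | head -n 1)" ]; then
        echo "no files found in $out_dir" >&2
        return 1
    fi

    echo "download finished: $out_dir"
}

# Example, using the GRCh37 command shown earlier:
# run_download /tmp/data/cellbase/v4/ \
#     ./cellbase-admin.sh download -a GRCh37 --common /tmp/data/cellbase/v4/common/ \
#     -d all -o /tmp/data/cellbase/v4/ -s hsapiens
```

This only detects a missing or empty result. It does not verify that individual files are complete.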