Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.



Platform

Components version

ComponentVersion
HDI3.6
HDP2.6.5.3008-11
Hadoop2.7.3.2.6.5.3008-11
HBase1.1.2.2.6.5.3008-11

Storage

Azure Data Lake Storage Gen2

Nodes


NodesCoresMemory(GB)Type
Hadoop Master3428Standard_D12_V2
Hadoop Worker10856Standard_DS13_V2
Loading Queue20416Standard_D4s_v3

Genomic Data 


Loading Performance


Code Block
collapsetrue

#Files   Day Hour
10 2019-07-10 19
4 2019-07-10 20
15 2019-07-10 21
30 2019-07-10 22
26 2019-07-10 23
28 2019-07-11 00
25 2019-07-11 01
23 2019-07-11 02
25 2019-07-11 03
28 2019-07-11 04
30 2019-07-11 05
30 2019-07-11 06
31 2019-07-11 07
30 2019-07-11 08
28 2019-07-11 09
28 2019-07-11 10
30 2019-07-11 11
32 2019-07-11 12
27 2019-07-11 13
27 2019-07-11 14
45 2019-07-11 15
14 2019-07-11 16
29 2019-07-11 17
35 2019-07-11 18
39 2019-07-11 19
11 2019-07-11 20


Operations

First batch of 700 files

74.096.015 variants

Aggregate

Prepare:    529.303s [ 00:08:49 ]

Aggregate: 9591.626s [ 02:39:52 ]

Write:     7012.733s [ 01:56:53 ]  -> Size : 59.5 GiB

Stats

1352.675s [ 00:22:33 ]

Annotate

Prepare:       722.327s [ 00:12:02 ]

Annot:       50384.383s [ 13:59:44 ]

Load:       28204.666s [ 07:44:04 ]

SampleIndex:  12403.542s [ 03:26:44 ]


Analysis Benchmark

Query and Aggregation Stats


Stats


GWAS


Clinical Analysis




Table of Contents:

Table of Contents
indent20px