Skip to main content

Table 1 The overlap-graph reduction results with the metagenomics dataset

From: Using Apache Spark on genome assembly for scalable overlap-graph reduction

Algorithm

Size

#EDGE (before)

#EDGE (after)

TIME

TER

Quarter

217,002,504

12,482,946

0.57

 

Half

434,005,009

23,818,401

0.80

 

Full

868,010,019

57,363,515

1.37

DER-CEC

Quarter

12,482,946

469,130

0.13

 

Half

23,818,401

763,474

0.23

 

Full

57,363,515

2,341,610

0.40

  1. #EDGE denotes the number of edges of the graph and TIME the running time (hours) for the computation