Skip to main content

Table 3 Synthetic paired-end Illumina sequencing datasets simulated using ART

From: A comparative study of k-spectrum-based error correction methods for next-generation sequencing data analysis

Organism (dataset ID)

Accession number of reference genome assembly

ART simulation parameter

Genome size (MB)

Read length (bp)

Genome coverage

Fragment/insert size

Error rate (%)

Escherichia coli (EC-1)

GCF_000005845.2 (ASM584v2)

36

70×

200

0.866

4.6

Escherichia coli (EC-2)

GCF_000005845.2 (ASM584v2)

36

20×

200

0.866

4.6

Escherichia coli (EC-3)

GCF_000005845.2 (ASM584v2)

100

20×

200

0.952

4.6

Bacillus cereus (BC-1)

GCF_000007825.1 (ASM782v1)

56

50×

200

0.175

5.4

Bacillus cereus (BC-2)

GCF_000007825.1 (ASM782v1)

100

120×

300

0.109

5.4

Drosophila melanogaster (DM)

GCF_000001215.4 (Release 6)

100

10×

300

0.854

143