Skip to main content


Table 4 Description of programs designed for pooled samples.

From: A comprehensive literature review of haplotyping software and methods for use with unrelated individuals

Program name Algorithm Outputa Missing datab Assumptionsc Key features Limitations Pool Size, MAX #Loci, Type Platform Ref.d
Pools2 Clark's/EM HF/HA N/A None Haplotype-tagging SNPs Computationally slow Pools of 2 individuals, practical limit, biallelic PC [117]
      Accommodates a large number of SNPs Need to re-calculate several times to assure consistent results    
       EM issues    
LDPooled EM HF/HA No HWE Calculates LD LD impacts performance Based on pools of 4 individuals, practical limit, biallelic * [96]
      SNPs or microsatellites EM issues    
EHP.R EM HF Yes HWE Tests haplotype-disease association Variance increases with pool size, weaker LD and # loci Pools of 4 individuals, practical limit, biallelic PC/UNIX [98]
      Assessment of haplotype frequency estimate accuracy EM issues    
      Handles different types of missing data Requires knowledge of S-Plus 6.0 or R    
  1. a Program haplotype output, individual assignment, frequency estimates or both.
  2. b Ability of program to accept missing data.
  3. c Program assumptions.
  4. d List of references.
  5. *Could not determine from available data.
  6. EM: Expectation maximisation algorithm; EM issues: May be sensitive to HWE departures, long run times, and non-global max (requiring multiple restarts); HF: Haplotype frequency estimate; HA: Individual haplotype assignment; HWE: Hardy-Weinberg equilibrium; LD: Linkage disequilibrium; PC: IBM compatible personal computer; UNIX: Runs on Unix operating system, including Linux, FORTRAN, Solaris and others.