The program BIMBAM implements methods for assocation mapping, based on those described in
Servin, B and Stephens, M (2007). Imputation-based analysis of association studies: candidate genes and quantitative traits. PLoS Genetics, 2007.
BIMBAM can handle both large association studies (e.g., genome scans) and smaller studies of candidate genes/regions.
The software is distributed under the GNU Public License (GPL). To register and download, click here. Instructions are available here
The program fastPHASE implements methods described in
Scheet, P and Stephens, M (2006). A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet
fastPHASE can handle larger data-sets than PHASE (e.g., hundreds of thousands of markers in thousands of individuals), but does not provide estimates of recombination rates. Our experiments suggest that haplotype estimates are slightly less accurate than from PHASE, but missing genotype estimates appear to be similar or even slightly better than PHASE.
The software is free for non-commercial use, and may be licensed for commercial use. To view the terms and conditions, and then proceed to download, click here.
The program PHASE implements methods for estimating haplotypes from population genotype data described in
Stephens, M., and Donnelly, P. (2003). A comparison of Bayesian methods for haplotype reconstruction from population genotype data. American Journal of Human Genetics, 73:1162-1169.
Stephens, M., Smith, N., and Donnelly, P. (2001). A new statistical method for haplotype reconstruction from population data. American Journal of Human Genetics, 68, 978--989.
Stephens, M., and Scheet, P. (2005). Accounting for Decay of Linkage Disequilibrium in Haplotype Inference and Missing-Data Imputation. American Journal of Human Genetics, 76:449-462.
The software also incorporates methods for estimating recombination rates, and identifying recombination hotspots:
Crawford et al (2004). Evidence for substantial fine-scale variation in recombination rates across the human genome. Nature Genetics,.
The software is free for non-commercial use, and may be licensed for commercial use. To view the terms and conditions, and then proceed to download, click here.
Instructions for PHASE are included on the download site, or are also available here.
The program SCAT (Smoothed and Continuous AssignmenTs) implements a Bayesian statistical method for estimating allele frequencies and assigning samples of unknown (or known) origin across a continuous range of locations, based on genotypes collected at distinct sampling locations. In brief, the idea is to assume that allele frequencies vary smoothly in the study region, so allele frequencies are estimated at any given location using observed genotypes at near-by sampling locations, with data at the nearest sampling locations being given greatest weight. Details are given in
S K Wasser, A M Shedlock, K Comstock, E A Ostrander, B Mutayoba, and M Stephens. Assigning African elephant DNA to geographic region of origin: applications to the ivory trade. Proc Natl Acad Sci U S A, 41:14844-14852, 2004.
SCAT is available here.
N Li and M Stephens. Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data. Genetics, 165(4)2213-2233, 2003.
It is available free from here.