Worldwide patterns of haplotype diversity at 9p21.3, a locus associated with type 2 diabetes and coronary heart disease
1 Institute of Molecular Medicine FIMM, University of Helsinki, and Unit of Public Health Genomics, National Institute for Health and Welfare, Tukholmankatu 8, 00290 Helsinki, Finland
2 Department of Genetics, Stanford University School of Medicine, 300 Pasteur Drive, Stanford, CA 94305, USA
3 Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853-2703, USA
4 Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
5 MRC CAiTE Centre, University of Bristol, Oakfield Grove, Clifton, Bristol BS8 2BN, UK
Genome Medicine 2009, 1:51 doi:10.1186/gm51Published: 12 May 2009
A 100 kb region on 9p21.3 harbors two major disease susceptibility loci: one for type 2 diabetes (T2D) and one for coronary heart disease (CHD). The single nucleotide polymorphisms (SNPs) associated with these two diseases in Europeans reside on two adjacent haplotype blocks with independent effects on disease. To help delimit the regions that likely harbor the disease-causing variants in populations of non-European origin, we studied the haplotype diversity and allelic history of the 9p21.3 region using 938 unrelated individuals from 51 populations (Human Genome Diversity Panel). We used SNP data from Illumina's 650Y SNP arrays supplemented with five additional SNPs within the region of interest. Haplotype frequencies were analyzed with the EM algorithm implemented in PLINK. For the T2D locus, the TT risk haplotype of SNPs rs10811661 and rs10757283 was present at similar frequencies in all global populations, while a shared 6-SNP haplotype that carries the protective C allele of rs10811661 was found at a frequency of 2.9% in Africans and 41.3% in East Asians and was associated with low haplotype diversity. For the CHD locus, all populations shared a core risk haplotype spanning >17.5 kb, which shows dramatic increase in frequency between African (11.5%) and Middle Eastern (63.7%) populations. Interestingly, two SNPs (rs2891168 and rs10757278) tagging this CHD risk haplotype are most strongly associated with CHD disease status according to independent clinical fine-mapping studies. The large variation in linkage disequilibrium patterns identified between the populations demonstrates the importance of allelic background data when selecting SNPs for replication in global populations. Intriguingly, the protective allele for T2D and the risk allele for CHD show an increase in frequency in non-Africans compared to Africans, implying different population histories for these two adjacent disease loci.