首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 421 毫秒
1.
Haplotype tagging for the identification of common disease genes   总被引:61,自引:0,他引:61  
Genome-wide linkage disequilibrium (LD) mapping of common disease genes could be more powerful than linkage analysis if the appropriate density of polymorphic markers were known and if the genotyping effort and cost of producing such an LD map could be reduced. Although different metrics that measure the extent of LD have been evaluated, even the most recent studies have not placed significant emphasis on the most informative and cost-effective method of LD mapping-that based on haplotypes. We have scanned 135 kb of DNA from nine genes, genotyped 122 single-nucleotide polymorphisms (SNPs; approximately 184,000 genotypes) and determined the common haplotypes in a minimum of 384 European individuals for each gene. Here we show how knowledge of the common haplotypes and the SNPs that tag them can be used to (i) explain the often complex patterns of LD between adjacent markers, (ii) reduce genotyping significantly (in this case from 122 to 34 SNPs), (iii) scan the common variation of a gene sensitively and comprehensively and (iv) provide key fine-mapping data within regions of strong LD. Our results also indicate that, at least for the genes studied here, the current version of dbSNP would have been of limited utility for LD mapping because many common haplotypes could not be defined. A directed re-sequencing effort of the approximately 10% of the genome in or near genes in the major ethnic groups would aid the systematic evaluation of the common variant model of common disease.  相似文献   

2.
Humans show great variation in phenotypic traits such as height, eye color and susceptibility to disease. Genomic DNA sequence differences among individuals are responsible for the inherited components of these complex traits. Reports suggest that intermediate and large-scale DNA copy number and structural variations are prevalent enough to be an important source of genetic variation between individuals. Because association studies to identify genomic loci associated with particular phenotypic traits have focused primarily on genotyping SNPs, it is important to determine whether common structural polymorphisms are in linkage disequilibrium with common SNPs, and thus can be assessed indirectly in SNP-based studies. Here we examine 100 deletion polymorphisms ranging from 70 bp to 7 kb. We show that common deletions and SNPs ascertained with similar criteria have essentially the same distribution of linkage disequilibrium with surrounding SNPs, indicating that these polymorphisms may share evolutionary history and that most deletion polymorphisms are effectively assayed by proxy in SNP-based association studies.  相似文献   

3.
Genetic mapping with SNP markers in Drosophila.   总被引:10,自引:0,他引:10  
Map-based positional cloning of Drosophila melanogaster genes is hampered by both the time-consuming, error-prone nature of traditional methods for genetic mapping and the difficulties in aligning the genetic and cytological maps with the genome sequence. The identification of sequence polymorphisms in the Drosophila genome will make it possible to map mutations directly to the genome sequence with high accuracy and resolution. Here we report the identification of 7,223 single-nucleotide polymorphisms (SNPs) and 1,392 insertions/deletions (InDels) in common laboratory strains of Drosophila. These sequence polymorphisms define a map of 787 autosomal marker loci with a resolution of 114 kb. We have established PCR product-length polymorphism (PLP) or restriction fragment-length polymorphism (RFLP) assays for 215 of these markers. We demonstrate the use of this map by delimiting two mutations to intervals of 169 kb and 307 kb, respectively. Using a local high-density SNP map, we also mapped a third mutation to a resolution of approximately 2 kb, sufficient to localize the mutation within a single gene. These methods should accelerate the rate of positional cloning in Drosophila.  相似文献   

4.
Genome-wide mapping with biallelic markers in Arabidopsis thaliana.   总被引:17,自引:0,他引:17  
Single-nucleotide polymorphisms, as well as small insertions and deletions (here referred to collectively as simple nucleotide polymorphisms, or SNPs), comprise the largest set of sequence variants in most organisms. Positional cloning based on SNPs may accelerate the identification of human disease traits and a range of biologically informative mutations. The recent application of high-density oligonucleotide arrays to allele identification has made it feasible to genotype thousands of biallelic SNPs in a single experiment. It has yet to be established, however, whether SNP detection using oligonucleotide arrays can be used to accelerate the mapping of traits in diploid genomes. The cruciferous weed Arabidopsis thaliana is an attractive model system for the construction and use of biallelic SNP maps. Although important biological processes ranging from fertilization and cell fate determination to disease resistance have been modelled in A. thaliana, identifying mutations in this organism has been impeded by the lack of a high-density genetic map consisting of easily genotyped DNA markers. We report here the construction of a biallelic genetic map in A. thaliana with a resolution of 3.5 cM and its use in mapping Eds16, a gene involved in the defence response to the fungal pathogen Erysiphe orontii. Mapping of this trait involved the high-throughput generation of meiotic maps of F2 individuals using high-density oligonucleotide probe array-based genotyping. We developed a software package called InterMap and used it to automatically delimit Eds16 to a 7-cM interval on chromosome 1. These results are the first demonstration of biallelic mapping in diploid genomes and establish means for generalizing SNP-based maps to virtually any genetic organism.  相似文献   

5.
Complex SNP-related sequence variation in segmental genome duplications   总被引:23,自引:0,他引:23  
There is uncertainty about the true nature of predicted single-nucleotide polymorphisms (SNPs) in segmental duplications (duplicons) and whether these markers genuinely exist at increased density as indicated in public databases. We explored these issues by genotyping 157 predicted SNPs in duplicons and control regions in normal diploid genomes and fully homozygous complete hydatidiform moles. Our data identified many true SNPs in duplicon regions and few paralogous sequence variants. Twenty-eight percent of the polymorphic duplicon sequences we tested involved multisite variation, a new type of polymorphism representing the sum of the signals from many individual duplicon copies that vary in sequence content due to duplication, deletion or gene conversion. Multisite variations can masquerade as normal SNPs when genotyped. Given that duplicons comprise at least 5% of the genome and many are yet to be annotated in the genome draft, effective strategies to identify multisite variation must be established and deployed.  相似文献   

6.
High-resolution genetic analysis of the human genome promises to provide insight into common disease susceptibility. To perform such analysis will require a collection of high-throughput, high-density analysis reagents. We have developed a polymorphism detection system that uses public-domain sequence data. This detection system is called the single nucleotide polymorphism pipeline (SNPpipeline). The analytic core of the SNPpipeline is composed of three components: PHRED, PHRAP and DEMIGLACE. PHRED and PHRAP are components of a sequence analysis suite developed to perform the semi-automated analysis required for large-scale genomes (provided courtesy of P. Green). Using these informatics tools, which examine redundant raw expressed sequence tag (EST) data, we have identified more than 3,000 candidate single-nucleotide polymorphisms (SNPs). Empiric validation studies of a set of 192 candidates indicate that 82% identify variation in a sample of ten Centre d'Etudes Polymorphism Humain (CEPH) individuals. Our results suggest that existing sequence resources may serve as a valuable source for identifying genetic variation.  相似文献   

7.
Most human sequence variation is in the form of single-nucleotide polymorphisms (SNPs). It has been proposed that coding-region SNPs (cSNPs) be used for direct association studies to determine the genetic basis of complex traits. The success of such studies depends on the frequency of disease-associated alleles, and their distribution in different ethnic populations. If disease-associated alleles are frequent in most populations, then direct genotyping of candidate variants could show robust associations in manageable study samples. This approach is less feasible if the genetic risk from a given candidate gene is due to many infrequent alleles. Previous studies of several genes demonstrated that most variants are relatively infrequent (<0.05). These surveys genotyped small samples (n<75) and thus had limited ability to identify rare alleles. Here we evaluate the prevalence and distribution of such rare alleles by genotyping an ethnically diverse reference sample that is more than six times larger than those used in previous studies (n=450). We screened for variants in the complete coding sequence and intron-exon junctions of two candidate genes for neuropsychiatric phenotypes: SLC6A4, encoding the serotonin transporter; and SLC18A2, encoding the vesicular monoamine transporter. Both genes have unique roles in neuronal transmission, and variants in either gene might be associated with neurobehavioral phenotypes.  相似文献   

8.
Single nucleotide polymorphisms (SNPs) are valuable genetic markers of human disease. They also comprise the highest potential density marker set available for mapping experimentally derived mutations in model organisms such as Caenorhabditis elegans. To facilitate the positional cloning of mutations we have identified polymorphisms in CB4856, an isolate from a Hawaiian island that shows a uniformly high density of polymorphisms compared with the reference Bristol N2 strain. Based on 5.4 Mbp of aligned sequences, we predicted 6,222 polymorphisms. Furthermore, 3,457 of these markers modify restriction enzyme recognition sites ('snip-SNPs') and are therefore easily detected as RFLPs. Of these, 493 were experimentally confirmed by restriction digest to produce a snip-SNP map of the worm genome. A mapping strategy using snip-SNPs and bulked segregant analysis (BSA) is outlined. CB4856 is crossed into a mutant strain, and exclusion of CB4856 alleles of a subset of snip-SNPs in mutant progeny is assessed with BSA. The proximity of a linked marker to the mutation is estimated by the relative proportion of each form of the biallelic marker in populations of wildtype and mutant genomes. The usefulness of this approach is illustrated by the rapid mapping of the dyf-5 gene.  相似文献   

9.
Genome-wide patterns of genetic variation among elite maize inbred lines   总被引:6,自引:0,他引:6  
Lai J  Li R  Xu X  Jin W  Xu M  Zhao H  Xiang Z  Song W  Ying K  Zhang M  Jiao Y  Ni P  Zhang J  Li D  Guo X  Ye K  Jian M  Wang B  Zheng H  Liang H  Zhang X  Wang S  Chen S  Li J  Fu Y  Springer NM  Yang H  Wang J  Dai J  Schnable PS  Wang J 《Nature genetics》2010,42(11):1027-1030
We have resequenced a group of six elite maize inbred lines, including the parents of the most productive commercial hybrid in China. This effort uncovered more than 1,000,000 SNPs, 30,000 indel polymorphisms and 101 low-sequence-diversity chromosomal intervals in the maize genome. We also identified several hundred complete genes that show presence/absence variation among these resequenced lines. We discuss the potential roles of complementation of presence/absence variations and other deleterious mutations in contributing to heterosis. High-density SNP and indel polymorphism markers reported here are expected to be a valuable resource for future genetic studies and the molecular breeding of this important crop.  相似文献   

10.
Radiation hybrid map of the mouse genome.   总被引:13,自引:0,他引:13  
Radiation hybrid (RH) maps are a useful tool for genome analysis, providing a direct method for localizing genes and anchoring physical maps and genomic sequence along chromosomes. The construction of a comprehensive RH map for the human genome has resulted in gene maps reflecting the location of more than 30,000 human genes. Here we report the first comprehensive RH map of the mouse genome. The map contains 2,486 loci screened against an RH panel of 93 cell lines. Most loci (93%) are simple sequence length polymorphisms (SSLPs) taken from the mouse genetic map, thereby providing direct integration between these two key maps. We performed RH mapping by a new and efficient approach in which we replaced traditional gel- or hybridization-based assays by a homogeneous 5'-nuclease assays involving a single common probe for all genetic markers. The map provides essentially complete connectivity and coverage across the genome, and good resolution for ordering loci, with 1 centiRay (cR) corresponding to an average of approximately 100 kb. The RH map, together with an accompanying World-Wide Web server, makes it possible for any investigator to rapidly localize sequences in the mouse genome. Together with the previously constructed genetic map and a YAC-based physical map reported in a companion paper, the fundamental maps required for mouse genomics are now available.  相似文献   

11.
Dissecting the genetic basis of disease risk requires measuring all forms of genetic variation, including SNPs and copy number variants (CNVs), and is enabled by accurate maps of their locations, frequencies and population-genetic properties. We designed a hybrid genotyping array (Affymetrix SNP 6.0) to simultaneously measure 906,600 SNPs and copy number at 1.8 million genomic locations. By characterizing 270 HapMap samples, we developed a map of human CNV (at 2-kb breakpoint resolution) informed by integer genotypes for 1,320 copy number polymorphisms (CNPs) that segregate at an allele frequency >1%. More than 80% of the sequence in previously reported CNV regions fell outside our estimated CNV boundaries, indicating that large (>100 kb) CNVs affect much less of the genome than initially reported. Approximately 80% of observed copy number differences between pairs of individuals were due to common CNPs with an allele frequency >5%, and more than 99% derived from inheritance rather than new mutation. Most common, diallelic CNPs were in strong linkage disequilibrium with SNPs, and most low-frequency CNVs segregated on specific SNP haplotypes.  相似文献   

12.
Recombination and linkage disequilibrium in Arabidopsis thaliana   总被引:4,自引:0,他引:4  
Linkage disequilibrium (LD) is a major aspect of the organization of genetic variation in natural populations. Here we describe the genome-wide pattern of LD in a sample of 19 Arabidopsis thaliana accessions using 341,602 non-singleton SNPs. LD decays within 10 kb on average, considerably faster than previously estimated. Tag SNP selection algorithms and 'hide-the-SNP' simulations suggest that genome-wide association mapping will require only 40%-50% of the observed SNPs, a reduction similar to estimates in a sample of African Americans. An Affymetrix genotyping array containing 250,000 SNPs has been designed based on these results; we demonstrate that it should have more than adequate coverage for genome-wide association mapping. The extent of LD is highly variable, and we find clear evidence of recombination hotspots, which seem to occur preferentially in intergenic regions. LD also reflects the action of selection, and it is more extensive between nonsynonymous polymorphisms than between synonymous polymorphisms.  相似文献   

13.
Genealogies of mouse inbred strains   总被引:1,自引:0,他引:1  
The mouse is a prime organism of choice for modelling human disease. Over 450 inbred strains of mice have been described, providing a wealth of different genotypes and phenotypes for genetic and other studies. As new strains are generated and others become extinct, it is useful to review periodically what strains are available and how they are related to each other, particularly in the light of available DNA polymorphism data from microsatellite and other markers. We describe the origins and relationships of inbred mouse strains, 90 years after the generation of the first inbred strain. Given the large collection of inbred strains available, and that published information on these strains is incomplete, we propose that all genealogical and genetic data on inbred strains be submitted to a common electronic database to ensure this valuable information resource is preserved and used efficiently.  相似文献   

14.
High-resolution haplotype structure in the human genome   总被引:41,自引:0,他引:41  
Linkage disequilibrium (LD) analysis is traditionally based on individual genetic markers and often yields an erratic, non-monotonic picture, because the power to detect allelic associations depends on specific properties of each marker, such as frequency and population history. Ideally, LD analysis should be based directly on the underlying haplotype structure of the human genome, but this structure has remained poorly understood. Here we report a high-resolution analysis of the haplotype structure across 500 kilobases on chromosome 5q31 using 103 single-nucleotide polymorphisms (SNPs) in a European-derived population. The results show a picture of discrete haplotype blocks (of tens to hundreds of kilobases), each with limited diversity punctuated by apparent sites of recombination. In addition, we develop an analytical model for LD mapping based on such haplotype blocks. If our observed structure is general (and published data suggest that it may be), it offers a coherent framework for creating a haplotype map of the human genome.  相似文献   

15.
We have used large-scale insertional mutagenesis to identify functional landmarks relevant to cancer in the recently completed mouse genome sequence. We infected Cdkn2a(-/-) mice with Moloney murine leukemia virus (MoMuLV) to screen for loci that can participate in tumorigenesis in collaboration with loss of the Cdkn2a-encoded tumor suppressors p16INK4a and p19ARF. Insertional mutagenesis by the latent retrovirus was synergistic with loss of Cdkn2a expression, as indicated by a marked acceleration in the development of both myeloid and lymphoid tumors. We isolated 747 unique sequences flanking retroviral integration sites and mapped them against the mouse genome sequence databases from Celera and Ensembl. In addition to 17 insertions targeting gene loci known to be cancer-related, we identified a total of 37 new common insertion sites (CISs), of which 8 encode components of signaling pathways that are involved in cancer. The effectiveness of large-scale insertional mutagenesis in a sensitized genetic background is demonstrated by the preference for activation of MAP kinase signaling, collaborating with Cdkn2a loss in generating the lymphoid and myeloid tumors. Collectively, our results show that large-scale retroviral insertional mutagenesis in genetically predisposed mice is useful both as a system for identifying genes underlying cancer and as a genetic framework for the assignment of such genes to specific oncogenic pathways.  相似文献   

16.
Emerging technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously, enabling whole-genome association studies. Using empirical genotype data from the International HapMap Project, we evaluate the extent to which the sets of SNPs contained on three whole-genome genotyping arrays capture common SNPs across the genome, and we find that the majority of common SNPs are well captured by these products either directly or through linkage disequilibrium. We explore analytical strategies that use HapMap data to improve power of association studies conducted with these fixed sets of markers and show that limited inclusion of specific haplotype tests in association analysis can increase the fraction of common variants captured by 25-100%. Finally, we introduce a Bayesian approach to association analysis by weighting the likelihood of each statistical test to reflect the number of putative causal alleles to which it is correlated.  相似文献   

17.
Linkage disequilibrium (LD) mapping provides a powerful method for fine-structure localization of rare disease genes, but has not yet been widely applied to common disease. We sought to design a systematic approach for LD mapping and apply it to the localization of a gene (IBD5) conferring susceptibility to Crohn disease. The key issues are: (i) to detect a significant LD signal (ii) to rigorously bound the critical region and (iii) to identify the causal genetic variant within this region. We previously mapped the IBD5 locus to a large region spanning 18 cM of chromosome 5q31 (P<10(-4)). Using dense genetic maps of microsatellite markers and single-nucleotide polymorphisms (SNPs) across the entire region, we found strong evidence of LD. We bound the region to a common haplotype spanning 250 kb that shows strong association with the disease (P< 2 x 10(-7)) and contains the cytokine gene cluster. This finding provides overwhelming evidence that a specific common haplotype of the cytokine region in 5q31 confers susceptibility to Crohn disease. However, genetic evidence alone is not sufficient to identify the causal mutation within this region, as strong LD across the region results in multiple SNPs having equivalent genetic evidence-each consistent with the expected properties of the IBD5 locus. These results have important implications for Crohn disease in particular and LD mapping in general.  相似文献   

18.
Engineering a mouse balancer chromosome.   总被引:15,自引:0,他引:15  
Balancer chromosomes are genetic reagents that are used in Drosophila melanogaster for stock maintenance and mutagenesis screens. Despite their utility, balancer chromosomes are rarely used in mice because they are difficult to generate using conventional methods. Here we describe the engineering of a mouse balancer chromosome with the Cre-loxP recombination system. The chromosome features a 24-centiMorgan (cM) inversion between Trp53 (also known as p53) and Wnt3 on mouse chromosome 11 that is recessive lethal and dominantly marked with a K14-Agouti transgene. When allelic to a wild-type chromosome, the inversion suppresses crossing over in the inversion interval, accompanied by elevated recombination in the flanking regions. The inversion functions as a balancer chromosome because it can be used to maintain a lethal mutation in the inversion interval as a self-sustaining trans-heterozygous stock. This strategy can be used to generate similar genetic reagents throughout the mouse genome. Engineering of visibly marked inversions and deficiencies is an important step toward functional analyses of the mouse genome and will facilitate large-scale mutagenesis programs.  相似文献   

19.
Genome-wide association studies involving hundreds of thousands of SNPs in thousands of cases and controls are now underway. The first of many analytical challenges in these studies involves the choice of SNPs to genotype. It is not practical to construct a different panel of tag SNPs for each study, so the first generation of genome-wide scans will use predefined, commercially available marker panels, which will in part dictate their success or failure. We compare different approaches in use today, and show that although many of them provide substantial coverage of common variation in non-African populations, the precise extent is strongly dependent on the frequencies of alleles of interest and on specific considerations of study design. Overall, despite substantial differences in genotyping technologies, marker selection strategies and number of markers assayed, the first-generation high-throughput platforms all offer similar levels of genome coverage.  相似文献   

20.
A radiation hybrid map of the zebrafish genome.   总被引:12,自引:0,他引:12  
Recent large-scale mutagenesis screens have made the zebrafish the first vertebrate organism to allow a forward genetic approach to the discovery of developmental control genes. Mutations can be cloned positionally, or placed on a simple sequence length polymorphism (SSLP) map to match them with mapped candidate genes and expressed sequence tags (ESTs). To facilitate the mapping of candidate genes and to increase the density of markers available for positional cloning, we have created a radiation hybrid (RH) map of the zebrafish genome. This technique is based on somatic cell hybrid lines produced by fusion of lethally irradiated cells of the species of interest with a rodent cell line. Random fragments of the donor chromosomes are integrated into recipient chromosomes or retained as separate minichromosomes. The radiation-induced breakpoints can be used for mapping in a manner analogous to genetic mapping, but at higher resolution and without a need for polymorphism. Genome-wide maps exist for the human, based on three RH panels of different resolutions, as well as for the dog, rat and mouse. For our map of the zebrafish genome, we used an existing RH panel and 1,451 sequence tagged site (STS) markers, including SSLPs, cloned candidate genes and ESTs. Of these, 1,275 (87.9%) have significant linkage to at least one other marker. The fraction of ESTs with significant linkage, which can be used as an estimate of map coverage, is 81.9%. We found the average marker retention frequency to be 18.4%. One cR3000 is equivalent to 61 kb, resulting in a potential resolution of approximately 350 kb.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号