首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
CpG islands, genes and isochores in the genomes of vertebrates   总被引:6,自引:0,他引:6  
B A?ssani  G Bernardi 《Gene》1991,106(2):185-195
We have shown that human genes associated with CpG islands increase in number as they increase in % of guanine + cytosine (GC) levels, and that most genes associated with CpG islands are located in the GC-richest compartment of the human genome. This is an independent confirmation of the concentration gradient of CpG islands (detected as HpaII tiny fragments, or HTF) which was demonstrated in the genome of warm-blooded vertebrates [A?ssani and Bernardi, Gene 106 (1991) 173-183]. We then reassessed the location of CpG islands using the data currently available and confirmed that CpG islands are most frequently located in the 5'-flanking sequences of genes and that they overlap genes to variable extents. We have shown that such extents increase with the increasing GC levels of genes, the GC-richest genes being completely included in CpG islands. Under such circumstances, we have investigated the properties of the 'extragenic' CpG islands located in the 5'-flanking segments of homologous genes from both warm- and cold-blooded vertebrates. We have confirmed that, in cold-blooded vertebrates, CpG islands are often absent; when present, they have lower GC and CpG levels; the latter attain, however, statistically expected values. Finally, we have shown that CpG doublets increase with the increasing GC of exons, introns and intergenic sequences (including 'extragenic' CpG islands) in the genomes from both warm- and cold-blooded vertebrates. The correlations found are the same for both classes of vertebrates, and are similar for exons, introns and intergenic sequences (including 'extragenic' CpG islands). The findings just outlined indicate that the origin and evolution of CpG islands in the vertebrate genome are associated with compositional transitions (GC increases) in genes and isochores.  相似文献   

2.
CpG islands in vertebrate genomes   总被引:120,自引:0,他引:120  
  相似文献   

3.
Subirana JA  Anokian E 《Gene》2011,473(2):76-81
A very simple new program is presented (G-SQUARES). It is useful in order to visualize the composition and basic structural features of whole genomes and selected chromosome regions. The frequency of all dimer and tetramer sequences is reported. Overall structural features are calculated, such as the tendency for alternation. A direct visual comparison among different sequences is easily available. Furthermore, the features which are visualized indicate further studies which should be carried out. Examples are presented on Alu sequences, CpG islands, whole eukaryotic and bacterial genomes.  相似文献   

4.
CpG islands (CGIs) are CpG-rich regions compared to CpG-depleted bulk DNA of mammalian genomes and are generally regarded as the epigenetic regulatory regions in association with unmethylation, promoter activity and histone modifications. Accurate identification of CpG islands with epigenetic regulatory function in bulk genomes is of wide interest. Here, the common features of functional CGIs are identified using an average mutual information method to differentiate functional CGIs from the remaining CGIs. A new approach (CpG mutual information, CpG_MI) was further explored to identify functional CGIs based on the cumulative mutual information of physical distances between two neighboring CpGs. Compared to current approaches, CpG_MI achieved the highest prediction accuracy. This approach also identified new functional CGIs overlapping with gene promoter regions which were missed by other algorithms. Nearly all CGIs identified by CpG_MI overlapped with histone modification marks. CpG_MI could also be used to identify potential functional CGIs in other mammalian genomes, as the CpG dinucleotide contents and cumulative mutual information distributions are almost the same among six mammalian genomes in our analysis. It is a reliable quantitative tool for the identification of functional CGIs from bulk genomes and helps in understanding the relationships between genomic functional elements and epigenomic modifications.  相似文献   

5.
CpG islands (CGIs) are often considered as gene markers, but the number of CGIs varies among mammalian genomes that have similar numbers of genes. In this study, we investigated the distribution of CGIs in the promoter regions of 3,197 human-mouse orthologous gene pairs and found that the mouse genome has notably fewer CGIs in the promoter regions and less pronounced CGI characteristics than does the human genome. We further inferred CGI's ancestral state using the dog genome as a reference and examined the nucleotide substitution pattern and the mutational direction in the conserved regions of human and mouse CGIs. The results reveal many losses of CGIs in both genomes but the loss rate in the mouse lineage is two to four times the rate in the human lineage. We found an intriguing feature of CGI loss, namely that the loss of a CGI usually starts from erosion at the both edges and gradually moves towards the center. We found functional bias in the genes that have lost promoter-associated CGIs in the human or mouse lineage. Finally, our analysis indicates that the association of CGIs with housekeeping genes is not as strong as previously estimated. Our study provides a detailed view of the evolution of promoter-associated CGIs in the human and mouse genomes and our findings are helpful for understanding the evolution of mammalian genomes and the role of CGIs in gene function.  相似文献   

6.
Epigenomics: beyond CpG islands   总被引:11,自引:0,他引:11  
  相似文献   

7.
Han L  Su B  Li WH  Zhao Z 《Genome biology》2008,9(5):R79

Background  

CpG islands, which are clusters of CpG dinucleotides in GC-rich regions, are considered gene markers and represent an important feature of mammalian genomes. Previous studies of CpG islands have largely been on specific loci or within one genome. To date, there seems to be no comparative analysis of CpG islands and their density at the DNA sequence level among mammalian genomes and of their correlations with other genome features.  相似文献   

8.
DNA methylation of CpG islands plays a crucial role in the regulation of gene expression. More than half of all human promoters contain CpG islands with a tissue-specific methylation pattern in differentiated cells. Still today, the whole process of how DNA methyltransferases determine which region should be methylated is not completely revealed. There are many hypotheses of which genomic features are correlated to the epigenome that have not yet been evaluated. Furthermore, many explorative approaches of measuring DNA methylation are limited to a subset of the genome and thus, cannot be employed, e.g., for genome-wide biomarker prediction methods. In this study, we evaluated the correlation of genetic, epigenetic and hypothesis-driven features to DNA methylation of CpG islands. To this end, various binary classifiers were trained and evaluated by cross-validation on a dataset comprising DNA methylation data for 190 CpG islands in HEPG2, HEK293, fibroblasts and leukocytes. We achieved an accuracy of up to 91% with an MCC of 0.8 using ten-fold cross-validation and ten repetitions. With these models, we extended the existing dataset to the whole genome and thus, predicted the methylation landscape for the given cell types. The method used for these predictions is also validated on another external whole-genome dataset. Our results reveal features correlated to DNA methylation and confirm or disprove various hypotheses of DNA methylation related features. This study confirms correlations between DNA methylation and histone modifications, DNA structure, DNA sequence, genomic attributes and CpG island properties. Furthermore, the method has been validated on a genome-wide dataset from the ENCODE consortium. The developed software, as well as the predicted datasets and a web-service to compare methylation states of CpG islands are available at http://www.cogsys.cs.uni-tuebingen.de/software/dna-methylation/.  相似文献   

9.
CpG islands: Algorithms and applications in methylation studies   总被引:1,自引:0,他引:1  
Methylation occurs frequently at 5’-cytosine of the CpG dinucleotides in vertebrate genomes; however, this epigenetic feature is rarely observed in CpG islands (CGIs) or CpG clusters in the promoter regions of genes. Aberrant methylation of the promoter-associated CGIs might influence gene expression and cause carcinogenesis. Because of the functional importance, multiple algorithms have been available for identifying CGIs in a genome or a sequence. They can be categorized into the traditional algorithms (e.g., Gardiner-Garden and Frommer (1987), Takai and Jones (2002), and CpGPRoD (2002)) or statistical property based algorithms (CpGcluster (2006) and CG cluster (2007)). We reviewed the features of these algorithms and evaluated their performance on identifying functional CGIs using genome-wide methylation data. Moreover, identification of CGIs is an initial step in many recent studies for predicting methylation status as well as in the design of methylation detection platforms. We reviewed the benchmarks and features used in these studies.  相似文献   

10.
11.
CpG islands (CGIs) play a fundamental role in genome analysis and annotation, and contribute to improving the accuracy of promoter prediction. Besides, CGIs in promoter regions are abnormally methylated in cancer cells and thus can be used as tumor markers. However, current methods for identifying CGIs suffer from various drawbacks. We present a new algorithm for detecting CGIs, called CpG Island Finder (CpGIF), which combines the best features in the most commonly used algorithms and avoids their disadvantages as much as possible. Five public tools for CpG island searching are used to compare with CpGIF for the assessment of accuracy and computational efficiency. The results reveal that CpGIF has higher performance coefficient and correlation coefficient than these previous methods, which indicates that CpGIF is able to provide high sensitivity and specificity at the same time. CpGIF is also faster than those methods with comparable prediction accuracy.  相似文献   

12.
Kang MI  Rhyu MG  Kim YH  Jung YC  Hong SJ  Cho CS  Kim HS 《Genomics》2006,87(5):580-590
Alu and L1 retroelements have been suggested to initiate the spread of CpG methylation. In this study, the spread of CpG methylation was estimated based on the distance between the CpG islands and the nearest retroelements. All human genes (23,116) were examined and the correlations between the length of the CpG islands and the distance and density of the confronting retroelements were examined using nonoverlapping 5-kb windows. There was a linear relationship between the length of the CpG islands and the density of the Alu elements and an inverse relationship between the CpG islands and the L1 elements located more distantly, suggesting a suppressive effect of the Alu's on the spread of L1 methylation. Methylation analysis of the transitional CpG sites between the CpG islands and the nearest retroelements upstream of 16 genes was then carried out using DNA preparations from 11 different human tissues. Methylation-variable transitional CpGs were observed for the selected genes and the different tissues.  相似文献   

13.
In this paper, we use a statistical estimator developed in astrophysics to study the distribution and organization of features of the human genome. Using the human reference sequence we quantify the global distribution of CpG islands (CGI) in each chromosome and demonstrate that the organization of the CGI across a chromosome is non-random, exhibits surprisingly long range correlations (10 Mb) and varies significantly among chromosomes. These correlations of CGI summarize functional properties of the genome that are not captured when considering variation in any particular separate (and local) feature. The demonstration of the proposed methods to quantify the organization of CGI in the human genome forms the basis of future studies. The most illuminating of these will assess the potential impact on phenotypic variation of inter-individual variation in the organization of the functional features of the genome within and among chromosomes, and among individuals for particular chromosomes.  相似文献   

14.
Summary The compositional properties of DNAs from 122 species of fishes and from 18 other coldblooded vertebrates (amphibians and reptiles) were compared with those from 10 warm-blooded vertebrates (mammals and birds) and found to be substantially different. Indeed, DNAs from cold-blooded vertebrates are characterized by much lower intermolecular compositional heterogeneities and CsCl band asymmetries, by a much wider spectrum of modal buoyant densities in CsCl, by generally lower amounts of satellites, as well as by the fact that in no case do buoyant densities reach the high values found in the GC-richest components of DNAs from warm-blooded vertebrates.In the case of fish genomes, which were more extensively studied, different orders were generally characterized by modal buoyant densities that were different in average values as well as in their ranges. In contrast, different families within any given order were more often characterized by narrow ranges of modal buoyant densities, and no difference in modal buoyant density was found within any single genus (except for the genusAphyosemion, which should be split into several genera).The compositional differences that were found among species belonging to different orders and to different families within the same order are indicative of compositional transitions, which were shown to be essentially due to directional base substitutions. These transitions were found to be independent of geological time. Moreover, the rates of directional base substitutions were found to be very variable and to reach, in some cases, extremely high values, that were even higher than those of silent substitutions in primates. The taxonomic and evolutionary implications of these findings are discussed.  相似文献   

15.
Tandem repeats in the CpG islands of imprinted genes   总被引:4,自引:0,他引:4  
Hutter B  Helms V  Paulsen M 《Genomics》2006,88(3):323-332
  相似文献   

16.
CpG islands in genes showing tissue-specific expression   总被引:2,自引:0,他引:2  
Patterns of DNA methylation at CpG dinucleotides and their relations with gene expression are complex. Methylation-free CpG clusters, so-called HTF islands, are most often associated with the promoter regions of housekeeping genes, whereas genes expressed in a single-cell type are usually deficient in these sequences. However, in the human carbonic anhydrase (CA) gene family, both the ubiquitously expressed CAII and the muscle specific CAIII appear to have such CpG islands although erythrocyte-specific CAI does not. The CAII island is quantitatively more CpG rich than that of CAIII, with a CpG:GpC ratio of 0.94 compared with 0.82 for CAIII. Estimation of CpG:GpC ratios in the proximal-promoter regions of 44 vertebrate genes suggest that 40% of genes with tissue-specific or limited tissue distribution may show methylation-free CpG clusters in their promoter regions. In many cases the CpG:GpC ratio is less than that found in housekeeping genes and this may reflect variation in the interaction of CpG clusters with regulatory factors that define different patterns of tissue expression.  相似文献   

17.
Cytosine methylation and the fate of CpG dinucleotides in vertebrate genomes   总被引:30,自引:1,他引:29  
Summary The dinucleotide CpG is a hotspot for mutation in the human genome as a result of (1) the modification of the 5 cytosine by cellular DNA methyltransferases and (2) the consequent high frequency of spontaneous deamination of 5-methyl cytosine (5mC) to thymidine. DNA methylation thus contributes significantly, albeit indirectly, to the incidence of human genetic disease. We have attempted to estimate for the first time the in vivo rate of deamination of 5mC from the measured rate of 5mC deamination in vitro and the known error frequency of the cellular G/T mismatch-repair system. The accuracy and utility of this estimate (m d ) was then assessed by comparison with clinical data, and an improved estimate of m d (1.66x10-16 s-1) was derived. Comparison of the CpG mutation rates exibited by globin gene and pseudogene sequences from human, chimpanzee and macaque provided further estimates of m d , all of which were consistent with the first. Use of this value in a mathematical model then permitted the estimation of the length of time required to produce the level of CpG suppression currently found in the bulk DNA of vertebrate genomes. This time span, approximately 450 million years, corresponds closely to the estimated time since the emergence and adaptive radiation of the vertebrates and thus coincides with the probable advent of heavily methylated genomes. An accurate estimate of the 5mC deamination rate is important not only for clinical medicine but also for studies of gene evolution. Our data suggest both that patterns of vertebrate gene methylation may be comparatively stable over relatively long periods of evolutionary time, and that the rate of CpG deamination can, under certain limited conditions, serve as a molecular clock.  相似文献   

18.
19.
CpG islands are discrete regions of DNA with significantly greater frequencies of CpG doublets than bulk genomic DNA. They are most frequently associated with the 5'-ends of housekeeping genes and are involved in the regulation of their expression. In this study, the structure and evolution of CpG islands within genes of the myc family were evaluated with the protein-coding sequences of animals and their transducing viruses. These evaluations relied on a gene tree for the entire myc family to test the origins of CpG islands within their two protein-coding exons. Overall, CG-very rich and CG-rich islands are associated with exon 2 of the different myc genes of warm-blooded vertebrates and with exon 3 of the N-myc and s-myc sequences of mammals, but not birds. These overall distributions of well-developed islands can be related to the major transitions of the CG-rich genomes of warm-blooded vertebrates from the CG-poor ones of other animals. In turn, the greater variability of well-developed islands within exon 3 of the N-myc gene and among the different retrogenes of the myc family can be attributed to their reduced functional constraints, as evidenced by their limited and very restricted patterns of expression, respectively.  相似文献   

20.
CpG islands as gene markers in the human genome.   总被引:65,自引:0,他引:65  
F Larsen  G Gundersen  R Lopez  H Prydz 《Genomics》1992,13(4):1095-1107
  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号