首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 546 毫秒
1.
Summary We have investigated the compositional properties of coding sequences from cold-blooded vertebrates and we have compared them with those from warm-blooded vertebrates. Moreover, we have studied the compositional correlations of coding sequences with the genomes in which they are contained, as well as the compositional correlations among the codon positions of the genes analyzed.The distribution of GC levels of the third codon positions of genes from cold-blooded vertebrates are distinctly different from those of warm-blooded vertebrates in that they do not reach the high values attained by the latter. Moreover, coding sequences from cold-blooded vertebrates are either equal, or, in most cases, lower in GC (not only in third, but also in first and second codon positions) than homologous coding sequences from warm-blooded vertebrates; higher values are exceptional. These results at the gene level are in agreement with the compositional differences between cold-blooded and warm-blooded vertebrates previously found at the whole genome (DNA) level (Bernardi and Bernardi 1990a,b).Two linear correlations were found: one between the GC levels of coding sequences (or of their third codon positions) and the GC levels of the genomes of cold-blooded vertebrates containing them; and another between the GC levels of third and first+ second codon positions of genes from cold-blooded vertebrates. The first correlation applies to the genomes (or genome compartments) of all vertebrates and the second to the genes of all living organisms. These correlations are tantamount to a genomic code.  相似文献   

2.
The compositional distributions of large (main-band) DNA fragments from eight birds belonging to eight different orders (including both paleognathous and neognathous species) are very broad and extremely close to each other. These findings, which are paralleled by the compositional similarity of homologous coding sequences and their codon positions, support the idea that birds are a monophyletic group.The compositional distribution of third-codon positions of genes from chicken, the only avian species for which a relatively large number of coding sequences is known, is very broad and bimodal, the minor GC-richer peak reaching 100% GC. The very high compositional heterogeneity of avian genomes is accompanied (as in the case of mammalian genomes) by a very high speciation rate compared to cold-blooded vertebrates which are characterized by genomes that are much less heterogeneous. The higher GC levels attained by avian compared to mammalian genomes might be correlated with the higher body temperature (41–43°C) of birds compared to mammals (37°C).A comparison of GC levels of coding sequences and codon positions from man and chicken revealed very close average GC levels and standard deviations. Homologous coding sequences and codon positions from man and chicken showed a surprisingly high degree of compositional similarity which was, however, higher for GC-poor than for GC-rich sequences. This indicates that GC-poor isochores of warm-blooded vertebrates reflect the composition of the isochores of the genome of the common reptilian ancestor of mammals and birds, which underwent only a small compositional change at the transition from cold- to warm-blooded vertebrates. In contrast, the GC-rich isochores of birds and mammals are the result of large compositional changes at the same evolutionary transition, where were in part different in the two classes of warm-blooded vertebrates.Correspondence to: G. Bernaadi  相似文献   

3.
Bernardi G 《Gene》2000,241(1):3-17
The nuclear genomes of vertebrates are mosaics of isochores, very long stretches (>300kb) of DNA that are homogeneous in base composition and are compositionally correlated with the coding sequences that they embed. Isochores can be partitioned in a small number of families that cover a range of GC levels (GC is the molar ratio of guanine+cytosine in DNA), which is narrow in cold-blooded vertebrates, but broad in warm-blooded vertebrates. This difference is essentially due to the fact that the GC-richest 10-15% of the genomes of the ancestors of mammals and birds underwent two independent compositional transitions characterized by strong increases in GC levels. The similarity of isochore patterns across mammalian orders, on the one hand, and across avian orders, on the other, indicates that these higher GC levels were then maintained, at least since the appearance of ancestors of warm-blooded vertebrates. After a brief review of our current knowledge on the organization of the vertebrate genome, evidence will be presented here in favor of the idea that the generation and maintenance of the GC-richest isochores in the genomes of warm-blooded vertebrates were due to natural selection.  相似文献   

4.
CpG islands, genes and isochores in the genomes of vertebrates   总被引:6,自引:0,他引:6  
B A?ssani  G Bernardi 《Gene》1991,106(2):185-195
We have shown that human genes associated with CpG islands increase in number as they increase in % of guanine + cytosine (GC) levels, and that most genes associated with CpG islands are located in the GC-richest compartment of the human genome. This is an independent confirmation of the concentration gradient of CpG islands (detected as HpaII tiny fragments, or HTF) which was demonstrated in the genome of warm-blooded vertebrates [A?ssani and Bernardi, Gene 106 (1991) 173-183]. We then reassessed the location of CpG islands using the data currently available and confirmed that CpG islands are most frequently located in the 5'-flanking sequences of genes and that they overlap genes to variable extents. We have shown that such extents increase with the increasing GC levels of genes, the GC-richest genes being completely included in CpG islands. Under such circumstances, we have investigated the properties of the 'extragenic' CpG islands located in the 5'-flanking segments of homologous genes from both warm- and cold-blooded vertebrates. We have confirmed that, in cold-blooded vertebrates, CpG islands are often absent; when present, they have lower GC and CpG levels; the latter attain, however, statistically expected values. Finally, we have shown that CpG doublets increase with the increasing GC of exons, introns and intergenic sequences (including 'extragenic' CpG islands) in the genomes from both warm- and cold-blooded vertebrates. The correlations found are the same for both classes of vertebrates, and are similar for exons, introns and intergenic sequences (including 'extragenic' CpG islands). The findings just outlined indicate that the origin and evolution of CpG islands in the vertebrate genome are associated with compositional transitions (GC increases) in genes and isochores.  相似文献   

5.
Summary The compositional distribution of coding sequences from five vertebrates (Xenopus, chicken, mouse, rat, and human) is shifted toward higher GC values compared to that of the DNA molecules (in the 35–85-kb size range) isolated from the corresponding genomes. This shift is due to the lower GC levels of intergenic sequences compared to coding sequences. In the cold-blooded vertebrate, the two distributions are similar in that GC-poor genes and GC-poor DNA molecules are largely predominant. In contrast, in the warm-blooded vertebrates, GC-rich genes are largely predominant over GC-poor genes, whereas GC-poor DNA molecules are largely predominant over GC-rich DNA molecules. As a consequence, the genomes of warm-blooded vertebrates show a compositional gradient of gene concentration. The compositional distributions of coding sequences (as well as of DNA molecules) showed remarkable differences between chicken and mammals, and between mouse (or rat) and human. Differences were also detected in the compositional distribution of housekeeping and tissue-specific genes, the former being more abundant among GC-rich genes.  相似文献   

6.
A compositional transition was previously detected by comparing orthologous coding sequences from cold- and warm-blooded vertebrates (see Bernardi, G., Hughes, S., Mouchiroud, D., 1997. The major compositional transitions in the vertebrate genome. J. Mol. Evol. 44, S44-S51 for a review). The transition is characterized by higher GC levels (GC is the molar ratio of guanine+cytosine in DNA) and, especially, by higher GC3 levels (GC3 is the GC level of third codon positions) in coding sequences from warm-blooded vertebrates. This transition essentially affects GC-rich genes, although the nucleotide substitution rate is of the same order of magnitude in both GC-poor and GC-rich genes. In order to understand the evolutionary basis of the changes, we have compared the hydrophobicity of orthologous proteins from Xenopus and human. Although the differences are small in proteins encoded by coding sequences ranging from 0 to 65% in GC3, they are large in the proteins encoded by sequences characterized by GC3 values higher than 65%. The latter proteins are more hydrophobic in human than in Xenopus.  相似文献   

7.
CpG islands: features and distribution in the genomes of vertebrates   总被引:4,自引:0,他引:4  
B A?ssani  G Bernardi 《Gene》1991,106(2):173-183
We have investigated the distribution of unmethylated CpG islands in vertebrate genomes fractionated according to their base composition. Genomes from warm-blooded vertebrates (man, mouse and chicken) are characterized by abundant CpG islands, whose frequency increases in DNA fractions of increasing % of guanine + cytosine; % G + C (GC), in parallel with the distribution of genes and CpG doublets. Small, yet significant, differences in the distribution of CpG islands were found in the three genomes. In contrast, genomes from cold-blooded vertebrates (two reptiles, one amphibian, and two fishes) were characterized by an extreme scarcity or absence of CpG islands (detected in these experiments as HpaII tiny fragments or HTF). CpG islands associated with homologous genes from cold- and warm-blooded vertebrates were then compared by analyzing CpG frequencies, GC levels, HpaII sites, rare-cutter sites and G/C boxes (GGGGCGGGGC and closely related motifs) in sequences available in gene banks. Small, yet significant, differences were again detected among the CpG islands associated with homologous genes from warm-blooded vertebrates, in that CpG islands associated with mouse or rat genes often showed low CpG and/or GC levels, as well as low numbers of HpaII sites, rare-cutter sites and G/C boxes, compared to homologous human genes; more rarely, CpG islands were just absent. As far as cold-blooded vertebrates were concerned, a number of genes showed CpG islands, which exhibited a much lower frequency of CpG doublets than that found in CpG islands of warm-blooded vertebrates, but still approached the statistically expected frequency; none of the other features of CpG islands associated with genes from warm-blooded vertebrates were present. Other genes did not show any associated CpG islands, unlike their homologues from warm-blooded vertebrates.  相似文献   

8.
Warm-blooded isochore structure in Nile crocodile and turtle.   总被引:11,自引:0,他引:11  
  相似文献   

9.
The genomes of homeothermic (warm-blooded) vertebrates are mosaic interspersions of homogeneously GC-rich and GC-poor regions (isochores). Evolution of genome compartmentalization and GC-rich isochores is hypothesized to reflect either selective advantages of an elevated GC content or chromosome location and mutational pressure associated with the timing of DNA replication in germ cells. To address the present controversy regarding the origins and maintenance of isochores in homeothermic vertebrates, newly obtained as well as published nucleotide sequences of the insulin and insulin-like growth factor (IGF) genes, members of a well-characterized gene family believed to have evolved by repeated duplication and divergence, were utilized to examine the evolution of base composition in nonconstrained (flanking) and weakly constrained (introns and fourfold degenerate sites) regions. A phylogeny derived from amino acid sequences supports a common evolutionary history for the insulin/IGF family genes. In cold- blooded vertebrates, insulin and the IGFs were similar in base composition. In contrast, insulin and IGF-II demonstrate dramatic increases in GC richness in mammals, but no such trend occurred in IGF- I. Base composition of the coding portions of the insulin and IGF genes across vertebrates correlated (r = 0.90) with that of the introns and flanking regions. The GC content of homologous introns differed dramatically between insulin/IGF-II and IGF-I genes in mammals but was similar to the GC level of noncoding regions in neighboring genes. Our findings suggest that the base composition of introns and flanking regions is determined by chromosomal location and the mutational pressure of the isochore in which the sequences are embedded. An elevated GC content at codon third positions in the insulin and the IGF genes may reflect selective constraints on the usage of synonymous codons.   相似文献   

10.
We compared the exon/intron organization of vertebrate genes belonging to different isochore classes, as predicted by their GC content at third codon position. Two main features have emerged from the analysis of sequences published in GenBank: (1) genes coding for long proteins (i.e., 500 aa) are almost two times more frequent in GC-poor than in GC-rich isochores; (2) intervening sequences (=sum of introns) are on average three times longer in GC-poor than in GC-rich isochores. These patterns are observed among human, mouse, rat, cow, and even chicken genes and are therefore likely to be common to all warm-blooded vertebrates. Analysis of Xenopus sequences suggests that the same patterns exist in cold-blooded vertebrates. It could be argued that such results do not reflect the reality because sequence databases are not representative of entire genomes. However, analysis of biases in GenBank revealed that the observed discrepancies between GC-rich and GC-poor isochores are not artifactual, and are probably largely underestimated. We investigated the distribution of microsatellites and interspersed repeats in introns of human and mouse genes from different isochores. This analysis confirmed previous studies showing that Ll repeats are almost absent from GC-rich isochores. Microsatellites and SINES (Alu, B1, B2) are found at roughly equal frequencies in introns from all isochore classes. Globally, the presence of repeated sequences does not account for the increased intron length in GC-poor isochores. The relationships between gene structure and global genome organization and evolution are discussed.  相似文献   

11.
Vertebrate genomes are mosaics of isochores. On the assumption that marked differences exist in the isochore structure between warm-blooded and cold-blooded animals, variations among vertebrates were previously attributed to adaptation to homeothermy. However, based on the data of coding regions from representatives of extant vertebrates, including a turtle, a crocodile (Archosauromorpha) and a few kinds of snakes (Lepidosauromorpha), it was recently hypothesized that the common ancestors of mammals, birds and extant reptiles already had the "warm-blooded" isochore structure. To test this hypothesis, the nucleotide sequences of alpha-globin genes including non-coding regions (introns) from two snakes, N. kaouthia and E. climacophora, were determined (accession number: AB104824, AB104825). The correlation between the GC contents in the introns and exons of alpha-globin genes from snakes and those from other vertebrates supports the above hypothesis. Similar analysis using data for exons and introns of other genes obtained from the GenBank (Release 131) also support the above hypothesis.  相似文献   

12.
Fortes GG  Bouza C  Martínez P  Sánchez L 《Genetica》2007,129(3):281-289
To review the general consideration about the different compositional structure of warm and cold-blooded vertebrates genomes, we used of the increasing number of genetic sequences, including coding (exons) and non-coding (introns) regions, that have been deposited on the databases throughout last years. The nucleotide distributions of the third codon positions (GC3) have been analyzed in 1510 coding sequences (CDS) of fish, 1414 CDS of amphibians and 320 CDS of reptiles. Also, the relationship between GC content of 74, 56 and 25 CDS of fish, amphibians and reptiles, respectively and that of their corresponding introns (GCI) have been considerated. In accordance with recent data, sequence analysis showed the presence of very GC3-rich CDS in these poikilotherm vertebrates. However, very high diversity in compositional patterns among different orders of fish, amphibians and reptiles was found. Significant positive correlations between GC3 and GCI was also confirmed for the genes analyzed. Nevertheless, introns resulted to be poorer in GC than their corresponding CDS, this difference being larger than in human genome. Because the limited number of available sequences including exons and introns we must be cautious about the results derived from them. However, the indicious of higher GC richness of coding sequences than of their corresponding introns could aid to understand the discrepancy of sequence analysis with the ultracentrifugation studies in cold-blooded vertebrates that did not predict the existence of GC-rich isochores.  相似文献   

13.
There is a long-standing debate in molecular evolution concerning the putative importance of GC content in adapting the thermal stabilities of DNA and RNA. Most studies of this relationship have examined broad-scale compositional patterns, for example, total GC percentages in genomes and occurrence of GC-rich isochores. Few studies have systematically examined the GC contents of individual orthologous genes from differently thermally adapted species. When this has been done, the emphasis has been on comparing large numbers of genes in only a few species. We have approached the GC-adaptation temperature hypothesis in a different manner by examining patterns of base composition of genes encoding lactate dehydrogenase-A (ldh-a) and alpha-actin (alpha-actin) from 51 species of vertebrates whose adaptation temperatures ranged from -1.86 degrees C (Antarctic fishes) to approximately 45 degrees C (desert reptile). No significant positive correlation was found between any index of GC content (GC content of the entire sequence, GC content of the third codon position [GC(3)], and GC content at fourfold degenerate sites [GC(4)]) and any index of adaptation temperature (maximal, mean, or minimal body temperature). For alpha-actin, slopes of regression lines for all comparisons did not differ significantly from zero. For ldh-a, negative correlations between adaptation temperature and total GC content, GC(3), and GC(4) were observed but were shown to be due entirely to phylogenetic influences (as revealed by independent contrast analyses). This comparison of GC content across a wide range of ectothermic ("cold-blooded") and endothermic ("warm-blooded") vertebrates revealed that frogs of the genus Xenopus, which have commonly been used as a representative cold-blooded species, in fact are outliers among ectotherms for the alpha-actin analyses, raising concern about the appropriateness of choosing these amphibians as representative of ectothermic vertebrates in general. Our study indicates that, whereas GC contents of isochores may show variation among different classes of vertebrates, there is no consistent relationship between adaptation temperature and the percentage of thermal stability-enhancing G + C base pairs in protein-coding genes.  相似文献   

14.
15.
Codon usages are known to vary among vertebrates chiefly due to variations in isochore structure. Under the assumption that marked differences exist in isochore structure between warm-blooded and cold-blooded animals, the variations among vertebrates were previously attributed to an adaptation to homeothermy. However, based on data from a turtle species and a crocodile (Archosauromorpha), it was recently proposed that the common ancestors of mammals, birds and extent reptiles already had the "warm-blooded" isochore structure. We determined the nucleotide sequences of alpha-globin genes from two species of heterotherms, cuckoo (Cuculus canorus) and bat (Pipistrellus abramus), and three species of snakes (Lepidosauromorpha), Naja kaouthia from a tropical terrestrial habitat, Elaphe climacophora from a temperate terrestrial habitat, and Hydrophis melanocephalus from a tropical marine habitat. Our purposes were to assess the influence of differential body temperature patterns on codon usage and GC content at the third position of a codon (GC3), and to test the hypothesis concerning the phylogenetic position at which GC contents had increased in vertebrates. The results of principal component analysis (PCA) using the present data and data for other taxa from GenBank indicate that the primary difference in codon usage in globin genes among amniotes and other vertebrates lies in GC3. The codon usages (and GC3) in alpha-globin genes from two heterotherms and three snakes are similar to those in alpha-globin genes from warm-blooded vertebrates. These results refute the influence of body temperature pattern upon codon usages (and GC3) in alpha-globin genes, and support the hypothesis that the increase in GC content in the genome occurred in the common ancestor of amniotes.  相似文献   

16.
An analysis of silent substitutions in pairwise comparisons of homologous genes from different mammals has shown that, in spite of individual fluctuations, their frequencies (which are very strongly correlated with the frequency of substitutions per synonymous site calculated according to Li et al. 1985) do not vary, on the average, with the GC levels of silent positions. This holds in the general case, in which silent positions of pairs of homologous genes share the same composition, namely in the human/other primates, human/artiodactyls, and in the mouse/rat pairs, as well as in the special cases in which the composition of silent positions are different, namely in the human/rabbit and the human/rat (or human/mouse) pairs. A slightly lower frequency found for low GC values in the human/bovine and human/pig pairs seems to be due to the specific gene samples used. These results contradict the previously claimed existence of differences in mutation rates and of mutational biases in third codon positions of coding sequences located in different isochores of mammalian genomes. They also imply that the variations in nucleotide precursor pools through the cell cycle and the differences in replication timing, or in repair efficiency, which were reported for different isochores, do not lead, as claimed, to differences in mutation rates, not in mutational biases in mammals. The differences claimed appear to be due to using small gene samples when individual fluctuations from gene to gene are relatively large. Correspondence to: G. Bernardi  相似文献   

17.
We carried out a comprehensive survey of small subunit ribosomal RNA sequences from archaeal, bacterial, and eukaryotic lineages in order to understand the general patterns of thermal adaptation in the rRNA genes. Within each lineage, we compared sequences from mesophilic, moderately thermophilic, and hyperthermophilic species. We carried out a more detailed study of the archaea, because of the wide range of growth temperatures within this group. Our results confirmed that there is a clear correlation between the GC content of the paired stem regions of the 16S rRNA genes and the optimal growth temperature, and we show that this correlation cannot be explained simply by phylogenetic relatedness among the thermophilic archaeal species. In addition, we found a significant, positive relationship between rRNA stem length and growth temperature. These correlations are found in both bacterial and archaeal rRNA genes. Finally, we compared rRNA sequences from warm-blooded and cold-blooded vertebrates. We found that, while rRNA sequences from the warm-blooded vertebrates have a higher overall GC content than those from the cold-blooded vertebrates, this difference is not concentrated in the paired regions of the molecule, suggesting that thermal adaptation is not the cause of the nucleotide differences between the vertebrate lineages. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Nicolas Galtier]  相似文献   

18.
Summary The compositional distributions of coding sequences and DNA molecules (in the 50-100-kb range) are remarkably narrower in murids (rat and mouse) compared to humans (as well as to all other mammals explored so far). In murids, both distributions begin at higher and end at lower GC values. A comparison of homologous coding sequences from murids and humans revealed that their different compositional distributions are due to differences in GC levels in all three codon positions, particularly of genes located at both ends of the distribution. In turn, these differences are responsible for differences in both codon usage and amino acids. When GC levels at first+second codon positions and third codon positions, respectively, of murid genes are plotted against corresponding GC levels of homologous human genes, linear relationships (with very high correlation coefficients and slopes of about 0.78 and 0.60, respectively) are found. This indicates a conservation of the order of GC levels in homologous genes from humans and murids. (The same comparison for mouse and rat genes indicates a conservation of GC levels of homologous genes.) A similar linear relationship was observed when plotting GC levels of corresponding DNA fractions (as obtained by density gradient centrifugation in the presence of a sequence-specific ligand) from mouse and human. These findings indicate that orderly compositional changes affecting not only coding sequences but also noncoding sequences took place since the divergence of murids. Such directional fixations of mutations point to the existence of selective pressures affecting the genome as a whole.  相似文献   

19.
In situ hybridizations of single-copy GC-rich, gene-rich and GC-poor, gene-poor chicken DNA allowed us to localize the gene-rich and the gene-poor chromosomal regions in interphase nuclei of cold-blooded vertebrates. Our results showed that the gene-rich regions from amphibians (Rana esculenta) and reptiles (Podarcis sicula) occupy the more internal part of the nuclei, whereas the gene-poor regions occupy the periphery. This finding is similar to that previously reported in warm-blooded vertebrates, in spite of the lower GC levels of the gene-rich regions of cold-blooded vertebrates. This suggests that this similarity extends to chromatin structure, which is more open in the gene-rich regions of both mammals and birds and more compact in the gene-poor regions. In turn, this may explain why the compositional transition undergone by the genome at the emergence of homeothermy did not involve the entire ancestral genome but only a small part of it, and why it involved both coding and noncoding sequences. Indeed, the GC level increased only in that part of the genome that needed a thermodynamic stabilization, namely in the more open gene-rich chromatin of the nuclear interior, whereas the gene-poor chromatin of the periphery was stabilized by its own compact structure.  相似文献   

20.
DNA melting is rate-limiting for cytosine deamination, from which we infer that the rate of cytosine deamination should decline twofold for each 10% increase in GC content. Analysis of human DNA sequence data confirms that this is the case for 5-methylcytosine. Several lines of evidence further confirm that it is also the case for unmethylated cytosine and that cytosine deamination causes the majority of all C-->T and G-->A transitions in mammals. Thus, cytosine deamination and DNA base composition each affect the other, forming a positive feedback loop that facilitates divergent genetic drift to high or low GC content. Because a 10 degrees C increase in temperature in vitro increases the rate of cytosine deamination 5. 7-fold, cytosine deamination must be highly dependent on body temperature, which is consistent with the dramatic differences between the isochores of warm-blooded versus cold-blooded vertebrates. Because this process involves both DNA melting and positive feedback, it would be expected to spread progressively (in evolutionary time) down the length of the chromosome, which is consistent with the large size of isochores in modern mammals.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号