首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Chen Wang  Lukasz Kurgan 《Proteomics》2016,16(10):1486-1498
Intrinsically disordered proteins (IDPs) are abundant in various proteomes, where they play numerous important roles and complement biological activities of ordered proteins. Among functions assigned to IDPs are interactions with nucleic acids. However, often, such assignments are made based on the guilty‐by‐association principle. The validity of the extension of these correlations to all nucleic acid binding proteins has never been analyzed on a large scale across all domains of life. To fill this gap, we perform a comprehensive computational analysis of the abundance of intrinsic disorder and intrinsically disordered domains in nucleiomes (~548 000 nucleic acid binding proteins) of 1121 species from Archaea, Bacteria and Eukaryota. Nucleiome is a whole complement of proteins involved in interactions with nucleic acids. We show that relative to other proteins in the corresponding proteomes, the DNA‐binding proteins have significantly increased disorder content and are significantly enriched in disordered domains in Eukaryotes but not in Archaea and Bacteria. The RNA‐binding proteins are significantly enriched in the disordered domains in Bacteria, Archaea and Eukaryota, while the overall abundance of disorder in these proteins is significantly increased in Bacteria, Archaea, animals and fungi. The high abundance of disorder in nucleiomes supports the notion that the nucleic acid binding proteins often require intrinsic disorder for their functions and regulation.  相似文献   

2.
本文介绍了一个在微机(IBM PC)上实现的、用于核酸顺序分析的计算机程序系统.该系统由三个层次和18个功能块构成,菜单及人机对话使得用户能较快地掌握和使用它.在编程中,采用了树结构、先进后出栈和稀疏矩阵等数据结构技巧,运用了Bayes法等统计分析方法,Kruskal算法和Floyd算法等一系列图论方法也被得到应用,这个软件系统的推出对于分子生物学研究具有一定的积极作用.  相似文献   

3.
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome—between 5% and 33%, depending on genome size—while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.  相似文献   

4.
本文报导了用于基因重组与基因合成实验设计的软件系统的建立.此系统由30个功能模块组成,为研究者提供了包括在DNA分子上寻找限制性内切酶位点、核酸分子片段之间同源性比较,基因化学合成的实验设计、特定顺序分析引物及核酸杂交探针的设计、阅读框的查找等功能.此外,本系统可以对外来数据库的资料进行援引和进一步分析,为分子生物学的研究提供有价值的信息.  相似文献   

5.
Argonaute proteins are programmable nucleases that are found in both eukaryotes and prokaryotes and provide defense against invading genetic elements. Although some prokaryotic argonautes (pAgos) were shown to recognize RNA targets in vitro, the majority of studied pAgos have strict specificity toward DNA, which limits their practical use in RNA-centric applications. Here, we describe a unique pAgo nuclease, KmAgo, from the mesophilic bacterium Kurthia massiliensis that can be programmed with either DNA or RNA guides and can precisely cleave both DNA and RNA targets. KmAgo binds 16–20 nt long 5′-phosphorylated guide molecules with no strict specificity for their sequence and is active in a wide range of temperatures. In bacterial cells, KmAgo is loaded with small DNAs with no obvious sequence preferences suggesting that it can uniformly target genomic sequences. Mismatches between the guide and target sequences greatly affect the efficiency and precision of target cleavage, depending on the mismatch position and the nature of the reacting nucleic acids. Target RNA cleavage by KmAgo depends on the formation of secondary structure indicating that KmAgo can be used for structural probing of RNA. These properties of KmAgo open the way for its use for highly specific nucleic acid detection and cleavage.  相似文献   

6.
Certain chicken cells that do not spontaneously release virus particles have been shown to produce a subgroup E avian RNA tumor virus, Rous-associated virus 60 (RAV-60), after infection with viruses of other subgroups. The nucleic acids of RAV-60 were analyzed for sequence homologies with the viral nucleic acids contained in the uninfected cell and with those of RAV-2, the exogenous virus used for the preparation of this particular RAV-60 isolate. In addition, these nucleic acids were compared with those of RAV-0, an endogenous virus spontaneously released from line 100 chicken cells. RAV-60 appears to be intermediate between RAV-0 and RAV-2 in its genetic composition, based on the pattern of hybridization obtained with the nucleic acids of these viruses and on the melting profiles of the various hybrid combinations. Of the three viruses tested, RAV-0 appears to have the greatest sequence homology with the viral nucleic acids of the uninfected cell. Hybridization between RAV-60 3-H-labeled complementary DNA and either DNA or RNA from the uninfected cell indicates that RAV-60 contains some nucleic acid sequences which are not present in the cell. In addition, some RAV-60 sequences which hybridize with the cell nucleic acid contain significant amounts of mismatching, as indicated by the lower thermal stability of these hybrid duplexes. Hybrid formation between these partially homologous sequences was excluded under stringent annealing conditions. The data indicate that RAV-60 is a recombinant between exogenous and endogenous viral genes.  相似文献   

7.
During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life.  相似文献   

8.
Cell wall types of Bacteria and Archaea The acaryote microorganisms are divided into the two domains Bacteria and Archaea. The third domain represent the Eukarya. There is no universal cell wall polymer found in all Bacteria and Archaea. Due to their morphology several cell wall types can be identified, but the chemical diversity of the individual polymers is considerably greater. Certain cell wall polymers are limited to one of the two domains of Bacteria or Archaea like the murein of the Bacteria or the pseudomurein of some methanogens. Peptidoglycans (murein, pseudomurein) do not occur in eukaryotes. On the other hand individual cell wall polymers possess similarities to polymers of other domains. The structural principle of the methanochondroitin is also implemented in the eukaryotic connective tissue. The cell wall polymers consist frequently of glycoconjugates in which the amino acid content (glycoproteins) or the glycan moiety (proteoglycan‐like polymers) predominate. Both components (carbohydrates, amino acids) can also occur in similar amounts (peptidoglycan). There exist also cell wall polymers, which consist only of glycans (slimes, methanochondroitin) or amino acids (proteins, poly‐γ‐D‐glutamyl polymers). Cell wall‐free species (Mycoplasma) also occur. The chemical composition of the cell surface polymers was one of the first phenotypic characteristics that supported the 16 sRNA concept of Carl Woese to assign acaryote organisms into the two domains Bacteria and Archaea. A common feature of all Archaea is the lack of muramic acid and an outer membrane. The later occurs in the gramnegative Bacteria. During the evolution of Bacteria and Archaea a great variety of chemically different cell wall polymers has been developed which allow the growth and interaction of Bacteria and Archaea in different habitats. In this paper, some important surface polymers of Bacteria and Archaea are presented according to their chemical composition.  相似文献   

9.
Shajani Z  Varani G 《Biopolymers》2007,86(5-6):348-359
RNA and DNA molecules experience motions on a wide range of time scales, ranging from rapid localized motions to much slower collective motions of entire helical domains. The many functions of RNA in biology very often require this molecule to change its conformation in response to biological signals in the form of small molecules, proteins or other nucleic acids, whereas local motions in DNA may facilitate protein recognition and allow enzymes acting on DNA to access functional groups on the bases that would otherwise be buried in Watson-Crick base pairs. Although these statements make a compelling case to study the sequence dependent dynamics in nucleic acids, there are few residue-specific studies of nucleic acid dynamics. Fortunately, NMR studies of dynamics of nucleic acids and nucleic acids-protein complexes are gaining increased attention. The aim of this review is to provide an update of the recent progress in studies of nucleic acid dynamics by NMR based on the application of solution relaxation techniques.  相似文献   

10.
Summary Phylogenies were inferred from both the gene and the protein sequences of the translational elongation factor termed EF-2 (for Archaea and Eukarya) and EF-G (for Bacteria). All treeing methods used (distance-matrix, maximum likelihood, and parsimony), including evolutionary parsimony, support the archaeal tree and disprove the eocyte tree (i.e., the polyphyly and paraphyly of the Archaea). Distance-matrix trees derived from both the amino acid and the DNA sequence alignments (first and second codon positions) showed the Archaea to be a monophyletia-holophyletic grouping whose deepest bifurcation divides a Sulfolobus branch from a branch comprising Methanococcus, Halobacterium, and Thermoplasma. Bootstrapped distance-matrix treeing confirmed the monophyly-holophyly of Archaea in 100% of the samples and supported the bifurcation of Archaea into a Sulfolobus branch and a methanogen-halophile branch in 97% of the samples. Similar phylogenies were inferred by maximum likelihood and by maximum (protein and DNA) parsimony. DNA parsimony trees essentially identical to those inferred from first and second codon positions were derived from alternative DNA data sets comprising either the first or the second position of each codon. Bootstrapped DNA parsimony supported the monophyly-holophyly of Archaea in 100% of the bootstrap samples and confirmed the division of Archaea into a Sulfolobus branch and a methanogen-halophile branch in 93% of the bootstrap samples. Distance-matrix and maximum likelihood treeing under the constraint that branch lengths must be consistent with a molecular clock placed the root of the universal tree between the Bacteria and the bifurcation of Archaea and Eukarya. The results support the division of Archaea into the kingdoms Crenarchaeota (corresponding to the Sulfolobus branch and Euryarchaeota). This division was not confirmed by evolutionary parsimony, which identified Halobacterium rather than Sulfolobus as the deepest offspring within the Archaea.Offprint requests to: P. Cammarano  相似文献   

11.
12.
We report the nucleotide sequence of the Group IV RNA bacteriophage SP. The entire sequence is 4276 nucleotides long. Four cistrons have been identified by comparison with the related Group III phage Q beta. The maturation protein contains 449 amino acids, the coat protein contains 131 amino acids, the read-through protein contains 330 amino acids and the replicase beta-subunit contains 575 amino acids. SP is 59 nucleotides longer than Q beta. We have analyzed both sequence and structural conservation between SP and Q beta and shown that the sequences for the coat and central region of the replicase are strongly conserved between the two genomes. We also show that the S and M replicase binding sites of Q beta are strongly conserved in SP. Interestingly, the base composition of SP and Q beta differ significantly from one another, and most of the differences can be accounted for by a strong preponderance of U in the third position of each codon of Q beta relative to SP. We also compare conserved hairpins associated with potential coat protein and replicase binding sites.  相似文献   

13.
The recently published complete DNA sequence of the bacterium Thermotoga maritima provides evidence, based on protein sequence conservation, for lateral gene transfer between Archaea and Bacteria. We introduce a new method of periodicity analysis of DNA sequences, based on structural parameters, which brings independent evidence for the lateral gene transfer in the genome of T.maritima. The structural analysis relates the Archaea-like DNA sequences to the genome of Pyrococcus horikoshii. Analysis of 24 complete genomic DNA sequences shows different periodicity patterns for organisms of different origin. The typical genomic periodicity for Bacteria is 11 bp whilst it is 10 bp for Archaea. Eukaryotes have more complex spectra but the dominant period in the yeast Saccharomyces cerevisiae is 10.2 bp. These periodicities are most likely reflective of differences in chromatin structure.  相似文献   

14.
The complete sequence of honeybee (Apis mellifera) mitochondrial DNA is reported being 16,343 bp long in the strain sequenced. Relative to their positions in the Drosophila map, 11 of the tRNA genes are in altered positions, but the other genes and regions are in the same relative positions. Comparisons of the predicted protein sequences indicate that the honeybee mitochondrial genetic code is the same as that for Drosophila; but the anticodons of two tRNAs differ between these two insects. The base composition shows extreme bias, being 84.9% AT (cf. 78.6% in Drosophila yakuba). In protein-encoding genes, the AT bias is strongest at the third codon positions (which in some cases lack guanines altogether), and least in second codon positions. Multiple stepwise regression analysis of the predicted products of the protein-encoding genes shows a significant association between the numbers of occurrences of amino acids and %T in codon family, but not with the number of codons per codon family or other parameters associated with codon family base composition. Differences in amino acid abundances are apparent between the predicted Apis and Drosophila proteins, with a relative abundance in the Apis proteins of lysine and a relative deficiency of alanine. Drosophila alanine residues are as often replaced by serine as conserved in Apis. The differences in abundances between Drosophila and Apis are associated with %AT in the codon families, and the degree of divergence in amino acid composition between proteins correlates with the divergence in %AT at the second codon positions. Overall, transversions are about twice as abundant as transitions when comparing Drosophila and Apis protein-encoding genes, but this ratio varies between codon positions. Marked excesses of transitions over chance expectation are seen for the third positions of protein-coding genes and for the gene for the small subunit of ribosomal RNA. For the third codon positions the excess of transitions is adequately explained as due to the restriction of observable substitutions to transitions for conserved amino acids with two-codon families; the excess of transitions over expectation for the small ribosomal subunit suggests that the conservation of nucleotide size is favored by selection.  相似文献   

15.
MOTIVATION: Sensory domains that are conserved among Bacteria, Archaea and Eucarya are important detectors of common signals detected by living cells. Due to their high sequence divergence, sensory domains are difficult to identify. We systematically look for novel sensory domains using sensitive profile-based searches initiated with regions of signal transduction proteins where no known domains can be identified by current domain models. RESULTS: Using profile searches followed by multiple sequence alignment, structure prediction and domain architecture analysis, we have identified a novel sensory domain termed FIST, which is present in signal transduction proteins from Bacteria, Archaea and Eucarya. Chromosomal proximity of FIST-encoding genes to those coding for proteins involved in amino acid metabolism and transport suggest that FIST domains bind small ligands, such as amino acids.  相似文献   

16.
Canaves JM 《Proteins》2004,56(1):19-27
Recently, the structures of two proteins belonging to the archease family, TM1083 from Thermotoga maritima and MTH1598 from Methanobacterium thermoautotrophicum, have been solved independently by two Protein Structure Initiative structural genomics pilot centers using X-ray crystallography and NMR, respectively. The archease protein family is a good example of one of the paradoxes of structural genomics: Approximately one third of protein structures produced by structural genomics centers have no known function and are still annotated as "hypothetical proteins" in the Protein Data Bank. In the case of archeases, despite the existence of two protein structures and abundant sequence information, there is still no function assigned to this protein family. Here, our group predicts, based on structural similarity, sequence conservation, and gene context analyses, that members of this protein family might function as chaperones or modulators of proteins involved in DNA/RNA processing. The conservation of genomic context for this protein family is constant from Archaea and Bacteria to humans, and suggests that unannotated open reading frames contiguous to them could be novel RNA/DNA binding proteins.  相似文献   

17.
Summary The compositional distributions of coding sequences and DNA molecules (in the 50-100-kb range) are remarkably narrower in murids (rat and mouse) compared to humans (as well as to all other mammals explored so far). In murids, both distributions begin at higher and end at lower GC values. A comparison of homologous coding sequences from murids and humans revealed that their different compositional distributions are due to differences in GC levels in all three codon positions, particularly of genes located at both ends of the distribution. In turn, these differences are responsible for differences in both codon usage and amino acids. When GC levels at first+second codon positions and third codon positions, respectively, of murid genes are plotted against corresponding GC levels of homologous human genes, linear relationships (with very high correlation coefficients and slopes of about 0.78 and 0.60, respectively) are found. This indicates a conservation of the order of GC levels in homologous genes from humans and murids. (The same comparison for mouse and rat genes indicates a conservation of GC levels of homologous genes.) A similar linear relationship was observed when plotting GC levels of corresponding DNA fractions (as obtained by density gradient centrifugation in the presence of a sequence-specific ligand) from mouse and human. These findings indicate that orderly compositional changes affecting not only coding sequences but also noncoding sequences took place since the divergence of murids. Such directional fixations of mutations point to the existence of selective pressures affecting the genome as a whole.  相似文献   

18.
Mimivirus is a nucleocytoplasmic large DNA virus (NCLDV) with a genome size (1.2 Mb) and coding capacity ( 1000 genes) comparable to that of some cellular organisms. Unlike other viruses, Mimivirus and its NCLDV relatives encode homologs of broadly conserved informational genes found in Bacteria, Archaea, and Eukaryotes, raising the possibility that they could be placed on the tree of life. A recent phylogenetic analysis of these genes showed the NCLDVs emerging as a monophyletic group branching between Eukaryotes and Archaea. These trees were interpreted as evidence for an independent "fourth domain" of life that may have contributed DNA processing genes to the ancestral eukaryote. However, the analysis of ancient evolutionary events is challenging, and tree reconstruction is susceptible to bias resulting from non-phylogenetic signals in the data. These include compositional heterogeneity and homoplasy, which can lead to the spurious grouping of compositionally-similar or fast-evolving sequences. Here, we show that these informational gene alignments contain both significant compositional heterogeneity and homoplasy, which were not adequately modelled in the original analysis. When we use more realistic evolutionary models that better fit the data, the resulting trees are unable to reject a simple null hypothesis in which these informational genes, like many other NCLDV genes, were acquired by horizontal transfer from eukaryotic hosts. Our results suggest that a fourth domain is not required to explain the available sequence data.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号