首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 734 毫秒
1.
Fourteen different major urinary protein (MUP) genomic clones from BALB/c mice were isolated. By restriction site mapping, six of these form two sets of three overlapping clones. By the criterion of cross-hybridization, the 10 different genes fall into two groups of four (Group 1) and three (Group 2) genes, while three genes fall into neither group. Southern blot analysis of genomic DNA with Group 1 and Group 2 plasmid subclones shows that the haploid mouse (BALB/c) genome contains approximately 15 Group 1 genes, 12 Group 2 genes and at least seven MUP genes that belong to neither group. An analysis of mouse-Chinese hamster hybrid cell lines shows that most, if not all, Group 1 and Group 2 genes are located on mouse chromosome 4.  相似文献   

2.
A J Clark  P M Clissold  J O Bishop 《Gene》1982,18(3):221-230
We describe ten Charon 4A genomic DNA clones from BALB/c mice which include at least seven different major urinary protein (MUP) genes. We have established the orientation of all seven sequences, and have placed six of them in precise register by means of restriction site maps and Southern blot hybridization with cloned cDNA sequences. Four of the seven genomic sequences (family I sequences) form hybrids with six independent cDNA clones that have a high thermal stability and hybridize more strongly with mRNA from three inbred mouse lines. Hybrids between the remaining three genomic sequences and the cDNA clones have a lower thermal stability and hybridize less strongly with mRNA from the three inbred lines. Homologies between different cloned sequences extend over as much as 15 kb. No clone contains parts of two MUP genes, and no homology has been detected between the 3' flanking region of one MUP gene and the 5' flanking region of another.  相似文献   

3.
4.

Background

Phylogenetic analyses based on datasets rich in both genes and species (phylogenomics) are becoming a standard approach to resolve evolutionary questions. However, several difficulties are associated with the assembly of large datasets, such as multiple copies of a gene per species (paralogous or xenologous genes), lack of some genes for a given species, or partial sequences. The use of undetected paralogous or xenologous genes in phylogenetic inference can lead to inaccurate results, and the use of partial sequences to a lack of resolution. A tool that selects sequences, species, and genes, while dealing with these issues, is needed in a phylogenomics context.

Results

Here, we present SCaFoS, a tool that quickly assembles phylogenomic datasets containing maximal phylogenetic information while adjusting the amount of missing data in the selection of species, sequences and genes. Starting from individual sequence alignments, and using monophyletic groups defined by the user, SCaFoS creates chimeras with partial sequences, or selects, among multiple sequences, the orthologous and/or slowest evolving sequences. Once sequences representing each predefined monophyletic group have been selected, SCaFos retains genes according to the user's allowed level of missing data and generates files for super-matrix and super-tree analyses in several formats compatible with standard phylogenetic inference software. Because no clear-cut criteria exist for the sequence selection, a semi-automatic mode is available to accommodate user's expertise.

Conclusion

SCaFos is able to deal with datasets of hundreds of species and genes, both at the amino acid or nucleotide level. It has a graphical interface and can be integrated in an automatic workflow. Moreover, SCaFoS is the first tool that integrates user's knowledge to select orthologous sequences, creates chimerical sequences to reduce missing data and selects genes according to their level of missing data. Finally, applying SCaFoS to different datasets, we show that the judicious selection of genes, species and sequences reduces tree reconstruction artefacts, especially if the dataset includes fast evolving species.
  相似文献   

5.
Lipases are physiologically important and ubiquitous enzymes that share a conserved domain and are classified into eight different families based on their amino acid sequences and fundamental biological properties. The Lipase3 family of lipases was reported to possess a canonical fold typical of α/β hydrolases and a typical catalytic triad, suggesting a distinct evolutionary origin for this family. Genes in the Lipase3 family do not have the same functions, but maintain the conserved Lipase3 domain. There have been extensive studies of Lipase3 structures and functions, but little is known about their evolutionary histories. In this study, all lipases within five plant species were identified, and their phylogenetic relationships and genetic properties were analyzed and used to group them into distinct evolutionary families. Each identified lipase family contained at least one dicot and monocot Lipase3 protein, indicating that the gene family was established before the split of dicots and monocots. Similar intron/exon numbers and predicted protein sequence lengths were found within individual groups. Twenty-four tandem Lipase3 gene duplications were identified, implying that the distinctive function of Lipase3 genes appears to be a consequence of translocation and neofunctionalization after gene duplication. The functional genes EDS1, PAD4, and SAG101 that are reportedly involved in pathogen response were all located in the same group. The nucleotide diversity (Dxy) and the ratio of nonsynonymous to synonymous nucleotide substitutions rates (Ka/Ks) of the three genes were significantly greater than the average across the genomes. We further observed evidence for selection maintaining diversity on three genes in the Toll-Interleukin-1 receptor type of nucleotide binding/leucine-rich repeat immune receptor (TIR-NBS LRR) immunity-response signaling pathway, indicating that they could be vulnerable to pathogen effectors.  相似文献   

6.
The nucleotide sequences of cloned genes coding for the elongation factor Tu of seven eubacteria have been determined. These genes were fiom Anacystis nidulans, Bacillus subtilis, Bacteroides fragilis, Deinonema spec., Pseudomonas cepacia, Shewanella putrefaciens and Streptococcus oralis. The primary structures of the genes were compared to the available sequences of prokaryotic elongation factors Tu and eukaryotic elongation factors 1 alpha. A conservation profile was determined for homologous amino acid residues. Sites of known or putative functions are usually located at highly conserved positions or within highly conserved sequence stretches. The aligned 24 amino acid sequences were used as basis for a phylogenetic analysis. The phylogenetic tree corroborates the kingdom as well as phylum concept deduced from 16S rRNA data.Abbreviations EF-Tu elongation factor Tu - GDP guanosine 5-diphosphate - GTP guanosine 5-triphosphate; tuf gene, gene coding for elongation factor Tu  相似文献   

7.
A BAC-based integrated linkage map of the silkworm Bombyx mori   总被引:3,自引:0,他引:3  

Background

In 2004, draft sequences of the model lepidopteran Bombyx mori were reported using whole-genome shotgun sequencing. Because of relatively shallow genome coverage, the silkworm genome remains fragmented, hampering annotation and comparative genome studies. For a more complete genome analysis, we developed extended scaffolds combining physical maps with improved genetic maps.

Results

We mapped 1,755 single nucleotide polymorphism (SNP) markers from bacterial artificial chromosome (BAC) end sequences onto 28 linkage groups using a recombining male backcross population, yielding an average inter-SNP distance of 0.81 cM (about 270 kilobases). We constructed 6,221 contigs by fingerprinting clones from three BAC libraries digested with different restriction enzymes, and assigned a total of 724 single copy genes to them by BLAST (basic local alignment search tool) search of the BAC end sequences and high-density BAC filter hybridization using expressed sequence tags as probes. We assigned 964 additional expressed sequence tags to linkage groups by restriction fragment length polymorphism analysis of a nonrecombining female backcross population. Altogether, 361.1 megabases of BAC contigs and singletons were integrated with a map containing 1,688 independent genes. A test of synteny using Oxford grid analysis with more than 500 silkworm genes revealed six versus 20 silkworm linkage groups containing eight or more orthologs of Apis versus Tribolium, respectively.

Conclusion

The integrated map contains approximately 10% of predicted silkworm genes and has an estimated 76% genome coverage by BACs. This provides a new resource for improved assembly of whole-genome shotgun data, gene annotation and positional cloning, and will serve as a platform for comparative genomics and gene discovery in Lepidoptera and other insects.  相似文献   

8.
The aminoacylation of tRNAs by the aminoacyl-tRNA synthetases recapitulates the genetic code by dictating the association between amino acids and tRNA anticodons. The sequences of tRNAs were analyzed to investigate the nature of primordial recognition systems and to make inferences about the evolution of tRNA gene sequences and the evolution of the genetic code. Evidence is presented that primordial synthetases recognized acceptor stem nucleotides prior to the establishment of the three major phylogenetic lineages. However, acceptor stem sequences probably did not achieve a level of sequence diversity sufficient to faithfully specify the anticodon assignments of all 20 amino acids. This putative bottleneck in the evolution of the genetic code may have been alleviated by the advent of anticodon recognition. A phylogenetic analysis of tRNA gene sequences from the deep Archaea revealed groups that are united by sequence motifs which are located within a region of the tRNA that is involved in determining its tertiary structure. An association between the third anticodon nucleotide (N36) and these sequence motifs suggests that a tRNA-like structure existed close to the time that amino acid-anticodon assignments were being established. The sequence analysis also revealed that tRNA genes may evolve by anticodon mutations that recruit tRNAs from one isoaccepting group to another. Thus tRNA gene evolution may not always be monophyletic with respect to each isoaccepting group.Based on a presentation made at a workshop— Aminoacyl-tRNA Synthetases and the Evolution of the Genetic Code—held at Berkeley, CA, July 17–20, 1994 Correspondence to: M.E. Saks  相似文献   

9.
Mice bearing the H-2 w7 haplotype have five C4-related genes and constitutively express the Slp antigen. To understand the structure and evolution of the five C4-related genes of the C3H.W7 mouse, we have determined nucleotide sequences of the 5 end region of these genes. A C4/Slp hybrid nature was confirmed for three of five C4-related genes as predicted previously by restriction enzyme analysis. The nucleotide sequences of the 5 flanking regions of these three hybrid genes showed close similarity to that of the C4 gene, while the 3 side of the ninth exon of the three hybrid genes showed close similarity to that of the Slp gene. In contrast, the regions between the first exon and the middle of the ninth exon of the three hybrid genes showed a mosaic structure of C4-like and Slp-like sequences. Moreover, the boundaries of the C4-like and Slp-like sequences were quite different among the three hybrid genes. The pattern of nucleotide sequence diversity in this region among the five C4-related sequences could be mainly explained not by point mutations but by gene conversions or unequal crossovers. These results suggest that multiple genetic recombinational events between two homologous sequences played an important role in the generation and diversification of the extra copies of the C4/Slp gene in the H-2 w7 mouse.The nucleotide sequence data reported in this paper have been submitted to the DDBJ, EMBL, and GenBank nucleotide sequence databases and have been assigned the accession numbers D90167-71.  相似文献   

10.
Here we compare the exonic sequences of four Group 1 mouse major urinary protein (MUP) genes and four Group 1 cDNA sequences. These define seven different nucleotide sequences which differ from each other by 0.35% of bases on average, and which would code for seven different MUP proteins that could probably be resolved physically into at least five classes. The sequences differ at 13 nucleotide positions and at six codons, and although they are closely related their descent cannot be described by a simple series of duplications. We also describe the sequence of another liver cDNA (pMUP15) which has diverged from the Group 1 consensus sequence in 14.6% of bases. The divergence is much greater over exons 1-3 than over exons 4-6, suggesting that an ancestral gene conversion event has occurred. pMUP15 also differs from the Group 1 genes in having a longer signal peptide sequence and a different splice configuration between exons 6 and 7. Unlike the Group 1 sequences, pMUP15 contains a potential N-linked glycosylation site. Other published work has shown that a shorter cDNA clone which is identical over their common sequence to pMUP15 codes for MUP proteins that are unusually large in size and acidic in pI. We show here that mouse urine does indeed contain a glycosylated MUP protein with those properties, presumably the product of the gene that corresponds to pMUP15.  相似文献   

11.
In the olive fruit fly Bactrocera oleae and the med fly Ceratitis capitata previous studies have shown the existence of two Adh genes in each species. This observation, in combination with the former finding that various Drosophila species of virilis and repleta group encode two isozymes of ADH which are the result of a gene duplication, challenged us to address a scenario dealing with the evolutionary history of the Adh gene duplication in Tephritidae. In our lab we proceeded to the cloning and sequence analysis of Adh genes from more tephritid species, a prerequisite for further study of this issue. Here we show that phylogenetic trees produced from either the nucleotide or the amino acid sequences of 14 tephritid Adh genes consisted of two main clusters, with Adh sequences of the same type grouping together (i.e., Adh1 sequences form a cluster and Adh2 sequences form a second one), as expected if there was one duplication event before speciation within the family Tephritidae. We used the amount of divergence between the two isozymic forms of Adh of the species carrying both Adh1 and Adh2 genes to obtain an estimate of the age of the duplication event. Interestingly, our data again support the hypothesis that the duplication of an ancestral Adh single gene in the family Tephritidae occurred before the emergence of the genera Bactrocera and Ceratitis, thus suggesting that Adh duplication was based on a prespeciation rather than a postspeciation event that might have involved two independent duplication events, one in each of the two genera.  相似文献   

12.
A gene complex, dot/icm, located in two independent chromosomal loci of L. pneumophila, the causative agent of Legionnaires' disease, is related to virulence. To investigate the evolutionary pattern of these pathogenicity islands of L. pneumophila, portions of four genes in the dot/icm complex, namely, dotA, dotB, icmB, and icmT, were amplified, sequenced, and phylogenetically analyzed, in addition to rpoB, which encodes an RNA polymerase -subunit. The nucleotide sequences and phylogenetic analyses of these five genes of 96 L. pneumophila strains revealed that several subgroups of L. pneumophila proliferated clonally. However, incongruent gene tree topologies and the results of statistical testing (Templeton Willcoxon signed-ranked and incongruence length differences tests) indicated that the evolutionary histories of these genes within the pathogenicity islands are not uniform, and that they constitute a mosaic structure. In addition, the non-uniform grouping of some reference strains suggests that intraspecific recombination might be still occurring in nature or in the laboratory.  相似文献   

13.
Globin mRNA isolated from anemic chicken was transcribed into cDNA and integrated into the Pst I cleavage site of plasmid pBR 322. After cloning in E. coli strain HB 101 and colony hybridization with125I-labelled globin mRNA the plasmids of individual clones were characterized by hybrid arrested cell-free translation. Thus we could isolate clones containing or globin chain nucleotide sequences.DNA was isolated from chicken blastoderms incubated for 18–20 h and from 11 d chicken embryos. A comparison of the restriction maps of the DNA from the two developmental stages with labelled nick translated plasmids and labelled cDNA did not indicate any globin gene rearrangements between these two stages of embryonic development. We conclude, that the adult chicken globin genes show a constant genomic organization during embryonic development. However, the restriction patterns of the globin gene family of the chicken strain investigated revealed some differences after 2 generations of propagation.  相似文献   

14.
Summary The general organization and primary amino acid sequences of theS. purpuratus cytoskeletal actin genes CyIIb and CyIIIb have been determined from restriction enzyme analysis, DNA sequencing, and RNA mapping studies. As is the case with the other sea urchin cytoskeletal actin genes previously studied, the CyIIb and CyIIIb genes contain two introns that interrupt the coding DNA following codon 121 and within codon 204. An intron ending 26–27 nucleotides (nt) upstream of the initiation codon has also been localized in the 5-flanking region of both genes. The CyIIb gene, which is part of a cluster of three genes linked in the order CyI-CyIIa-CyIIb, encodes a protein that differs from CyI by a single residue and from CyIIa by three residues. The substitutions observed within this linkage group are relatively conservative changes, and pairwise comparisons between genes indicate less than 5% mismatch in nucleotide sequence within the coding region. Nucleotide sequence comparisons of 5-flanking region and intron DNA, however, indicate greater similarity between the CyI and CyIIb genes than the CyIIa gene that separates them, suggestive of a potential gene conversion event between the flanking genes in the CyI-CyIIa-CyIIb linkage.The CyIIIb gene, part of a separate cluster of two functional genes ordered CyIIIa-CyIIIb, shares little similarity outside of coding DNA with genes of the other linkage group. Although CyIIIb exhibits strong nucleotide sequence similarity outside of coding DNA with the neighboring CyIIIa gene, it differs from that gene at six codons. The CyIIIb gene encodes a protein considerably different from all cytoskeletal actins previously reported, with changes clustered in the latter 40% of the coding sequence. An 81-nt tandem duplication of the C-terminal coding region is located adjacent to the termination codon of the CyIIIb gene, a potential relic of a slipped mispairing and replication event.  相似文献   

15.
The nucleotide sequences of 13 cDNAs encoding group II phospholipases A2 (PLA2 S), which are from viperidae snake venoms and from mammalian sources, were aligned and analyzed by phylogenetic trees constructed using various components of the sequences. The evolutionary trees derived from the combined sequences of the untranslated (5 and 3) region and the signal peptide region of cDNAs were in accord with the consequences from taxonomy. In contrast, the evolutionary trees from the mature protein-coding region sequences of cDNAs and from the amino acid sequences showed random patterns. These observations indicated that the mature protein-coding region has evolved through a process differently from the untranslated and signal peptide regions. The trees built from the nucleotide differences at each of three positions of codons in the mature protein-coding region suggested that snakevenom-gland PLA2 genes have evolved via a process different from mammalian PLA2 genes. The occurrence of accelerated evolution has been recently discovered in Trimeresurus flavoviridis venom-gland group II PLA2 isozyme genes (Nakashima et al. 1993, Proc Natl Acad Sci USA 90:5964–5968), so the present phylogenetic analysis together with the estimation of nucleotide divergence of cDNAs provides further evidence that snakevenom-group II PLA2 isozyme genes have evolved by accelerated evolution to gain diverse physiological activities. Correspondence to: M. Ohno  相似文献   

16.
Molecular phylogeny of the species Escherichia coli using the E. coli reference (ECOR) collection strains has been hampered by (1) the absence of rooting in the commonly used phenogram obtained from multilocus enzyme electrophoresis (MLEE) data and (2) the existence of recombination events between strains that scramble phylogenetic trees reconstructed from the nucleotide sequences of genes. We attempted to determine the phylogeny for E. coli based on the ECOR strain data by extracting from GenBank the nucleotide sequences of 11 chromosomal structural and 2 plasmid genes for which the Salmonella enterica homologous gene sequences were available. For each of the 13 DNA data sets studied, incongruence with a nonnucleotide whole-genome data set including MLEE, random amplified polymorphic DNA, and rrn restriction fragment length polymorphism data was measured using the incongruence length difference (ILD) test of Farris et al. As previously reported, the incongruence observed between the gnd and plasmid gene data and the whole-genome data was multiple, indicating numerous horizontal transfer and/or recombination events. In five cases, the incongruence detected by the ILD test was punctual, and the donor group was identified. Congruence was not rejected for the remaining data sets. The strains responsible for incongruences with the whole-genome data set were removed, leading to a "prior-agreement" approach, i.e., the determination of a phylogeny for E. coli based on several genes, excluding (1) the genes with multiple incongruences with the whole genome data, (2) the strains responsible for punctual incongruences, and (3) the genes incongruent with each other. The obtained phylogeny shows that the most basal group of E. coli strains is the B2 group rather than the A group, as generally thought. The D group then emerges as the sister group of the rest. Finally, the A and B1 groups are sister groups. Interestingly, the most primitive taxon within E. coli in terms of branching pattern, i.e., the B2 group, includes highly virulent extraintestinal strains with derived characters (extraintestinal virulence determinants) occurring on its own branch.   相似文献   

17.
Phormium yellow leaf (PYL) phytoplasma causes a lethal disease of the monocotyledon, New Zealand flax (Phormium tenax). The 16S rRNA genes of PYL phytoplasma were amplified from infected flax by PCR and cloned, and the nucleotide sequences were determined. DNA sequencing and Southern hybridization analysis of genomic DNA indicated the presence of two copies of the 16S rRNA gene. The two 16S rRNA genes exhibited sequence heterogeneity in 4 nucleotide positions and could be distinguished by the restriction enzymes BpmI and BsrI. This is the first record in which sequence heterogeneity in the 16S rRNA genes of a phytoplasma has been determined by sequence analysis. A phylogenetic tree based on 16S rRNA gene sequences showed that PYL phytoplasma is most closely related to the stolbur and German grapevine yellows phytoplasmas, which form the stolbur subgroup of the aster yellows group. This phylogenetic position of PYL phytoplasma was supported by 16S/23S spacer region sequence data.  相似文献   

18.
We present the nucleotide sequences of four members of the six-member human salivary prolinerich protein (PRP) gene family. The four genes are PRB1 and PRB2, which encode basic PRPs, and PRB3 and PRB4, which encode glycosylated PRPs. Each PRB gene is approximately 4.0 kb in length and contains four exons, the third of which is entirely composed of 63-bp tandem repeats and encodes the proline-rich portion of the protein products. Exon 3 contains different numbers of tandem repeats in the different PRB genes. Variation in the numbers of these repeats is also responsible for length variations in different alleles of the PRB genes. We have determined a probable evolutionary history of the human PRP gene family by comparing the nucleotide sequences of the six PRP genes. The present-day six PRP loci probably evolved from a single ancestral gene by four sequential gene duplications, leading to six genes that fall into three subsets, each consisting of two genes. During this evolutionary process, multiple rearrangements and gene conversion occurred mainly in the region from the 3 end of IVS2 and the 3 end of exon 3.  相似文献   

19.
Based on the recent hypothesis on the origin of eukaryotic phototrophs, red algae, green plants, and glaucophytes constitute the primary photosynthetic eukaryotes (whose plastids may have originated directly from a cyanobacterium-like prokaryote via primary endosymbiosis), whereas the plastids of other lineages of eukaryotic phototrophs appear to be the result of secondary or tertiary endosymbiotic events (involving a phototrophic eukaryote and a host cell). Although phylogenetic analyses using multiple plastid genes from a wide range of eukaryotic lineages have been carried out, some of the major phylogenetic relationships of plastids remain ambiguous or conflict between different phylogenetic methods used for nucleotide or amino acid substitutions. Therefore, an alternative methodology to infer the plastid phylogeny is needed. Here, we carried out a cladistic analysis of the loss of plastid genes after primary endosymbiosis using complete plastid genome sequences from a wide range of eukaryotic phototrophs. Since it is extremely unlikely that plastid genes are regained during plastid evolution, we used the irreversible Camin-Sokal model for our cladistic analysis of the loss of plastid genes. The cladistic analysis of the 274 plastid protein-coding genes resolved the 20 operational taxonomic units representing a wide range of eukaryotic lineages (including three secondary plastid-containing groups) into two large monophyletic groups with high bootstrap values: one corresponded to the red lineage and the other consisted of a large clade composed of the green lineage (green plants and Euglena) and the basal glaucophyte plastid. Although the sister relationship between the green lineage and the Glaucophyta was not resolved in recent phylogenetic studies using amino acid substitutions from multiple plastid genes, it is consistent with the rbcL gene phylogeny and with a recent phylogenetic study using multiple nuclear genes. In addition, our analysis robustly resolved the conflicting/ambiguous phylogenetic positions of secondary plastids in previous phylogenetic studies: the Euglena plastid was sister to the chlorophycean (Chlamydomonas) lineage, and the secondary plastids from the diatom (Odontiella) and cryptophyte (Guillardia) were monophyletic within the red lineage.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号