首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 656 毫秒
1.
Fu Q  Li L 《Analytical chemistry》2005,77(23):7783-7795
A stable-isotope dimethyl labeling strategy was previously shown to be a useful tool for quantitative proteomics. More recently, N-terminal dimethyl labeling was also reported for peptide sequencing in combination with database searching. Here, we extend these previous studies by incorporating N-terminal isotopic dimethylation for de novo sequencing of neuropeptides directly from tissue extract without any genomic information. We demonstrated several new sequencing applications of this method in addition to the identification of the N-terminal residue using the enhanced a(1) ion. The isotopic labeling also provides easier and more confident de novo sequencing of peptides by comparing similar MS/MS fragmentation patterns of the isotopically labeled peptide pairs. The current study on neuropeptides shows several distinct fragmentation patterns after N-terminal dimethylation which have not been reported previously. The y((n-1)) ion is enhanced in multiply charged peptides and is weak or missing in singly charged peptides. The MS/MS spectra of singly charged peptides are simplified due to the enhanced N-terminal fragments and suppressed internal fragments. The neutral loss of dimethylamine is also observed. The mechanisms for the above fragmentations are proposed. Finally, the structures of the immonium ion and related ions of N(alpha), N(epsilon)-tetramethylated lysine and N(epsilon)-dimethylated lysine are explored.  相似文献   

2.
With high-mass accuracy and consecutively obtained electron transfer dissociation (ETD) and higher-energy collisional dissociation (HCD) tandem mass spectrometry (MS/MS), reliable (≥97%) and sensitive fragment ions have been extracted for identification of specific amino acid residues in peptide sequences. The analytical benefit of these specific amino acid composition (AAC) ions is to restrict the database search space and provide identification of peptides with higher confidence and reduced false negative rates. The 6706 uniquely identified peptide sequences determined with a conservative Mascot score of >30 were used to characterize the AAC ions. The loss of amino acid side chains (small neutral losses, SNLs) from the charge reduced peptide radical cations was studied using ETD. Complementary AAC information from HCD spectra was provided by immonium ions. From the ETD/HCD mass spectra, 5162 and 6720 reliable SNLs and immonium ions were successfully extracted, respectively. Automated application of the AAC information during database searching resulted in an average 3.5-fold higher confidence level of peptide identification. In addition, 4% and 28% more peptides were identified above the significance level in a standard and extended search space, respectively.  相似文献   

3.
MALDI-quadrupole time-of-flight mass spectrometry was applied to identify proteins from organisms whose genomes are still unknown. The identification was carried out by successively searching a sequence database-first with a peptide mass fingerprint, then with a packet of noninterpreted MS/MS spectra, and finally with peptide sequences obtained by automated interpretation of the MS/MS spectra. A "MS BLAST" homology searching protocol was developed to overcome specific limitations imposed by mass spectrometric data, such as the limited accuracy of de novo sequence predictions. This approach was tested in a small-scale proteomic project involving the identification of 15 bands of gel-separated proteins from the methylotrophic yeast Pichia pastoris, whose genome has not yet been sequenced and which is only distantly related to other fungi.  相似文献   

4.
Gu S  Pan S  Bradbury EM  Chen X 《Analytical chemistry》2002,74(22):5774-5785
Here, we describe a method for protein identification and de novo peptide sequencing. Through in vivo cell culturing, the deuterium-labeled lysine residue (Lys-d4) introduces a 4-Da mass tag at the carboxyl terminus of proteolytic peptides when cleaved by certain proteases. The 4-Da mass difference between the unlabeled and the deuterated lysine assigns a mass signature to all lysine-containing peptides in any pool of proteolytic peptides for protein identification directly through peptide mass mapping. Furthermore, it was used to distinguish between N- and C-terminal fragments for accurate assignments of daughter ions in tandem MS/MS spectra for sequence assignment. This technique simplifies the labeling scheme and the interpretation of the MS/MS spectra by assigning different series of fragment ions correctly and easily and is very useful in de novo peptide sequencing. We have also successfully implemented this approach to the analysis of protein mixtures derived from the human proteome.  相似文献   

5.
The characterization of proteomes by mass spectrometry is largely limited to organisms with sequenced genomes. To identify proteins from organisms with unsequenced genomes, database sequences from related species must be employed for sequence-similarity protein identifications. Peptide sequence tags (Mann, 1994) have been used successfully for the identification of proteins in sequence databases using partially interpreted tandem mass spectra of tryptic peptides. We have extended the ability of sequence tag searching to the identification of proteins whose sequences are yet unknown but are homologous to known database entries. The MultiTag method presented here assigns statistical significance to matches of multiple error-tolerant sequence tags to a database entry and ranks alignments by their significance. The MultiTag approach has the distinct advantage over other sequence-similarity approaches of being able to perform sequence-similarity identifications using only very short (2-4) amino acid residue stretches of peptide sequences, rather than complete peptide sequences deduced by de novo interpretation of tandem mass spectra. This feature facilitates the identification of low abundance proteins, since noisy and low-intensity tandem mass spectra can be utilized.  相似文献   

6.
Pan S  Gu S  Bradbury EM  Chen X 《Analytical chemistry》2003,75(6):1316-1324
Identification of proteins with low sequence coverage using mass spectrometry (MS) requires tandem MS/MS peptide sequencing. It is very challenging to obtain a complete or to interpret an incomplete tandem MS/MS spectrum from fragmentation of a weak peptide ion signal for sequence assignment. Here, we have developed an effective and high-throughput MALDI-TOF-based method for the identification of membrane and other low-abundance proteins with a simple, one-dimensional separation step. In this approach, several stable isotope-labeled amino acid precursors were selected to mass-tag, in parallel, the human proteome of human skin fibroblast cells in a residue-specific manner during in vivo cell culturing. These labeled residues can be recognized by their characteristic isotope patterns in MALDI-TOF MS spectra. The isotope pattern of particular peptides induced by the different labeled precursors provides information about their amino acid compositions. The specificity of peptide signals in a peptide mass mapping is thus greatly enhanced, resolving a high degree of mass degeneracy of proteolytic peptides derived from the complex human proteome. Further, false positive matches in database searching can be eliminated. More importantly, proteins can be accurately identified through a single peptide with its m/z value and partial amino acid composition. With the increased solubility of hydrophobic proteins in SDS, we have demonstrated that our approach is effective for the identification of membrane and low-abundant proteins with low sequence coverage and weak signal intensity, which are often difficult for obtaining informative fragment patterns in tandem MS/MS peptide sequencing analysis.  相似文献   

7.
We investigated and compared three approaches for shotgun protein identification by combining MS and MS/MS information using LTQ-Orbitrap high mass accuracy data. In the first approach, we employed a unique mass identifier method where MS peaks matched to peptides predicted from proteins identified from an MS/MS database search are first subtracted before using the MS peaks as unique mass identifiers for protein identification. In the second method, we used an accurate mass and time tag method by building a potential mass and retention time database from previous MudPIT analyses. For the third method, we used a peptide mass fingerprinting-like approach in combination with a randomized database for protein identification. We show that we can improve protein identification sensitivity for low-abundance proteins by combining MS and MS/MS information. Furthermore, "one-hit wonders" from MS/MS database searching can be further substantiated by MS information and the approach improves the identification of low-abundance proteins. The advantages and disadvantages for the three approaches are then discussed.  相似文献   

8.
A MALDI QqTOF mass spectrometer has been used to identify proteins separated by one-dimensional or two-dimensional gel electrophoresis at the femtomole level. The high mass resolution and the high mass accuracy of this instrument in both MS and MS/MS modes allow identification of a protein either by peptide mass fingerprinting of the protein digest or from tandem mass spectra acquired by collision-induced dissociation of individual peptide precursors. A peptide mass map of the digest and tandem mass spectra of multiple peptide precursor ions can be acquired from the same sample in the course of a single experiment. Database searching and acquisition of MS and MS/MS spectra can be combined in an interactive fashion, increasing the information value of the analytical data. The approach has demonstrated its usefulness in the comprehensive characterization of protein in-gel digests, in the dissection of complex protein mixtures, and in sequencing of a low molecular weight integral membrane protein. Proteins can be identified in all types of sequence databases, including an EST database. Thus, MALDI QqTOF mass spectrometry promises to have remarkable potential for advancing proteomic research.  相似文献   

9.
Correct identification of a peptide sequence from MS/MS data is still a challenging research problem, particularly in proteomic analyses of higher eukaryotes where protein databases are large. The scoring methods of search programs often generate cases where incorrect peptide sequences score higher than correct peptide sequences (referred to as distraction). Because smaller databases yield less distraction and better discrimination between correct and incorrect assignments, we developed a method for editing a peptide-centric database (PC-DB) to remove unlikely sequences and strategies for enabling search programs to utilize this peptide database. Rules for unlikely missed cleavage and nontryptic proteolysis products were identified by data mining 11 849 high-confidence peptide assignments. We also evaluated ion exchange chromatographic behavior as an editing criterion to generate subset databases. When used to search a well-annotated test data set of MS/MS spectra, we found no loss of critical information using PC-DBs, validating the methods for generating and searching against the databases. On the other hand, improved confidence in peptide assignments was achieved for tryptic peptides, measured by changes in DeltaCN and RSP. Decreased distraction was also achieved, consistent with the 3-9-fold decrease in database size. Data mining identified a major class of common nonspecific proteolytic products corresponding to leucine aminopeptidase (LAP) cleavages. Large improvements in identifying LAP products were achieved using the PC-DB approach when compared with conventional searches against protein databases. These results demonstrate that peptide properties can be used to reduce database size, yielding improved accuracy and information capture due to reduced distraction, but with little loss of information compared to conventional protein database searches.  相似文献   

10.
We present a statistical model to estimate the accuracy of peptide assignments to tandem mass (MS/MS) spectra made by database search applications such as SEQUEST. Employing the expectation maximization algorithm, the analysis learns to distinguish correct from incorrect database search results, computing probabilities that peptide assignments to spectra are correct based upon database search scores and the number of tryptic termini of peptides. Using SEQUEST search results for spectra generated from a sample of known protein components, we demonstrate that the computed probabilities are accurate and have high power to discriminate between correctly and incorrectly assigned peptides. This analysis makes it possible to filter large volumes of MS/MS database search results with predictable false identification error rates and can serve as a common standard by which the results of different research groups are compared.  相似文献   

11.
Divinyl sulfone reacts at pH 8-9 with the alpha-amino groups of N-terminal residues, proline, the epsilon-amino groups of lysine, and the histidine side chains of peptides. This reaction leads to great enhancement of the abundance of the normally weak or missing "a(1)" fragment ion in MS/MS analysis defining the N-terminal residue of a peptide in a digest. This provides "one-step Edman-like" information that, together with a fairly accurately determined mass, often enables one to correctly identify a protein or family of proteins. The applicability of this procedure in proteomics was demonstrated with several peptides and tryptic digests of protein mixtures by LC-MS/MS experiments using a QTOF and MALDI-PSD analyses. Advantages of this approach are its simple chemistry, retention of charge multiplicity, and possibly, shortening of database search time. Used with other MS/MS data, it provides higher confidence in the scores and identification of a protein found in peptide mass fingerprinting. Moreover, this approach has an advantage in "de novo" sequencing due to its ability to decipher the first amino acid of a peptide whose information is normally unavailable in MS/MS spectra.  相似文献   

12.
A very high pressure liquid chromatography (VHPLC) system was constructed by modifying a commercially available pump in order to achieve pressures in excess of 1,200 bar (17,500 psi). A computer-controlled low-pressure mixer was used to generate solvent gradients. Protein digests were rapidly analyzed by reversed-phase VHPLC with linear solvent gradients coupled to either a tandem mass spectrometer using electrospray ionization or a UV/visible detector. The separations were performed at pressures ranging from 790 (11,500 psi) to 930 bar (13,500 psi) in 22-cm-long capillary columns packed with C18-modified 1.5-microm nonporous silica particles. A digest of bovine serum albumin (BSA) was analyzed by the VHPLC system connected to a mass spectrometer in MS mode. An analysis of 12.5 fmol of sample gave signal-to-noise ratios of tryptic peaks greater than 10:1 in the base peak plot mass chromatogram. This system was also used to analyze a proteolytic digest of a rat liver protein excised from a 2-D gel separation of a liver tissue lysate. For this analysis, the mass spectrometer was set up to perform data-dependent scanning (automated switching from MS mode to MS/MS mode when a peak was detected) for peptide sequencing and protein identification by database searching. The results of this analysis are compared to an analysis performed on the same sample using the nanoelectrospray-MS/MS technique. Though both techniques were able to identify the unknown protein, the VHPLC method gave twice as many sequenced peptides as nanoelectrospray and improved the signal-to-noise ratio of the spectra by at least a factor of 10. Direct comparisons with nanoelectrospray for MS and MS/MS data acquisition from a BSA digest were made. These comparisons show enhancements of greater than 20-fold for VHPLC over nanoelectrospray. In addition, the VHPLC/MS/MS data acquisition was accomplished in an automated manner.  相似文献   

13.
Infrared multiphoton dissociation (IRMPD) of N-terminal sulfonated peptides improves de novo sequencing capabilities in a quadrupole ion trap mass spectrometer. Not only does IRMPD promote highly efficient dissociation of the N-terminal sulfonated peptides but also the entire series of y ions down to the y(1) fragment may be detected due to alleviation of the low-mass cutoff problem associated with conventional collisional activated dissociation (CAD) methods in a quadrupole ion trap. Commercial de novo sequencing software was applied for the interpretation of CAD and IRMPD MS/MS spectra collected for seven unmodified peptides and the corresponding N-terminal sulfonated species. In most cases, the additional information obtained by N-terminal sulfonation in combination with IRMPD provided significant improvements in sequence identification. The software sequence tag results were combined with a commercial database searching algorithm to interpret sequence information of a tryptic digest on alpha-casein s1. Energy-variable CAD studies confirmed a 30-40% reduction in the critical energies of the N-terminal sulfonated peptides relative to unmodified peptides. This reduction in dissociation energy facilitates IRMPD in a quadrupole ion trap.  相似文献   

14.
A method for rapid and unambiguous identification of proteins by sequence database searching using the accurate mass of a single peptide and specific sequence constraints is described. Peptide masses were measured using electrospray ionization-Fourier transform ion cyclotron resonance mass spectrometry to an accuracy of 1 ppm. The presence of a cysteine residue within a peptide sequence was used as a database searching constraint to reduce the number of potential database hits. Cysteine-containing peptides were detected within a mixture of peptides by incorporating chlorine into a general alkylating reagent specific for cysteine residues. Secondary search constraints included the specificity of the protease used for protein digestion and the molecular mass of the protein estimated by gel electrophoresis. The natural isotopic distribution of chlorine encoded the cysteine-containing peptide with a distinctive isotopic pattern that allowed automatic screening of mass spectra. The method is demonstrated for a peptide standard and unknown proteins from a yeast lysate using all 6118 possible yeast open reading frames as a database. As judged by calculation of codon bias, low-abundance proteins were identified from the yeast lysate using this new method but not by traditional methods such as tandem mass spectrometry via data-dependent acquisition or mass mapping.  相似文献   

15.
A powerful technique for peptide and protein identification is tandem mass spectrometry followed by database search using a program such as SEQUEST or Mascot. These programs, however, become slow and lose sensitivity when allowing nonspecific cleavages or peptide modifications. De novo sequencing and hybrid methods such as sequence tagging offer speed and robustness for wider searches, yet these approaches require better spectra with more complete and consecutive fragmentation and, hence, are less sensitive to low-abundance peptides. Here we describe a new hybrid method that retains the sensitivity of pure database search. The method uses a small amount of de novo analysis to identify likely b- and y-ion peaks--"lookup peaks"--that can then be used to extract candidate peptides from the database, with the number of candidates tunable to fit a computing budget. We describe a program called ByOnic that implements this method, and we benchmark ByOnic on several data sets, including one of mouse blood plasma spiked with low concentrations of recombinant human proteins. We demonstrate that ByOnic is more sensitive than sequence tagging and, indeed, more sensitive than the three most popular pure database search tools--SEQUEST, Mascot, and X!Tandem--on both the peptide and protein levels. On the mouse plasma samples, ByOnic consistently found spiked proteins missed by the other tools.  相似文献   

16.
With the increasing availability of de novo sequencing algorithms for interpreting high-mass accuracy tandem mass spectrometry (MS/MS) data, there is a growing need for programs that accurately identify proteins from de novo sequencing results. De novo sequences derived from tandem mass spectra of peptides often contain ambiguous regions where the exact amino acid order cannot be determined. One problem this poses for sequence alignment algorithms is the difficulty in distinguishing discrepancies due to de novo sequencing errors from actual genomic sequence variation and posttranslational modifications. We present a novel, mass-based approach to sequence alignment, implemented as a program called OpenSea, to resolve these problems. In this approach, de novo and database sequences are interpreted as masses of residues, and the masses, rather than the amino acid codes, are compared. To provide further flexibility, the masses can be aligned in groups, which can resolve many de novo sequencing errors. The performance of OpenSea was tested with three types of data: a mixture of known proteins, a mixture of unknown proteins that commonly contain sequence variations, and a mixture of posttranslationally modified known proteins. In all three cases, we demonstrate that OpenSea can identify more peptides and proteins than commonly used database-searching programs (SEQUEST and ProteinLynx) while accurately locating sequence variation sites and unanticipated posttranslational modifications in a high-throughput environment.  相似文献   

17.
Subfemtomole peptide sequence analysis has been achieved using microcapillary HPLC columns, with integrated nanoelectrospray emitters, coupled directly to a Fourier transform ion cyclotron resonance mass spectrometer. Accurate mass (+/-0.010 Da) peptide maps are generated from a standard six-protein digest mixture, whose principle components span a concentration dynamic range of 1000:1. Iterative searches against approximately 189000 entries in the OWL database readily identify each protein, with high sequence coverage (20-60%), from as little as 10 amol loaded on-column. In addition, a simple variable-flow HPLC apparatus provides for on-line tandem mass spectrometric analysis of tryptic peptides at the 400-amol level. MS/MS data are searched against approximately 280000 entries in a nonredundant protein database using SEQUEST. Accurate precursor and product ion mass information readily identifies primary amino acid sequences differing by asparagine vs aspartic acid (deltam = 0.98 Da) and glutamine vs lysine (deltam = 0.036 Da).  相似文献   

18.
Detection and identification of pathogenic bacteria and their protein toxins play a crucial role in a proper response to natural or terrorist-caused outbreaks of infectious diseases. The recent availability of whole genome sequences of priority bacterial pathogens opens new diagnostic possibilities for identification of bacteria by retrieving their genomic or proteomic information. We describe a method for identification of bacteria based on tandem mass spectrometric (MS/MS) analysis of peptides derived from bacterial proteins. This method involves bacterial cell protein extraction, trypsin digestion, liquid chromatography MS/MS analysis of the resulting peptides, and a statistical scoring algorithm to rank MS/MS spectral matching results for bacterial identification. To facilitate spectral data searching, a proteome database was constructed by translating genomes of bacteria of interest with fully or partially determined sequences. In this work, a prototype database was constructed by the automated analysis of 87 publicly available, fully sequenced bacterial genomes with the GLIMMER gene finding software. MS/MS peptide spectral matching for peptide sequence assignment against this proteome database was done by SEQUEST. To gauge the relative significance of the SEQUEST-generated matching parameters for correct peptide assignment, discriminant function (DF) analysis of these parameters was applied and DF scores were used to calculate probabilities of correct MS/MS spectra assignment to peptide sequences in the database. The peptides with DF scores exceeding a threshold value determined by the probability of correct peptide assignment were accepted and matched to the bacterial proteomes represented in the database. Sequence filtering or removal of degenerate peptides matched with multiple bacteria was then performed to further improve identification. It is demonstrated that using a preset criterion with known distributions of discriminant function scores and probabilities of correct peptide sequence assignments, a test bacterium within the 87 database microorganisms can be unambiguously identified.  相似文献   

19.
Zhu K  Kim J  Yoo C  Miller FR  Lubman DM 《Analytical chemistry》2003,75(22):6209-6217
A method has been developed for high sequence coverage analysis of proteins isolated from breast cancer cell lines. Intact proteins are isolated using multidimensional liquid-phase separations that permit the collection of individual protein fractions. Protein digests are then analyzed by both matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) peptide mass fingerprinting and by capillary electrophoresis-electrospray ionization (CE-ESI)-TOF MS peptide mapping. These methods can be readily interfaced to the relatively clean proteins resulting from liquid-phase fractionation of cell lysates with little sample preparation. Using combined sequence information provided by both mapping methods, 100% sequence coverage is often obtained for smaller proteins, while for larger proteins up to 75 kDa, over 90% coverage can be obtained. Furthermore, an accurate intact protein MW value (within 150 ppm) can be obtained from ESI-TOF MS. The intact MW together with high coverage sequence information provides accurate identification. More notably the high sequence coverage of CE-ESI-TOF MS together with the MS/MS information provided by the ion trap/reTOF MS elucidates posttranslational modifications, sequence changes, truncations, and isoforms that may otherwise go undetected when standard MALDI-MS peptide fingerprinting is used. This capability is critical in the analysis of human cancer cells where large numbers of expressed proteins are modified, and these modifications may play an important role in the cancer process.  相似文献   

20.
Electrospray ionization (ESI) tandem mass spectrometry (MS/MS) of peptides in conjunction with automated sequence database searching of the resulting collision-induced dissociation (CID) spectra has become a powerful method for the identification of purified proteins or the components of protein mixtures. The success of the method is critically dependent on the manner by which the peptides are introduced into the mass spectrometer. In this report, we describe a capillary electrophoresis-based system for the automated, sensitive analysis of complex peptide mixtures. The system consists of an ESI-MS/MS instrument, a solid-phase extraction (SPE)-capillary zone electrophoresis (CZE) device for peptide concentration and separation, and an algorithm written in Instrument Control Language (ICL) which modulates the electrophoretic conditions in a data-dependent manner to optimize available time for the generation of high-quality CID spectra of peptides in complex samples. We demonstrate that the data-dependent modulation of the electric field significantly expands the analytical window for each peptide analyzed and that the sensitivity of the SPE-CZE technique is not noticeably altered by the procedure. By applying the technique to the analysis of in vivo phosphorylation sites of endothelial nitric oxide synthase (eNOS), we demonstrate the power of this system for the MS/MS analysis of minor peptide species in complex samples such as phosphopeptides generated by the proteolytic digestion of a large protein, eNOS, phosphorylated at low stoichiometry.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号