期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Pathway network inference from gene expression data

Ignacio Ponzoni María José Nueda Sonia Tarazona Stefan Götz David Montaner Julieta Sol Dussaut Joaquín Dopazo Ana Conesa 《BMC systems biology》2014,8(Z2):S7

相似文献

2.

Bagging statistical network inference from large-scale gene expression data

de Matos Simoes R Emmert-Streib F 《PloS one》2012,7(3):e33624

相似文献

3.

A Bayesian regression approach to the inference of regulatory networks from gene expression data 总被引：3，自引：0，他引：3

Rogers S Girolami M 《Bioinformatics (Oxford, England)》2005,21(14):3131-3137

MOTIVATION: There is currently much interest in reverse-engineering regulatory relationships between genes from microarray expression data. We propose a new algorithmic method for inferring such interactions between genes using data from gene knockout experiments. The algorithm we use is the Sparse Bayesian regression algorithm of Tipping and Faul. This method is highly suited to this problem as it does not require the data to be discretized, overcomes the need for an explicit topology search and, most importantly, requires no heuristic thresholding of the discovered connections. RESULTS: Using simulated expression data, we are able to show that this algorithm outperforms a recently published correlation-based approach. Crucially, it does this without the need to set any ad hoc threshold on possible connections. 相似文献

4.

Gene network inference from incomplete expression data: transcriptional control of hematopoietic commitment 总被引：2，自引：0，他引：2

Missal K Cross MA Drasdo D 《Bioinformatics (Oxford, England)》2006,22(6):731-738

MOTIVATION: The topology and function of gene regulation networks are commonly inferred from time series of gene expression levels in cell populations. This strategy is usually invalid if the gene expression in different cells of the population is not synchronous. A promising, though technically more demanding alternative is therefore to measure the gene expression levels in single cells individually. The inference of a gene regulation network requires knowledge of the gene expression levels at successive time points, at least before and after a network transition. However, owing to experimental limitations a complete determination of the precursor state is not possible. RESULTS: We investigate a strategy for the inference of gene regulatory networks from incomplete expression data based on dynamic Bayesian networks. This permits prediction of the number of experiments necessary for network inference depending on parameters including noise in the data, prior knowledge and limited attainability of initial states. Our strategy combines a gradual 'Partial Learning' approach based solely on true experimental observations for the network topology with expectation maximization for the network parameters. We illustrate our strategy by extensive computer simulations in a high-dimensional parameter space in a simulated single-cell-based example of hematopoietic stem cell commitment and in random networks of different sizes. We find that the feasibility of network inferences increases significantly with the experimental ability to force the system into different initial network states, with prior knowledge and with noise reduction. AVAILABILITY: Source code is available under: www.izbi.uni-leipzig.de/services/NetwPartLearn.html SUPPLEMENTARY INFORMATION: Supplementary Data are available at Bioinformatics online. 相似文献

5.

Statistical inference for simultaneous clustering of gene expression data 总被引：1，自引：0，他引：1

Pollard KS van der Laan MJ 《Mathematical biosciences》2002,176(1):99-121

Current methods for analysis of gene expression data are mostly based on clustering and classification of either genes or samples. We offer support for the idea that more complex patterns can be identified in the data if genes and samples are considered simultaneously. We formalize the approach and propose a statistical framework for two-way clustering. A simultaneous clustering parameter is defined as a function theta=Phi(P) of the true data generating distribution P, and an estimate is obtained by applying this function to the empirical distribution P(n). We illustrate that a wide range of clustering procedures, including generalized hierarchical methods, can be defined as parameters which are compositions of individual mappings for clustering patients and genes. This framework allows one to assess classical properties of clustering methods, such as consistency, and to formally study statistical inference regarding the clustering parameter. We present results of simulations designed to assess the asymptotic validity of different bootstrap methods for estimating the distribution of Phi(P(n)). The method is illustrated on a publicly available data set. 相似文献

6.

Recovering context-specific gene network modules from expression data: A brief review

Hui Yu Yuan-Yuan Li 《Frontiers of Biology in China》2009,4(4):414-418

With the popularization of microarray experiments in biomedical laboratories, how to make contextspecific knowledge discovery from expression data becomes a hot topic. While the static “reference networks” for key model organisms are nearly at hand, the endeavors to recover context-specific network modules are still at the beginning. Currently, this is achieved through filtering existing edges of the ensemble reference network or constructing gene networks ab initio. In this paper, we briefly review recent progress in the field and point out some research directions awaiting improved work, including expression-data-guided revision of reference networks. 相似文献

7.

Recovering context-specific gene network modules from expression data: A brief review

Hui Yu Yuan-Yuan Li 《生物学前沿》2009,4(4):414-418

With the popularization of microarray experi-ments in biomedical laboratories, how to make context-specific knowledge discovery from expression data becomes a hot topic. While the static "reference networks"for key model organisms are nearly at hand, the endeavors to recover context-specific network modules are still at the beginning. Currently, this is achieved through filtering existing edges of the ensemble reference network or constructing gene networks ab initio. In this paper, we briefly review recent progress in the field and point out some research directions awaiting improved work, includ-ing expression-data-guided revision of reference networks. 相似文献

8.

A Markov-blanket-based model for gene regulatory network inference

Ram R Chetty M 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2011,8(2):353-367

An efficient two-step Markov blanket method for modeling and inferring complex regulatory networks from large-scale microarray data sets is presented. The inferred gene regulatory network (GRN) is based on the time series gene expression data capturing the underlying gene interactions. For constructing a highly accurate GRN, the proposed method performs: 1) discovery of a gene's Markov Blanket (MB), 2) formulation of a flexible measure to determine the network's quality, 3) efficient searching with the aid of a guided genetic algorithm, and 4) pruning to obtain a minimal set of correct interactions. Investigations are carried out using both synthetic as well as yeast cell cycle gene expression data sets. The realistic synthetic data sets validate the robustness of the method by varying topology, sample size, time delay, noise, vertex in-degree, and the presence of hidden nodes. It is shown that the proposed approach has excellent inferential capabilities and high accuracy even in the presence of noise. The gene network inferred from yeast cell cycle data is investigated for its biological relevance using well-known interactions, sequence analysis, motif patterns, and GO data. Further, novel interactions are predicted for the unknown genes of the network and their influence on other genes is also discussed. 相似文献

9.

Applying unmixing to gene expression data for tumor phylogeny inference

Russell Schwartz Stanley E Shackney 《BMC bioinformatics》2010,11(1):42

Background

While in principle a seemingly infinite variety of combinations of mutations could result in tumor development, in practice it appears that most human cancers fall into a relatively small number of "sub-types," each characterized a roughly equivalent sequence of mutations by which it progresses in different patients. There is currently great interest in identifying the common sub-types and applying them to the development of diagnostics or therapeutics. Phylogenetic methods have shown great promise for inferring common patterns of tumor progression, but suffer from limits of the technologies available for assaying differences between and within tumors. One approach to tumor phylogenetics uses differences between single cells within tumors, gaining valuable information about intra-tumor heterogeneity but allowing only a few markers per cell. An alternative approach uses tissue-wide measures of whole tumors to provide a detailed picture of averaged tumor state but at the cost of losing information about intra-tumor heterogeneity. 相似文献

10.

ORIOGEN: order restricted inference for ordered gene expression data

Peddada S Harris S Zajd J Harvey E 《Bioinformatics (Oxford, England)》2005,21(20):3933-3934

SUMMARY: ORIOGEN is a user-friendly Java-based software package for selecting and clustering genes according to their profiles across various treatment groups. In particular, ORIOGEN is useful for analyzing data obtained from time-course or dose-response type experiments. AVAILABILITY: The ORIOGEN software can be downloaded freely from http://dir.niehs.nih.gov/dirbb/oriogen/index.cfm CONTACT: peddada@niehs.nih.gov (for statistical questions) and oriogen@constellagroup.com (for software support) SUPPLEMENTARY INFORMATION: ORIOGEN has a full set of help files. Also, an example input file is provided with the download. 相似文献

11.

A Bayesian network classification methodology for gene expression data. 总被引：5，自引：0，他引：5

Paul Helman Robert Veroff Susan R Atlas Cheryl Willman 《Journal of computational biology》2004,11(4):581-615

We present new techniques for the application of a Bayesian network learning framework to the problem of classifying gene expression data. The focus on classification permits us to develop techniques that address in several ways the complexities of learning Bayesian nets. Our classification model reduces the Bayesian network learning problem to the problem of learning multiple subnetworks, each consisting of a class label node and its set of parent genes. We argue that this classification model is more appropriate for the gene expression domain than are other structurally similar Bayesian network classification models, such as Naive Bayes and Tree Augmented Naive Bayes (TAN), because our model is consistent with prior domain experience suggesting that a relatively small number of genes, taken in different combinations, is required to predict most clinical classes of interest. Within this framework, we consider two different approaches to identifying parent sets which are supported by the gene expression observations and any other currently available evidence. One approach employs a simple greedy algorithm to search the universe of all genes; the second approach develops and applies a gene selection algorithm whose results are incorporated as a prior to enable an exhaustive search for parent sets over a restricted universe of genes. Two other significant contributions are the construction of classifiers from multiple, competing Bayesian network hypotheses and algorithmic methods for normalizing and binning gene expression data in the absence of prior expert knowledge. Our classifiers are developed under a cross validation regimen and then validated on corresponding out-of-sample test sets. The classifiers attain a classification rate in excess of 90% on out-of-sample test sets for two publicly available datasets. We present an extensive compilation of results reported in the literature for other classification methods run against these same two datasets. Our results are comparable to, or better than, any we have found reported for these two sets, when a train-test protocol as stringent as ours is followed. 相似文献

12.

Bayesian inference of MicroRNA targets from sequence and expression data.

Jim C Huang Quaid D Morris Brendan J Frey 《Journal of computational biology》2007,14(5):550-563

MicroRNAs (miRNAs) regulate a large proportion of mammalian genes by hybridizing to targeted messenger RNAs (mRNAs) and down-regulating their translation into protein. Although much work has been done in the genome-wide computational prediction of miRNA genes and their target mRNAs, an open question is how to efficiently obtain functional miRNA targets from a large number of candidate miRNA targets predicted by existing computational algorithms. In this paper, we propose a novel Bayesian model and learning algorithm, GenMiR++ (Generative model for miRNA regulation), that accounts for patterns of gene expression using miRNA expression data and a set of candidate miRNA targets. A set of high-confidence functional miRNA targets are then obtained from the data using a Bayesian learning algorithm. Our model scores 467 high-confidence miRNA targets out of 1,770 targets obtained from TargetScanS in mouse at a false detection rate of 2.5%: several confirmed miRNA targets appear in our high-confidence set, such as the interactions between miR-92 and the signal transduction gene MAP2K4, as well as the relationship between miR-16 and BCL2, an anti-apoptotic gene which has been implicated in chronic lymphocytic leukemia. We present results on the robustness of our model showing that our learning algorithm is not sensitive to various perturbations of the data. Our high-confidence targets represent a significant increase in the number of miRNA targets and represent a starting point for a global understanding of gene regulation. 相似文献

13.

Maximum likelihood inference of imprinting and allele-specific expression from EST data

Seoighe C Nembaware V Scheffler K 《Bioinformatics (Oxford, England)》2006,22(24):3032-3039

相似文献

14.

Protein network inference from multiple genomic data: a supervised approach

Yamanishi Y Vert JP Kanehisa M 《Bioinformatics (Oxford, England)》2004,20(Z1):i363-i370

相似文献

15.

Integrating heterogeneous gene expression data for gene regulatory network modelling

Sîrbu A Ruskin HJ Crane M 《Theorie in den Biowissenschaften》2012,131(2):95-102

Gene regulatory networks (GRNs) are complex biological systems that have a large impact on protein levels, so that discovering network interactions is a major objective of systems biology. Quantitative GRN models have been inferred, to date, from time series measurements of gene expression, but at small scale, and with limited application to real data. Time series experiments are typically short (number of time points of the order of ten), whereas regulatory networks can be very large (containing hundreds of genes). This creates an under-determination problem, which negatively influences the results of any inferential algorithm. Presented here is an integrative approach to model inference, which has not been previously discussed to the authors' knowledge. Multiple heterogeneous expression time series are used to infer the same model, and results are shown to be more robust to noise and parameter perturbation. Additionally, a wavelet analysis shows that these models display limited noise over-fitting within the individual datasets. 相似文献

16.

Cluster-based network model for time-course gene expression data

Inoue LY Neira M Nelson C Gleave M Etzioni R 《Biostatistics (Oxford, England)》2007,8(3):507-525

We propose a model-based approach to unify clustering and network modeling using time-course gene expression data. Specifically, our approach uses a mixture model to cluster genes. Genes within the same cluster share a similar expression profile. The network is built over cluster-specific expression profiles using state-space models. We discuss the application of our model to simulated data as well as to time-course gene expression data arising from animal models on prostate cancer progression. The latter application shows that with a combined statistical/bioinformatics analyses, we are able to extract gene-to-gene relationships supported by the literature as well as new plausible relationships. 相似文献

17.

Supervised enzyme network inference from the integration of genomic data and chemical information

Yamanishi Y Vert JP Kanehisa M 《Bioinformatics (Oxford, England)》2005,21(Z1):i468-i477

相似文献

18.

Transcriptional gene network inference from a massive dataset elucidates transcriptome organization and gene function

Belcastro V Siciliano V Gregoretti F Mithbaokar P Dharmalingam G Berlingieri S Iorio F Oliva G Polishchuck R Brunetti-Pierri N di Bernardo D 《Nucleic acids research》2011,39(20):8677-8688

相似文献

19.

FUMET: A fuzzy network module extraction technique for gene expression data

Priyakshi Mahanta Hasin Afzal Ahmed Dhruba Kumar Bhattacharyya Ashish Ghosh 《Journal of biosciences》2014,39(3):351-364

Construction of co-expression network and extraction of network modules have been an appealing area of bioinformatics research. This article presents a co-expression network construction and a biologically relevant network module extraction technique based on fuzzy set theoretic approach. The technique is able to handle both positive and negative correlations among genes. The constructed network for some benchmark gene expression datasets have been validated using topological internal and external measures. The effectiveness of network module extraction technique has been established in terms of well-known p-value, Q-value and topological statistics. 相似文献

20.

A regulatory network modeled from wild-type gene expression data guides functional predictions in Caenorhabditis elegans development

Brandilyn Stigler Helen M Chamberlin 《BMC systems biology》2012,6(1):1-15

Background

The iJO1366 reconstruction of the metabolic network of Escherichia coli is one of the most complete and accurate metabolic reconstructions available for any organism. Still, because our knowledge of even well-studied model organisms such as this one is incomplete, this network reconstruction contains gaps and possible errors. There are a total of 208 blocked metabolites in iJO1366, representing gaps in the network.

Results

A new model improvement workflow was developed to compare model based phenotypic predictions to experimental data to fill gaps and correct errors. A Keio Collection based dataset of E. coli gene essentiality was obtained from literature data and compared to model predictions. The SMILEY algorithm was then used to predict the most likely missing reactions in the reconstructed network, adding reactions from a KEGG based universal set of metabolic reactions. The feasibility of these putative reactions was determined by comparing updated versions of the model to the experimental dataset, and genes were predicted for the most feasible reactions.

Conclusions

Numerous improvements to the iJO1366 metabolic reconstruction were suggested by these analyses. Experiments were performed to verify several computational predictions, including a new mechanism for growth on myo-inositol. The other predictions made in this study should be experimentally verifiable by similar means. Validating all of the predictions made here represents a substantial but important undertaking. 相似文献