期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

On reliability of prediction in linear models

Ward R. 《Automatic Control, IEEE Transactions on》1981,26(6):1297-1299

For a linear noisy system assume an estimatehat{beta}of the unknown parameter vector was obtained from a certain past data on inputs and outputs of the system. The prediction of the output for any input vectorxis a linear combination ofxandhat{beta}. The question of concern is: For what - values of the inputs would the error in their predicted outputs be the smallest? It is found that they are the inputs which lie on the fitted plane of the orthogonal regression among the past data of the clean inputs. 相似文献

2.

Toward breast cancer survivability prediction models through improving training space

Jaree Thongkam Guandong Xu Yanchun Zhang Fuchun Huang 《Expert systems with applications》2009,36(10):12200-12209

Due to the difficulties of outlier and skewed data, the prediction of breast cancer survivability has presented many challenges in the field of data mining and pattern precognition, especially in medical research. To solve these problems, we have proposed a hybrid approach to generating higher quality data sets in the creation of improved breast cancer survival prediction models. This approach comprises two main steps: (1) utilization of an outlier filtering approach based on C-Support Vector Classification (C-SVC) to identify and eliminate outlier instances; and (2) application of an over-sampling approach using over-sampling with replacement to increase the number of instances in the minority class. In order to assess the capability and effectiveness of the proposed approach, several measurement methods including basic performance (e.g., accuracy, sensitivity, and specificity), Area Under the receiver operating characteristic Curve (AUC) and F-measure were utilized. Moreover, a 10-fold cross-validation method was used to reduce the bias and variance of the results of breast cancer survivability prediction models. Results have indicated that the proposed approach leads to improving the performance of breast cancer survivability prediction models by up to 28.34% due to the improved training data space. 相似文献

3.

Validity and reliability of evaluation procedures in comparative studies of effort prediction models

Ingunn Myrtveit Erik Stensrud 《Empirical Software Engineering》2012,17(1-2):23-33

We have in previous studies reported our findings and concern about the reliability and validity of the evaluation procedures used in comparative studies on competing effort prediction models. In particular, we have raised concerns about the use of accuracy statistics to rank and select models. Our concern is strengthened by the observed lack of consistent findings. This study offers more insights into the causes of conclusion instability by elaborating on the findings of our previous work concerning the reliability and validity of the evaluation procedures. We show that model selection based on the accuracy statistics MMRE, MMER, MBRE, and MIBRE contribute to conclusion instability as well as selection of inferior models. We argue and show that the evaluation procedure must include an evaluation of whether the functional form of the prediction model makes sense to better prevent selection of inferior models. 相似文献

4.

Predicting the recurrence of breast cancer using machine learning algorithms

Alzu’bi Amal Najadat Hassan Doulat Wesam Al-Shari Osama Zhou Leming 《Multimedia Tools and Applications》2021,80(9):13787-13800

Multimedia Tools and Applications - Breast cancer is one of the most common types of cancer among Jordanian women. Recently, healthcare organizations in Jordan have adopted electronic health... 相似文献

5.

The reliability issue of computer-aided breast cancer diagnosis.

B Kovalerchuk E Triantaphyllou J F Ruiz V I Torvik E Vityaev 《Computers and biomedical research》2000,33(4):296-313

This paper introduces a number of reliability criteria for computer-aided diagnostic systems for breast cancer. These criteria are then used to analyze some published neural network systems. It is also shown that the property of monotonicity for the data is rather natural in this medical domain, and it has the potential to significantly improve the reliability of breast cancer diagnosis while maintaining a general representation power. A central part of this paper is devoted to the representation/narrow vicinity hypothesis, upon which existing computer-aided diagnostic methods heavily rely. The paper also develops a framework for determining the validity of this hypothesis. The same framework can be used to construct a diagnostic procedure with improved reliability. 相似文献

6.

SREPT: software reliability estimation and prediction tool

Srinivasan Reference to Ramani Swapna Reference to S. Gokhale Kishor Reference to S. Trivedi 《Performance Evaluation》2000,39(1-4):37-60

Several tools have been developed for the estimation of software reliability. However, they are highly specialized in the approaches they implement and the particular phase of the software life-cycle in which they are applicable. There is an increasing need for a tool that can be used to track the quality of a software product during the software life-cycle, right from the architectural phase all the way up to the operational phase of the software. Also the conventional techniques for software reliability evaluation, which treat the software as a monolithic entity, are inadequate to assess the reliability of heterogeneous systems, which consist of a large number of globally distributed components. Architecture-based approaches are essential to assess the reliability and performance of such systems. This paper presents the high-level design of a software reliability estimation and prediction tool (SREPT), that offers a unified framework consisting of techniques (including the architecture-based approach) to assist in the evaluation of software reliability during all phases of the software life-cycle. 相似文献

7.

Interval predictor models: Identification and reliability

M.C. Campi Author Vitae G. Calafiore Author Vitae S. Garatti Author Vitae 《Automatica》2009,45(2):382-392

This paper addresses the problem of constructing reliable interval predictors directly from observed data. Differently from standard predictor models, interval predictors return a prediction interval as opposed to a single prediction value. We show that, in a stationary and independent observations framework, the reliability of the model (that is, the probability that the future system output falls in the predicted interval) is guaranteed a priori by an explicit and non-asymptotic formula, with no further assumptions on the structure of the unknown mechanism that generates the data. This fact stems from a key result derived in this paper, which relates, at a fundamental level, the reliability of the model to its complexity and to the amount of available information (number of observed data). 相似文献

8.

Neural network models for breast cancer prognosis

R. M. Ripley A. L. Harris L. Tarassenko 《Neural computing & applications》1998,7(4):367-375

Estimating the risk of relapse for breast cancer patients is necessary, since it affects the choice of treatment. This problem involves analysing data of times to relapse of patients and relating them to prognostic variables. Some of the times to relapse will usually be censored.We investigate various ways of using neural network models to extend traditional statistical models in this situation. Such models are better able to model both non-linear effects of prognostic factors and interactions between them, than linear logistic or Cox regression models. With the dataset used in our study, however, the prediction of the risk of relapse is not significantly improved when using a neural network model. Predicting the risk that a patient will relapse within three years, say, is possible from this data, but not when any relapse will happen. 相似文献

9.

Neural network prediction of relapse in breast cancer patients

L. Tarassenko R. Whitehouse G. Gasparini A. L. Harris 《Neural computing & applications》1996,4(2):105-113

When a woman diagnosed as having breast cancer has a tumour removed, it is important to try and predict whether she is likely to relapse within, say, the next three years. In this paper, the performance of a neural network classifier trained on a number of prognostic indicators is shown to be better than that of the clinical experts working with the same information. To obtain meaningful statistics with the relatively small dataset available, the network is trained using a modified form of the leave-one-out method. A procedure is also introduced for investigating how much independentinformation each input parameter contributes. This shows that, in this type of retrospective study, the type of therapy given to the woman does not significantly affect the network's prediction of whether or not she will relapse within three years. Finally, since this problem, in common with many other medical problems, is plagued by a shortage of data, the final section of the paper reports on an investigation of whether or not multi-centre databases might be feasible. 相似文献

10.

Explanation and prediction: an architecture for default and abductive reasoning 总被引：4，自引：0，他引：4

David Poole 《Computational Intelligence》1989,5(2):97-110

Although there are many arguments that logic is an appropriate tool for artificial intelligence, there has been a perceived problem with the monotonicity of classical logic. This paper elaborates on the idea that reasoning should be viewed as theory formation where logic tells us the consequences of our assumptions. The two activities of predicting what is expected to be true and explaining observations are considered in a simple theory formation framework. Properties of each activity are discussed, along with a number of proposals as to what should be predicted or accepted as reasonable explanations. An architecture is proposed to combine explanation and prediction into one coherent framework. Algorithms used to implement the system as well as examples from a running implementation are given. 相似文献

11.

The reliability of analogy-based prediction

D. V. Vinogradov 《Automatic Documentation and Mathematical Linguistics》2017,51(4):191-195

This paper is focused on choosing a sufficient number of runs of a coupling Markov chain that makes it possible to generate, with a high confidence level, hypotheses such that at least one of them is inserted into any test example with high probability of positive prediction. The proposed technique is based on the Vapnik–Chervonenkis resampling method. 相似文献

12.

Neural network models for group behavior prediction: a case of soccer match attendance

Strnad Damjan Nerat Andrej Kohek Štefan 《Neural computing & applications》2017,28(2):287-300

Soccer match attendance is an example of group behavior with noisy context that can only be approximated by a limited set of quantifiable factors. However, match attendance is representative of a wider spectrum of context-based behaviors for which only the aggregate effect of otherwise individual decisions is observable. Modeling of such behaviors is desirable from the perspective of economics, psychology, and other social studies with prospective use in simulators, games, product planning, and advertising. In this paper, we evaluate the efficiency of different neural network architectures as models of context in attendance behavior by comparing the achieved prediction accuracy of a multilayer perceptron (MLP), an Elman recurrent neural network (RNN), a time-lagged feedforward neural network (TLFN), and a radial basis function network (RBFN) against a multiple linear regression model, an autoregressive moving average model with exogenous inputs, and a naive cumulative mean model. We show that the MLP, TLFN, and RNN are superior to the RBFN and achieve comparable prediction accuracy on datasets of three teams from the English Football League Championship, which indicates weak importance of context transition modeled by the TLFN and the RNN. The experiments demonstrate that all neural network models outperform linear predictors by a significant margin. We show that neural models built on individual datasets achieve better performance than a generalized neural model constructed from pooled data. We analyze the input parameter influences extracted from trained networks and show that there is an agreement between nonlinear and linear measures about the most significant attributes.

相似文献

13.

Improving the prediction of the clinical outcome of breast cancer using evolutionary algorithms

M. Wahde Z. Szallasi 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2006,10(4):338-345

There exist several methods for binary classification of gene expression data sets. However, in the majority of published methods, little effort has been made to minimize classifier complexity. In view of the small number of samples available in most gene expression data sets, there is a strong motivation for minimizing the number of free parameters that must be fitted to the data. In this paper, a method is introduced for evolving (using an evolutionary algorithm) simple classifiers involving a minimal subset of the available genes. The classifiers obtained by this method perform well, reaching 97% correct classification of clinical outcome on training samples from the breast cancer data set published by van't Veer, and up to 89% correct classification on validation samples from the same data set, easily outperforming previously published results. 相似文献

14.

AdaBoost算法在乳腺癌疾病预测中的研究

叶琳石胜源罗铁清《计算机时代》2021,(7):61-64

为了研究AdaBoost算法在乳腺癌疾病预测中的应用,收集乳腺癌诊断数据集并按照一定的比例拆分成测试数据和训练数据.利用AdaBoost、GaussianNB、KNeighbors算法模型分别进行测试,以准确率为评价标准来评价模型性能的好坏.当测试数据占30％时,AdaBoost算法模型预测乳腺癌疾病优于其他算法模型,... 相似文献

15.

Revealing determinant factors for early breast cancer recurrence by decision tree

Jimin Guo Benjamin C. M. Fung Farkhund Iqbal Peter J. K. Kuppen Rob A. E. M. Tollenaar Wilma E. Mesker Jean-Jacques Lebrun 《Information Systems Frontiers》2017,19(6):1233-1241

Early breast cancer recurrence is indicative of poor response to adjuvant therapy and poses threats to patients’ lives. Most existing prediction models for breast cancer recurrence are regression-based models and difficult to interpret. We apply a Decision Tree algorithm to the clinical information of a cohort of non-metastatic invasive breast cancer patients, to establish a classifier that categorizes patients based on whether they develop early recurrence and on similarities of their clinical and pathological diagnoses. The classifier predicts for whether a patient developed early disease recurrence; and is estimated to be about 70% accurate. For an independent validation cohort of 65 patients, the classifier predicts correctly for 55 patients. The classifier also groups patients based on intrinsic properties of their diseases; and for each subgroup lists the disease characteristics in a hierarchal order, according to their relevance to early relapse. Overall, it identifies pathological nodal stage, percentage of intra-tumor stroma and components of TGFβ-Smad signaling pathway as highly relevant factors for early breast cancer recurrence. Since most of the disease characteristics used by this classifier are results of standardized tests, routinely collected during breast cancer diagnosis, the classifier can easily be adopted in various research and clinical settings. 相似文献

16.

Semi-automated and fully automated mammographic density measurement and breast cancer risk prediction

Rafael Llobet Marina Pollán Joaquín Antón Josefa Miranda-García María Casals Inmaculada Martínez Francisco Ruiz-Perales Beatriz Pérez-Gómez Dolores Salas-Trejo Juan-Carlos Pérez-Cortés 《Computer methods and programs in biomedicine》2014

The task of breast density quantification is becoming increasingly relevant due to its association with breast cancer risk. In this work, a semi-automated and a fully automated tools to assess breast density from full-field digitized mammograms are presented. The first tool is based on a supervised interactive thresholding procedure for segmenting dense from fatty tissue and is used with a twofold goal: for assessing mammographic density (MD) in a more objective and accurate way than via visual-based methods and for labeling the mammograms that are later employed to train the fully automated tool. Although most automated methods rely on supervised approaches based on a global labeling of the mammogram, the proposed method relies on pixel-level labeling, allowing better tissue classification and density measurement on a continuous scale. The fully automated method presented combines a classification scheme based on local features and thresholding operations that improve the performance of the classifier. A dataset of 655 mammograms was used to test the concordance of both approaches in measuring MD. Three expert radiologists measured MD in each of the mammograms using the semi-automated tool (DM-Scan). It was then measured by the fully automated system and the correlation between both methods was computed. The relation between MD and breast cancer was then analyzed using a case–control dataset consisting of 230 mammograms. The Intraclass Correlation Coefficient (ICC) was used to compute reliability among raters and between techniques. The results obtained showed an average ICC = 0.922 among raters when using the semi-automated tool, whilst the average correlation between the semi-automated and automated measures was ICC = 0.838. In the case–control study, the results obtained showed Odds Ratios (OR) of 1.38 and 1.50 per 10% increase in MD when using the semi-automated and fully automated approaches respectively. It can therefore be concluded that the automated and semi-automated MD assessments present a good correlation. Both the methods also found an association between MD and breast cancer risk, which warrants the proposed tools for breast cancer risk prediction and clinical decision making. A full version of the DM-Scan is freely available. 相似文献

17.

Recommender system: prediction/diagnosis of breast cancer using hybrid machine learning algorithm

Rani Shalli Kaur Manpreet Kumar Munish 《Multimedia Tools and Applications》2022,81(7):9939-9948

Multimedia Tools and Applications - Breast cancer is the second popular cause of the women’s death. There are some existing techniques for identifying the breast cancer and one of them is... 相似文献

18.

Identification of breast cancer biomarkers in transgenic mouse models: A proteomics approach

Rodenburg W Pennings JL van Oostrom CT Roodbergen M Kuiper RV Luijten M de Vries A 《Proteomics. Clinical applications》2010,4(6-7):603-612

相似文献

19.

Applying reliability models to the space shuttle

Schneidewind N.F. Keller T.W. 《Software, IEEE》1992,9(4):28-33

The experience of a team that evaluated many reliability models and tried to validate them for the on-board system software of the National Aeronautics and Space Administration's (NASA's) space shuttle is presented. It is shown that three separate but related functions comprise an integrated reliability program: prediction, control, and assessment. The application of the reliability model and the allocation of test resources as part of a testing strategy are discussed 相似文献

20.

Better reliability assessment and prediction through data clustering

Tian J. 《IEEE transactions on pattern analysis and machine intelligence》2002,28(10):997-1007

This paper presents a new approach to software reliability modeling by grouping data into clusters of homogeneous failure intensities. This series of data clusters associated with different time segments can be directly used as a piecewise linear model for reliability assessment and problem identification, which can produce meaningful results early in the testing process. The dual model fits traditional software reliability growth models (SRGMs) to these grouped data to provide long-term reliability assessments and predictions. These models were evaluated in the testing of two large software systems from IBM. Compared with existing SRGMs fitted to raw data, our models are generally more stable over time and produce more consistent and accurate reliability assessments and predictions. 相似文献