期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

全文获取类型

收费全文	1550篇
免费	153篇
国内免费	143篇

学科分类

工业技术

1846篇

出版年

2023年	5篇
2022年	12篇
2021年	14篇
2020年	13篇
2019年	24篇
2018年	19篇
2017年	29篇
2016年	31篇
2015年	38篇
2014年	55篇
2013年	50篇
2012年	87篇
2011年	129篇
2010年	120篇
2009年	140篇
2008年	142篇
2007年	138篇
2006年	141篇
2005年	121篇
2004年	95篇
2003年	117篇
2002年	89篇
2001年	49篇
2000年	35篇
1999年	53篇
1998年	19篇
1997年	10篇
1996年	9篇
1995年	9篇
1994年	9篇
1993年	10篇
1992年	5篇
1991年	5篇
1990年	1篇
1989年	2篇
1988年	3篇
1987年	1篇
1986年	3篇
1985年	1篇
1984年	2篇
1983年	2篇
1982年	1篇
1981年	2篇
1980年	1篇
1979年	1篇
1976年	1篇
1964年	1篇
1963年	1篇
1955年	1篇

排序方式： 共有1846条查询结果，搜索用时 15 毫秒

[首页] « 上一页 [5] [6] [7] [8] [9] 10 [11] [12] [13] [14] [15] 下一页 » 末页»

91.

Using phrases as features in email classification

Matthew Chang Author Vitae Author Vitae 《Journal of Systems and Software》2009,82(6):1036-1045

In this paper, we report our experience on the use of phrases as basic features in the email classification problem. We performed extensive empirical evaluation using our large email collections and tested with three text classification algorithms, namely, a naive Bayes classifier and two k-NN classifiers using TF-IDF weighting and resemblance respectively. The investigation includes studies on the effect of phrase size, the size of local and global sampling, the neighbourhood size, and various methods to improve the classification accuracy. We determined suitable settings for various parameters of the classifiers and performed a comparison among the classifiers with their best settings. Our result shows that no classifier dominates the others in terms of classification accuracy. Also, we made a number of observations on the special characteristics of emails. In particular, we observed that public emails are easier to classify than private ones. 相似文献

92.

Genetic algorithm for text clustering based on latent semantic indexing

Wei Song Soon Cheol Park 《Computers & Mathematics with Applications》2009,57(11-12):1901

In this paper, we develop a genetic algorithm method based on a latent semantic model (GAL) for text clustering. The main difficulty in the application of genetic algorithms (GAs) for document clustering is thousands or even tens of thousands of dimensions in feature space which is typical for textual data. Because the most straightforward and popular approach represents texts with the vector space model (VSM), that is, each unique term in the vocabulary represents one dimension. Latent semantic indexing (LSI) is a successful technology in information retrieval which attempts to explore the latent semantics implied by a query or a document through representing them in a dimension-reduced space. Meanwhile, LSI takes into account the effects of synonymy and polysemy, which constructs a semantic structure in textual data. GA belongs to search techniques that can efficiently evolve the optimal solution in the reduced space. We propose a variable string length genetic algorithm which has been exploited for automatically evolving the proper number of clusters as well as providing near optimal data set clustering. GA can be used in conjunction with the reduced latent semantic structure and improve clustering efficiency and accuracy. The superiority of GAL approach over conventional GA applied in VSM model is demonstrated by providing good Reuter document clustering results. 相似文献

93.

A new document representation using term frequency and vectorized graph connectionists with application to document retrieval

Tommy W.S. Chow Haijun Zhang M.K.M. Rahman 《Expert systems with applications》2009,36(10):12023-12035

This paper presents a new document representation with vectorized multiple features including term frequency and term-connection-frequency. A document is represented by undirected and directed graph, respectively. Then terms and vectorized graph connectionists are extracted from the graphs by employing several feature extraction methods. This hybrid document feature representation more accurately reflects the underlying semantics that are difficult to achieve from the currently used term histograms, and it facilitates the matching of complex graph. In application level, we develop a document retrieval system based on self-organizing map (SOM) to speed up the retrieval process. We perform extensive experimental verification, and the results suggest that the proposed method is computationally efficient and accurate for document retrieval. 相似文献

94.

Least squares twin support vector machines for pattern classification

M. Arun Kumar M. Gopal 《Expert systems with applications》2009,36(4):7535-7543

In this paper we formulate a least squares version of the recently proposed twin support vector machine (TSVM) for binary classification. This formulation leads to extremely simple and fast algorithm for generating binary classifiers based on two non-parallel hyperplanes. Here we attempt to solve two modified primal problems of TSVM, instead of two dual problems usually solved. We show that the solution of the two modified primal problems reduces to solving just two systems of linear equations as opposed to solving two quadratic programming problems along with two systems of linear equations in TSVM. Classification using nonlinear kernel also leads to systems of linear equations. Our experiments on publicly available datasets indicate that the proposed least squares TSVM has comparable classification accuracy to that of TSVM but with considerably lesser computational time. Since linear least squares TSVM can easily handle large datasets, we further went on to investigate its efficiency for text categorization applications. Computational results demonstrate the effectiveness of the proposed method over linear proximal SVM on all the text corpuses considered. 相似文献

95.

一种基于特征重要度的文本分类特征加权方法 总被引：4，自引：0，他引：4

刘赫刘大有裴志利高滢《计算机研究与发展》2009,46(10)

针对文本分类中的特征加权问题,提出了一种基于特征重要度的特征加权方法.该方法基于实数粗糙集理论,通过定义特征重要度,将特征对分类的决策信息引入到特征权重中.然后,在标准文本数据集Reuters-21578 Top10和WebKB上进行了实验.结果表明,该方法能改善样本空间的分布状态,使同类样本更加紧凑,异类样本更加松散,从而简化从样本到类别的映射关系.最后,使用Nave Bayes,kNN和SVM分类器在上述数据集上对该方法进行了实验.结果表明,该方法能提高分类的准确率、召回率和F1值. 相似文献

96.

一种基于XML的非结构化数据转换方法

杨晶周双娥《计算机科学》2017,44(Z11):414-417

XML作为半结构化的语言,因其能预先定义标记等优势被普遍应用于非结构化到结构化信息的转换中。利用POI技术把网络上繁杂的非结构化数据转化为XML半结构化数据,把半结构化数据转化为结构化数据,使用户能够简便地查询所需信息。通过实验对SAX,DOM的解析效率进行了对比,实验表明解析相同大小的XML文件,SAX比DOM效率更高,而且此种差距会随着XML文件的增大而逐渐增大。相似文献

97.

新的文本分类特征选择方法研究

张玉芳王勇刘明熊忠阳《计算机工程与应用》2013,(5)

特征降维是文本分类过程中的一个重要环节。在现有特征选择方法的基础上,综合考虑特征词在正类和负类中的分布性质,综合四种衡量特征类别区分能力的指标,提出了一个新的特征选择方法,即综合比率(CR)方法。实验采用K-最近邻分类算法(KNN)来考查CR方法的有效性,实验结果表明该方法能够取得比现有特征选择方法更优的降维效果。相似文献

98.

一种启发式多标记分类器选择与排序策略

李哲王志海何颖婧付彬《中文信息学报》2013,27(4):119-127

在多标记分类问题当中,多标记分类器的目的是为实例预测一个与其关联的标记集合。典型方法之一是将多标记分类问题转化为多个二类分类问题,这些二类分类器之间可以存在一定的关系。简单地考虑标记间依赖关系可以在一定程度上改善分类性能,但同时计算复杂度也是必须考虑的问题。该文提出了一种利用多标记间依赖关系的有序分类器集合算法,该算法通过启发式的搜索策略寻找分类器之间的某种次序,这种次序可以更好地反映标记间的依赖关系。在实验中,该文选取了来自不同领域的数据集和多个评价指标,实验结果表明该文所提出的算法比一般多标记分类算法具有更好的分类性能。相似文献

99.

A spectral analysis approach to document summarization: Clustering and ranking sentences simultaneously

Xiaoyan Cai Author Vitae Author Vitae 《Information Sciences》2011,181(18):3816-3827

Automatic document summarization aims to create a compressed summary that preserves the main content of the original documents. It is a well-recognized fact that a document set often covers a number of topic themes with each theme represented by a cluster of highly related sentences. More important, topic themes are not equally important. The sentences in an important theme cluster are generally deemed more salient than the sentences in a trivial theme cluster. Existing clustering-based summarization approaches integrate clustering and ranking in sequence, which unavoidably ignore the interaction between them. In this paper, we propose a novel approach developed based on the spectral analysis to simultaneously clustering and ranking of sentences. Experimental results on the DUC generic summarization datasets demonstrate the improvement of the proposed approach over the other existing clustering-based approaches. 相似文献

100.

Visual enhancement of old documents with hyperspectral imaging

Seon Joo Kim^{Author Vitae} Fanbo Deng Author VitaeAuthor Vitae 《Pattern recognition》2011,44(7):1461-1469

Hyperspectral imaging (HSI) of historical documents is becoming more common at national libraries and archives. HSI is useful for many tasks related to document conservation and management as it provides detailed quantitative measurements of the spectral reflectance of the document that is not limited to the visible spectrum. In this paper, we focus on how to use the invisible spectra, most notably near-infrared (NIR) bands, to assist in visually enhancing old documents. Specifically, we demonstrate how to use the invisible bands to improve the visual quality of text-based documents corrupted with undesired artifacts such as ink-bleed, ink-corrosion, and foxing. For documents of line drawings that suffer from low contrast, we use details found in the invisible bands to enhance legibility. The key components of our framework involve detecting regions in the document that can be enhanced by the NIR spectra, compositing the enhanced gradient map using the NIR bands, and reconstructing the final image from the composited gradients. This work is part of a collaborative effort with the Nationaal Archief of the Netherlands (NAN) and Art Innovation, a manufacturer of hyperspectral imaging hardware designed specially for historical documents. Our approach is evaluated on historical documents from NAN that exhibit degradations common to documents found in most archives and libraries. 相似文献

[首页] « 上一页 [5] [6] [7] [8] [9] 10 [11] [12] [13] [14] [15] 下一页 » 末页»