期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Image annotation by semi-supervised cross-domain learning with group sparsity

Ying Yuan Fei Wu Jian Shao Yueting Zhuang 《Journal of Visual Communication and Image Representation》2013,24(2):95-102

With the explosive growth of multimedia data in the web, multi-label image annotation has been attracted more and more attention. Although the amount of available data is large and growing, the number of labeled data is quite small. This paper proposes an approach to utilize both unlabeled data in target domain and labeled data in auxiliary domain to boost the performance of image annotation. Moreover, since different kinds of heterogeneous features in images have different intrinsic discriminative power for image understanding, group sparsity is introduced in our approach to effectively utilize those heterogeneous visual features with data of target and auxiliary domains. We call this approach semi-supervised cross-domain learning with group sparsity (S²CLGS). The strength of the proposed S²CLGS method for multi-label image annotation is to integrate semi-supervised discriminant analysis, cross-domain learning and sparse coding together. Experiments demonstrate the effectiveness of S²CLGS in comparison with other image annotation algorithms. 相似文献

2.

基于Matlab的图像自动标注

张轩臧淼李金泉《现代电子技术》2014,(3):73-75

图像自动标注在检索大量数字图像时起到关键作用,它能将图像的视觉特征转化为图像的标注字信息,为用户的使用及检索带来极大的方便。研究了图像自动语义标注方法,设计并实现了基于Matlab图像自动标注系统,能够提取图像颜色特征和纹理特征,与已标注图像进行相似性度量并标注出图像语义关键词相似文献

3.

Image reconstruction with locally adaptive sparsity and nonlocal robust regularization

Weisheng Dong Guangming Shi Xin Li Lei Zhang Xiaolin Wu 《Signal Processing: Image Communication》2012,27(10):1109-1122

Sparse representation based modeling has been successfully used in many image-related inverse problems such as deblurring, super-resolution and compressive sensing. The heart of sparse representations lies on how to find a space (spanned by a dictionary of atoms) where the local image patch exhibits high sparsity and how to determine the image local sparsity. To identify the locally varying sparsity, it is necessary to locally adapt the dictionary learning process and the sparsity-regularization parameters. However, spatial adaptation alone runs into the risk of over-fitting the data because variation and invariance are two sides of the same coin. In this work, we propose two sets of complementary ideas for regularizing image reconstruction process: (1) the sparsity regularization parameters are locally estimated for each coefficient and updated along with adaptive learning of PCA-based dictionaries; (2) a nonlocal self-similarity constraint is introduced into the overall cost functional to improve the robustness of the model. An efficient alternative minimization algorithm is present to solve the proposed objective function and then an effective image reconstruction algorithm is presented. The experimental results on image deblurring, super-resolution and compressive sensing demonstrate that the proposed image reconstruct method outperforms many existing image reconstruction methods in both PSNR and visual quality assessment. 相似文献

4.

Saliency detection by multitask sparsity pursuit

Lang C Liu G Yu J Yan S 《IEEE transactions on image processing》2012,21(3):1327-1338

This paper addresses the problem of detecting salient areas within natural images. We shall mainly study the problem under unsupervised setting, i.e., saliency detection without learning from labeled images. A solution of multitask sparsity pursuit is proposed to integrate multiple types of features for detecting saliency collaboratively. Given an image described by multiple features, its saliency map is inferred by seeking the consistently sparse elements from the joint decompositions of multiple-feature matrices into pairs of low-rank and sparse matrices. The inference process is formulated as a constrained nuclear norm and as an l(2, 1)-norm minimization problem, which is convex and can be solved efficiently with an augmented Lagrange multiplier method. Compared with previous methods, which usually make use of multiple features by combining the saliency maps obtained from individual features, the proposed method seamlessly integrates multiple features to produce jointly the saliency map with a single inference step and thus produces more accurate and reliable results. In addition to the unsupervised setting, the proposed method can be also generalized to incorporate the top-down priors obtained from supervised environment. Extensive experiments well validate its superiority over other state-of-the-art methods. 相似文献

5.

Image annotation using high order statistics in non-Euclidean spaces

Songhao Zhu Juanjuan Hu Baoyun Wang Shuhan Shen 《Journal of Visual Communication and Image Representation》2013,24(8):1342-1348

Automatic image annotation is a promising way to achieve more effective image retrieval and image analysis by using keywords associated to the image content. Due to the semantic gap between low-level visual features and high-level semantic concepts of an image, however, the performances of many existing algorithms are not so satisfactory. In this paper, a novel image classification scheme, named high order statistics based maximum a posterior (HOS-MAP), is proposed to deal with the issue of image annotation. To bridge the gap between human judgment and machine intelligence, the proposed scheme first constructs a dissimilarity representation for each image in a non-Euclidean space; then, the information of dissimilarity diffusion distribution for each image is achieved with respect to the high-order statistics of a triplet of nearest neighbor images; finally, a maximum a posteriori algorithm with the information of Gaussian Mixture Model and dissimilarity diffusion distribution is adopted to estimate the relevance between each annotation and an input un-annotated image. Experimental results on a general-purpose image database demonstrate the effectiveness and efficiency of the proposed automatic image annotation scheme. 相似文献

6.

Image classification and annotation based on robust regularized coding

Haixia Zheng Horace H. S. Ip 《Signal, Image and Video Processing》2016,10(1):55-64

相似文献

7.

Image denoising using combined higher order non-convex total variation with overlapping group sparsity

Adam Tarmizi Paramesran Raveendran 《Multidimensional Systems and Signal Processing》2019,30(1):503-527

Multidimensional Systems and Signal Processing - It is widely known that the total variation image restoration suffers from the stair casing artifacts which results in blocky restored images. In... 相似文献

8.

Image distance metric learning based on neighborhood sets for automatic image annotation

《Journal of Visual Communication and Image Representation》2016

Since there is semantic gap between low-level visual features and high-level image semantic, the performance of many existing content-based image annotation algorithms is not satisfactory. In order to bridge the gap and improve the image annotation performance, a novel automatic image annotation (AIA) approach using neighborhood set (NS) based on image distance metric learning (IDML) algorithm is proposed in this paper. According to IDML, we can easily obtain the neighborhood set of each image since obtained image distance can effectively measure the distance between images for AIA task. By introducing NS, the proposed AIA approach can predict all possible labels of the image without caption. The experimental results confirm that the introduction of NS based on IDML can improve the efficiency of AIA approaches and achieve better annotation performance than the existing AIA approaches. 相似文献

9.

Image annotation using multi-view non-negative matrix factorization with different number of basis vectors

《Journal of Visual Communication and Image Representation》2017

Automatic Image Annotation (AIA) helps image retrieval systems by predicting tags for images. In this paper, we propose an AIA system using Non-negative Matrix Factorization (NMF) framework. The NMF framework discovers a latent space, by factorizing data into a set of non-negative basis and coefficients. To model the images, multiple features are extracted, each one represents images from a specific view. We use multi-view graph regularization NMF and allow NMF to choose a different number of basis vectors for each view. For tag prediction, each test image is mapped onto the multiple latent spaces. The distances of images in these spaces are used to form a unified distance matrix. The weights of distances are learned automatically. Then a search-based method is used to predict tags based on tags of nearest neighbors’. We evaluate our method on three datasets and show that it is competitive with the current state-of-the-art methods. 相似文献

10.

利用空域稀疏性的L型阵下二维波达方向估计 总被引：1，自引：0，他引：1

崔琛王粒宾《电路与系统学报》2013,(1):297-303

针对稀疏重构应用于二维波达方向估计时存在计算量大的问题,从减少过完备字典的原子个数以及求解模型的阶数这两个方面入手,提出一种具有较低计算复杂度的基于稀疏重构的二维波达方向估计方法。该方法中首先根据L型阵的结构特点,对基于阵列接收数据的稀疏线性模型进行分解和重新组合,抛弃与信源无关的原子,大大减少了原子个数;其次,提出利用多快拍数据协方差矩阵的最大特征值对应的特征向量作为待分解信号建立稀疏模型,使模型阶数从快拍数降至一阶。此外,给出了新方法的详细实施步骤,并且对其计算复杂度进行了理论分析。新方法估计精度高,并且能够自动实现俯仰角和方位角配对。最后,通过计算机仿真验证了新方法的有效性。相似文献

11.

Delineating buildings by grouping lines with MRFs 总被引：2，自引：0，他引：2

Krishnamachari S. Chellappa R. 《IEEE transactions on image processing》1996,5(1):164-168

Traditionally, Markov random field (MRF) models have been used in low-level image analysis. The article presents an MRF-based scheme to perform object delineation. The proposed edge-based approach involves extracting straight lines from the edge map of an image. Then, an MRF model is used to group these lines to delineate buildings in aerial images. 相似文献

12.

Image classification based on complex wavelet structural similarity

Abdul Rehman Yang Gao Jiheng Wang Zhou Wang 《Signal Processing: Image Communication》2013,28(8):984-992

Complex wavelet structural similarity (CW-SSIM) index has been recognized as a novel image similarity measure of broad potential applications due to its robustness to small geometric distortions such as translation, scaling and rotation of images. Nevertheless, how to make the best use of it in image classification problems has not been deeply investigated. In this paper, we introduce a series of novel image classification algorithms based on CW-SSIM and use handwritten digit recognition, and face recognition as examples for demonstration. Among the proposed approaches, the best compromise between accuracy and complexity is obtained by the CW-SSIM support vector machine based algorithms, which combines an unsupervised clustering method to divide the training images into clusters with representative images and a supervised learning method based on support vector machines to maximize the classification accuracy. Our experiments show that such a conceptually simple image classification method, which does not involve any registration, intensity normalization or sophisticated feature extraction processes, and does not rely on any modeling of the image patterns or distortion processes, achieves competitive performance with reduced computational cost. 相似文献

13.

一种基于HWD结构相似的图像质量评价方法

戴喆彭进业冯晓毅《电子设计工程》2013,21(1):181-183

针对传统图像质量评价方法峰值信噪比PSNR和结构相似度SSIM没有充分考虑人眼视觉特性,所得结果有时并不能与人眼的视觉所感知到的实际质量一致的问题,通过对图像结构相似度和人眼视觉系统的研究,文中提出了一种新的基于HWD结构相似的图像质量评价方法。首先对图像进行Hybrid Wavelets and Directional Filter Banks(HWD)分解,提取图像不同频带不同方向上的信息,然后计算各子带结构相似度,最后综合人眼视觉特性的CSF得到图像质量评价值。实验结果表明文中方法相比峰值信噪比PSNR、结构相似度SSIM算法具有更高的准确性和良好的相关性,可以更好的评价图像质量。相似文献

14.

Image quality assessment using a SVD-based structural projection

《Signal Processing: Image Communication》2014,29(3):293-302

The development of objective image quality assessment (IQA) metrics aligned with human perception is of fundamental importance to numerous image-processing applications. Recently, human visual system (HVS)-based engineering algorithms have received widespread attention for their low computational complexity and good performance. In this paper, we propose a new IQA model by incorporating these available engineering principles. A local singular value decomposition (SVD) is first utilised as a structural projection tool to select local image distortion features, and then, both perceptual spatial pooling and neural networks (NN) are employed to combine feature vectors to predict a single perceptual quality score. Extensive experiments and cross-validations conducted with three publicly available IQA databases demonstrate the accuracy, consistency, robustness, and stability of the proposed approach compared to state-of-the-art IQA methods, such as Visual Information Fidelity (VIF), Visual Signal to Noise Ratio (VSNR), and Structural Similarity Index (SSIM). 相似文献

15.

Testing (non-)existence of input-output relationships by estimating fractal dimensions 总被引：1，自引：0，他引：1

Rafajlowicz E. 《Signal Processing, IEEE Transactions on》2004,52(11):3151-3159

Our aim is to propose tests for (non-)existence of nonlinear relationships between signals, which, after passing a test, can be interpreted as input and output signals of a certain system, if its characteristic is sufficiently smooth. The proposed tests are based on the theoretical results on equality of fractal dimensions of these signals as well as on estimation of fractal dimensions from observations. They are applicable when at least one of these signals has the fractal dimensions strictly larger than one, i.e., it is rough enough. The tests are then verified on simulated data. Their applicability is illustrated by two sets of real data, namely, observations of two financial time series and samples of displacement-force signals in a magneto-hydrological damper. 相似文献

16.

Image quality assessment: from error visibility to structural similarity 总被引：201，自引：0，他引：201

Zhou Wang Bovik A.C. Sheikh H.R. Simoncelli E.P. 《IEEE transactions on image processing》2004,13(4):600-612

Objective methods for assessing perceptual image quality traditionally attempted to quantify the visibility of errors (differences) between a distorted image and a reference image using a variety of known properties of the human visual system. Under the assumption that human visual perception is highly adapted for extracting structural information from a scene, we introduce an alternative complementary framework for quality assessment based on the degradation of structural information. As a specific example of this concept, we develop a Structural Similarity Index and demonstrate its promise through a set of intuitive examples, as well as comparison to both subjective ratings and state-of-the-art objective methods on a database of images compressed with JPEG and JPEG2000. 相似文献

17.

Compressive spectrum sensing in the cognitive radio networks by exploiting the sparsity of active radios

Jianrui Chen L. C. Jiao Jianshe Wu Xiaodong Wang 《Wireless Networks》2013,19(5):661-671

Spectrum sensing is a key technology to detect spectrum holes in cognitive network. It has been demonstrated that collaboration among cognitive users can improve the probability of detecting the primary users, but the fusion center is the bottleneck when a lot of collaborative information is transmitted. In this paper, we consider the cognitive radio users only transmit part of sensing information to relieve the transmission load. Besides, the sensing information will be inevitably influenced by various noise in the process of transmission. Therefore, the challenge is how we can detect spectrum holes successfully from these incomplete and inexact measurements. Most recently, there are some research results on this but the detection performance is not satisfactory. In this paper, we firstly formulate the collaborative spectrum sensing as an optimization model and then present a novel adaptive orthogonal matching pursuit algorithm by exploiting the sparsity of active primary users. Statistical property of the sensing data plays a crucial role in spectrum sensing. Theoretical analysis shows the presented scheme can detect active primary users rapidly and efficiently. Simulation results verify that the proposed method can obtain better detection performance with stronger noise background, which is more attractive in real applications. 相似文献

18.

Effective excursion detection by defect type grouping in in-lineinspection and classification

Shindo W. Wang E.H. Akella R. Strojwas A.J. Tomlinson W. Bartholomew R. 《Semiconductor Manufacturing, IEEE Transactions on》1999,12(1):3-10

In this paper, a new methodology for effective process excursion monitoring using defect review/classification information is proposed. We introduce a new defect classification scheme, in which relevant defect types that are likely to be caused by the same mechanism or source are grouped into a “defect family”. It is demonstrated that trending by the defect family drastically improves the detection efficiency of killer defect excursion by reducing or eliminating noise resulting from irrelevant benign defects. We compare the risks of missing critical excursions for monitoring by total defect count, killer defect count, and killer defect family, and illustrate the effectiveness of our methodology using data from actual fabline 相似文献

19.

Image segmentation by clustering 总被引：5，自引：0，他引：5

《Proceedings of the IEEE. Institute of Electrical and Electronics Engineers》1979,67(5):773-785

This paper describes a procedure for segmenting imagery using digital methods and is based on a mathematical-pattern recognition model. The technique does not require training prototypes but operates in an "unsupervised" mode. The features most useful for the given image to be segmented are retained by the algorithm without human interaction, by rejecting those attributes which do not contribute to homogeneous clustering in N-dimensional vector space. The basic procedure is a K-means clustering algorithm which converges to a local minimum in the average squared intercluster distance for a specified number of clusters. The algorithm iterates on the number of clusters, evaluating the clustering based on a parameter of clustering quality. The parameter proposed is a product of between and within cluster scatter measures, which achieves a maximum value that is postulated to represent an intrinsic number of clusters in the data. At this value, feature rejection is implemented via a Bhattacharyya measure to make the image segments more homogeneous (thereby removing "noisy" features); and reclustering is performed. The resulting parameter of clustering fidelity is maximized with segmented imagery resulting in psychovisually pleasing and culturally logical image segments. 相似文献

20.

QuickBird Image Fusion by a Multirresolution-Multidirectional Joint Image Representation

《Latin America Transactions, IEEE (Revista IEEE America Latina)》2007,5(1):32-37

A new fusion methodology for Multiespectral and Pancromatic images, has been proposed. This methodology is based on a joint multiresolution-multidirectional representation of the source images. For that an unique directional filters bank of low computacional complexity has been used.This kind of image representation allows an appropriated selection of the information extracted from the source images, avoiding some of the limitations inherent to other multiresolution fusión methods. The final aim is to obtain fused images with a high spectral and spatial quality simultaneously. The source images corresponds to the captured by Quickbird satellite (panchromatic and multispectral). The high quality of the obtained results shows the potential of the joint multiresolution-multidirectional representation for images fusion. 相似文献