High user interaction capability of mobile devices can help improve the accuracy of mobile visual search systems. At query time, it is possible to capture multiple views of an object from different viewing angles and at different scales with the mobile device camera to obtain richer information about the object compared to a single view and hence return more accurate results. Motivated by this, we propose a new multi-view visual query model on multi-view object image databases for mobile visual search. Multi-view images of objects acquired by the mobile clients are processed and local features are sent to a server, which combines the query image representations with early/late fusion methods and returns the query results. We performed a comprehensive analysis of early and late fusion approaches using various similarity functions, on an existing single view and a new multi-view object image database. The experimental results show that multi-view search provides significantly better retrieval accuracy compared to traditional single view search. 相似文献
In this technical note, we revisit the risk-sensitive optimal control problem for Markov jump linear systems (MJLSs). We first demonstrate the inherent difficulty in solving the risk-sensitive optimal control problem even if the system is linear and the cost function is quadratic. This is due to the nonlinear nature of the coupled set of Hamilton-Jacobi-Bellman (HJB) equations, stemming from the presence of the jump process. It thus follows that the standard quadratic form of the value function with a set of coupled Riccati differential equations cannot be a candidate solution to the coupled HJB equations. We subsequently show that there is no equivalence relationship between the problems of risk-sensitive control and H∞ control of MJLSs, which are shown to be equivalent in the absence of any jumps. Finally, we show that there does not exist a large deviation limit as well as a risk-neutral limit of the risk-sensitive optimal control problem due to the presence of a nonlinear coupling term in the HJB equations.
In distributed meeting applications, microphone arrays have been widely used to capture superior speech sound and perform speaker localization through sound source localization (SSL) and beamforming. This paper presents a unified maximum likelihood framework of these two techniques, and demonstrates how such a framework can be adapted to create efficient SSL and beamforming algorithms for reverberant rooms and unknown directional patterns of microphones. The proposed method is closely related to steered response power-based algorithms, which are known to work extremely well in real-world environments. We demonstrate the effectiveness of the proposed method on challenging synthetic and real-world datasets, including over six hours of recorded meetings. 相似文献
Web service compositions are becoming more and more complex, involving numerous interacting ad-hoc services. These services are often implemented as business processes themselves. By analysing such complex web service compositions one is able to better understand, control and eventually re-design them. Our contribution to this problem is a mining algorithm, based on a statistical technique to discover composite web service patterns from execution logs. Our approach is characterised by a “local” pattern’s discovery that covers partial results through a dynamic programming algorithm. Those locally discovered patterns are then composed iteratively until the composite Web service is discovered. The analysis of the disparities between the discovered model and the initial ad-hoc composite model (delta-analysis) enables initial design gaps to be detected and thus to re-engineer the initial Web service composition. 相似文献
This study proposes a new four-component algorithm for land use and land cover (LULC) classification using RADARSAT-2 polarimetric SAR (PolSAR) data. These four components are polarimetric decomposition, PolSAR interferometry, object-oriented image analysis, and decision tree algorithms. First, polarimetric decomposition can be used to support the classification of PolSAR data. It is aimed at extracting polarimetric parameters related to the physical scattering mechanisms of the observed objects. Second, PolSAR interferometry is used to extract polarimetric interferometric information to support LULC classification. Third, the main purposes of object-oriented image analysis are delineating image objects, as well as extracting various textural and spatial features from image objects to improve classification accuracy. Finally, a decision tree algorithm provides an efficient way to select features and implement classification. A comparison between the proposed method and the Wishart supervised classification which is based on the coherency matrix was made to test the performance of the proposed method. The overall accuracy of the proposed method was 86.64%, whereas that of the Wishart supervised classification was 69.66%. The kappa value of the proposed method was 0.84, much higher than that of the Wishart supervised classification, which exhibited a kappa value of 0.65. The results indicate that the proposed method exhibits much better performance than the Wishart supervised classification for LULC classification. Further investigation was carried out on the respective contribution of the four components to LULC classification using RADARSAT-2 PolSAR data, and it indicates that all the four components have important contribution to the classification. Polarimetric information has significant implications for identifying different vegetation types and distinguishing between vegetation and urban/built-up. The polarimetric interferometric information extracted from repeat-pass RADARSAT-2 images is important in reducing the confusion between urban/built-up and vegetation and that between barren/sparsely vegetated land and vegetation. Object-oriented image analysis is very helpful in reducing the effect of speckle in PolSAR images by implementing classification based on image objects, and the textural information extracted from image objects is helpful in distinguishing between water and lawn. The decision tree algorithm can achieve higher classification accuracy than the nearest neighbor classification implemented using Definiens Developer 7.0, and the accuracy of the decision tree algorithm is similar with that of the support vector classification which is implemented based on the features selected using genetic algorithms. Compared with the nearest neighbor and support vector classification, the decision tree algorithm is more efficient to select features and implement classification. Furthermore, the decision tree algorithm can provide clear classification rules that can be easily interpreted based on the physical meaning of the features used in the classification. This can provide physical insight for LULC classification using PolSAR data. 相似文献
The present paper investigates the 3D medial axis transform of objects bounded by freeform surfaces via the saddle point programming method, a mathematical programming approach used to identify the saddle points of a function. After exploring the local geometry and saddle point property of 3D medial axis transform, the mathematical programming method is employed to construct the saddle point programming models. Based on the optimality conditions that the optimal solutions should satisfy, a generic algorithm for computing various medial axis points is developed. In order to identify the junction points and localize the problem, the boundary and the skeletal curves are divided into skeletal segments, and it is proved to be efficient and accurate by numerical examples. 相似文献