Cooperative multi-agent reinforcement learning (MARL) is an important topic in artificial intelligence, in which distributed constraint optimization (DCOP) algorithms have been widely used to coordinate the actions of multiple agents. However, dense communication among agents limits the practicability of DCOP algorithms. In this paper, we propose a novel DCOP algorithm that addresses the communication problem of previous DCOP algorithms by reducing constraints. The contributions of this paper are primarily threefold: ① It is proved that removing constraints can effectively reduce the communication burden of DCOP algorithms. ② A criterion is provided to identify insignificant constraints whose elimination does not have a great impact on the performance of the whole system. ③ A constraint-reduced DCOP algorithm is proposed that adopts a variant of the spectral clustering algorithm to detect and eliminate the insignificant constraints. Our algorithm reduces the communication burden of the benchmark DCOP algorithm while keeping its overall performance unaffected. The performance of the constraint-reduced DCOP algorithm is evaluated on four configurations of cooperative sensor networks. The effectiveness of the communication reduction is also verified by comparisons between the constraint-reduced DCOP and the benchmark DCOP.
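The link between constraint removal and communication load can be illustrated with a toy sketch. It is not the paper's algorithm: the abstract does not state the significance criterion, so a plain weight threshold stands in for the spectral clustering variant, and the cost model (two messages per binary constraint per round, one in each direction) is an assumption. The agent names, weights, and the functions `messages_per_round` and `drop_weak_constraints` are all hypothetical.

```python
def messages_per_round(constraints):
    """Assumed cost model: each binary constraint costs two messages
    per round, one in each direction between the two agents."""
    return 2 * len(constraints)


def drop_weak_constraints(constraints, threshold):
    """Keep only constraints whose weight reaches the threshold.

    This simple threshold is a stand-in for the paper's criterion,
    which the abstract does not specify.
    """
    return {edge: w for edge, w in constraints.items() if w >= threshold}


# Hypothetical constraint graph: (agent, agent) -> significance weight.
constraints = {
    ("a1", "a2"): 0.90,
    ("a2", "a3"): 0.05,
    ("a3", "a4"): 0.80,
    ("a1", "a4"): 0.02,
}

reduced = drop_weak_constraints(constraints, threshold=0.1)
before = messages_per_round(constraints)
after = messages_per_round(reduced)
print(f"messages per round: {before} -> {after}")
```

Under this cost model the message count scales linearly with the number of constraints, so every removed constraint saves the same amount of communication; whether solution quality is preserved depends on how well the criterion picks out truly insignificant constraints.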
Soft-sensing modeling can effectively address the large measurement lag, high price, and complex maintenance of online analytical instruments in the production process. At present, data-driven neural networks are one of the main tools for soft sensing. During the collection of modeling data, dominant variables are much more difficult to collect than auxiliary variables, which results in a large amount of unlabeled data. However, traditional soft sensor modeling methods ignore these unlabeled data and use only a small amount of labeled data for modeling, which degrades the prediction accuracy of the model. To solve the problem of missing labels, the nearest neighbor algorithm is used to assign pseudo labels to the unlabeled data. At the same time, a network structure is designed that combines the convolution operation with a gated recurrent unit (GRU) neural network to further utilize the unlabeled data, extract dynamic features from data at different time steps, and improve the prediction accuracy of the neural network. Finally, the method is applied to the prediction of propane concentration at the top of a propylene distillation column. The results show that the model can solve the problem of missing labels in the nonlinear dynamic system and achieves higher prediction accuracy.
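The pseudo-labeling step can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract names only "the nearest neighbor algorithm", so a single nearest neighbor (k = 1) with Euclidean distance over the auxiliary variables is assumed here, and the function name `nearest_neighbor_pseudo_label` and the sample values are hypothetical.

```python
import math


def nearest_neighbor_pseudo_label(labeled_x, labeled_y, unlabeled_x):
    """Give each unlabeled sample the label of its closest labeled sample.

    Assumptions (not stated in the abstract): k = 1 and Euclidean distance.
    """
    pseudo_labels = []
    for x in unlabeled_x:
        # Index of the labeled sample with the smallest distance to x.
        nearest = min(
            range(len(labeled_x)),
            key=lambda i: math.dist(x, labeled_x[i]),
        )
        pseudo_labels.append(labeled_y[nearest])
    return pseudo_labels


# Hypothetical data: auxiliary variables (inputs) and dominant variable (label).
labeled_x = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
labeled_y = [10.0, 20.0, 30.0]
unlabeled_x = [(0.1, 0.2), (1.9, 2.1), (0.9, 1.2)]

pseudo_y = nearest_neighbor_pseudo_label(labeled_x, labeled_y, unlabeled_x)
print(pseudo_y)
```

The pseudo-labeled samples can then be merged with the labeled ones to enlarge the training set for the downstream network; in practice the auxiliary variables would usually be normalized first so that no single variable dominates the distance.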