Full-text access type
Paid full text | 1233 articles |
Free | 26 articles |
Domestic free access | 33 articles |
Subject category
Industrial technology | 1292 articles |
Publication year
2024 | 5 articles |
2023 | 32 articles |
2022 | 66 articles |
2021 | 52 articles |
2020 | 38 articles |
2019 | 20 articles |
2018 | 14 articles |
2017 | 19 articles |
2016 | 26 articles |
2015 | 23 articles |
2014 | 62 articles |
2013 | 58 articles |
2012 | 45 articles |
2011 | 88 articles |
2010 | 60 articles |
2009 | 79 articles |
2008 | 75 articles |
2007 | 67 articles |
2006 | 72 articles |
2005 | 48 articles |
2004 | 72 articles |
2003 | 48 articles |
2002 | 50 articles |
2001 | 35 articles |
2000 | 23 articles |
1999 | 23 articles |
1998 | 16 articles |
1997 | 21 articles |
1996 | 15 articles |
1995 | 13 articles |
1994 | 9 articles |
1993 | 3 articles |
1992 | 7 articles |
1991 | 3 articles |
1990 | 1 article |
1988 | 2 articles |
1986 | 1 article |
1970 | 1 article |
Sort order: 1292 results found, search took 0 ms
1.
Michael Bowling. Artificial Intelligence, 2002, 136(2): 215-250
Learning to act in a multiagent environment is a difficult problem, since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on the policies of the other agents, which creates the problem of learning a moving target. Previous learning algorithms have one of two shortcomings depending on their approach: they either converge to a policy that may not be optimal against the specific opponents' policies, or they may not converge at all. In this article we examine this learning problem in the framework of stochastic games. We look at a number of previous learning algorithms, showing how each fails to meet one of the above criteria. We then contribute a new reinforcement learning technique using a variable learning rate to overcome these shortcomings. Specifically, we introduce the WoLF principle, "Win or Learn Fast", for varying the learning rate. We examine this technique theoretically, proving convergence in self-play on a restricted class of iterated matrix games. We also present empirical results on a variety of more general stochastic games, in situations of self-play and otherwise, demonstrating the wide applicability of this method.
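The core of the WoLF idea is a learning rate that switches between two values: small when the agent is "winning" (its current policy does better than its long-run average policy would) and large when it is "losing". A minimal sketch of that comparison, with illustrative rate values and a toy two-action Q-table that are assumptions, not taken from the paper:

```python
# Sketch of the WoLF ("Win or Learn Fast") variable learning rate for a
# two-action matrix game. DELTA_WIN/DELTA_LOSE and the toy Q-values are
# illustrative assumptions, not the paper's exact parameters.

DELTA_WIN, DELTA_LOSE = 0.01, 0.04  # move cautiously when winning, fast when losing

def wolf_delta(pi, avg_pi, q):
    """Pick the policy-update step size by comparing the expected value of
    the current mixed policy pi against the long-run average policy avg_pi."""
    v_current = sum(pi[a] * q[a] for a in range(len(q)))
    v_average = sum(avg_pi[a] * q[a] for a in range(len(q)))
    return DELTA_WIN if v_current > v_average else DELTA_LOSE

# With Q-values favouring action 0, a policy already preferring action 0
# is "winning" and learns slowly; one preferring action 1 learns fast.
q = [1.0, 0.0]
assert wolf_delta([0.9, 0.1], [0.5, 0.5], q) == DELTA_WIN
assert wolf_delta([0.2, 0.8], [0.5, 0.5], q) == DELTA_LOSE
```

The losing agent's larger step lets it escape being exploited quickly, while the winning agent's smaller step avoids overshooting, which is what drives the convergence results described in the abstract.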
2.
F. L. Lewis. Journal of Intelligent and Robotic Systems, 2007, 48(4): 513-523
This is an outline of research in neural networks for feedback control done since the mid-1990s at the Automation and Robotics Research Institute (ARRI) of The University of Texas at Arlington (UTA). It shows how the development of intelligent control systems based on neural networks has followed three main generations. The outline provides a short, broad-brush perspective on the development of intelligent neural feedback controllers.
3.
An actor-critic type reinforcement learning algorithm is proposed and analyzed for constrained controlled Markov decision processes. The analysis uses multiscale stochastic approximation theory and the envelope theorem of mathematical economics.
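The actor-critic pattern mentioned here pairs a critic, which estimates value and produces a temporal-difference (TD) error, with an actor, whose policy parameters are nudged by that error. The paper's constrained, multiscale analysis is far beyond this, but a minimal unconstrained sketch on a one-state, two-action task (all names and rewards are made up for illustration) shows the mechanics:

```python
import math
import random

# Minimal unconstrained actor-critic sketch (NOT the paper's constrained
# multiscale algorithm): softmax actor, TD(0) critic, one state, two
# actions with rewards 1.0 and 0.0. All parameters are illustrative.

random.seed(0)
theta = [0.0, 0.0]                 # actor: action preferences
v = 0.0                            # critic: value of the single state
alpha_actor, alpha_critic = 0.1, 0.2

def softmax(prefs):
    e = [math.exp(p) for p in prefs]
    s = sum(e)
    return [x / s for x in e]

for _ in range(2000):
    probs = softmax(theta)
    a = 0 if random.random() < probs[0] else 1
    r = 1.0 if a == 0 else 0.0     # action 0 is the rewarding one
    td_error = r - v               # episodic task: no next-state term
    v += alpha_critic * td_error   # critic moves toward observed reward
    for b in range(2):             # actor: policy-gradient step scaled by TD error
        grad = (1.0 if b == a else 0.0) - probs[b]
        theta[b] += alpha_actor * td_error * grad

assert softmax(theta)[0] > 0.8    # the actor learns to prefer action 0
```

The critic and actor using different step sizes is the seed of the "multiscale" idea: convergence arguments typically let the critic adapt on a faster timescale than the actor.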
4.
Scheduling semiconductor wafer manufacturing systems has been viewed as one of the most challenging optimization problems owing to the complicated constraints and dynamic system environment. This paper proposes a fuzzy hierarchical reinforcement learning (FHRL) approach to schedule a semiconductor wafer fabrication system (SWFS), which controls the cycle time (CT) of each wafer lot to improve on-time delivery by adjusting the priority of each wafer lot. To cope with the layer correlation and wafer correlation of CT due to the re-entrant process constraint, a hierarchical model is presented with a recurrent reinforcement learning (RL) unit in each layer to control the corresponding sub-CT of each integrated circuit layer. In each RL unit, a fuzzy reward calculator is designed to reduce the impact of uncertainty in the expected finishing time caused by the rematching of a lot to a delivery batch. The results demonstrate that the mean deviation (MD) between the actual and expected completion time of wafer lots under the FHRL approach is only about 30% of that of the compared methods across the whole SWFS.
5.
Expert Systems with Applications, 2014, 41(9): 4073-4082
In this paper, an intelligent agent (using the Fuzzy SARSA learning approach) is proposed to negotiate bilateral contracts (BC) of electrical energy in Block Forward Markets (BFM or similar market environments). In BFM energy markets, the buyers (or loads) and the sellers (or generators) submit their bids and offers on a daily basis. The loads and generators could employ intelligent software agents to trade energy in BC markets on their behalf. Since each agent attempts to choose the best bid/offer in the market, conflicts of interest might arise. In this work, the trading of energy in BC markets is modeled and solved using Game Theory and Reinforcement Learning (RL) approaches. The Stackelberg equilibrium concept is used for match making among load and generator agents. Then, to overcome the problem of limited negotiation time (it is assumed that each generator-load pair is given a limited time to negotiate and reach an agreement), a Fuzzy SARSA Learning (FSL) method is used. The fuzzy feature of FSL helps the agent cope with the continuous characteristics of the environment and also mitigates the curse of dimensionality. The performance of FSL (compared to other well-known traditional negotiation techniques, such as time-dependent and imitative techniques) is illustrated through simulation studies. The case study simulation results show that the FSL-based agent achieves higher profits than agents using the other reviewed techniques in the BC energy market.
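At the core of any SARSA-based agent, fuzzy or not, is the on-policy update rule, which bootstraps from the action the agent actually chooses next rather than the greedy one. A tabular sketch of that rule follows; the fuzzy variant in the paper replaces the exact table lookup with fuzzy membership degrees, and the state/action names below are invented for illustration:

```python
# Tabular SARSA update rule. The fuzzy-state generalization used in the
# paper replaces table lookups with fuzzy memberships; the negotiation
# state/action names here are illustrative placeholders.

alpha, gamma = 0.5, 0.9  # example learning rate and discount factor

def sarsa_update(Q, s, a, r, s_next, a_next):
    """On-policy TD update: bootstrap from the action actually chosen next."""
    old = Q.get((s, a), 0.0)
    target = r + gamma * Q.get((s_next, a_next), 0.0)
    Q[(s, a)] = old + alpha * (target - old)
    return Q

Q = {}
Q = sarsa_update(Q, "offer_low", "raise_bid", 1.0, "offer_mid", "hold")
assert Q[("offer_low", "raise_bid")] == 0.5  # 0 + 0.5 * (1.0 + 0.9*0 - 0)
```

Being on-policy matters in a time-limited negotiation: the value estimates reflect the exploratory bidding policy the agent really follows, not an idealized greedy one.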
6.
Expert Systems with Applications, 2014, 41(10): 4939-4949
Optimization techniques known as metaheuristics have been applied successfully to many different problems, and their development is characterized by the appropriate selection of parameter values for execution. When a parameter requires adjustment, it is tested until viable results are obtained, normally by the developer deploying the metaheuristic. The quality of results on a tested instance (the term "instance" refers to an assignment of values to the input variables of a problem) does not transfer to instances not yet tested, so this feedback may require a slow process of "trial and error" in which the algorithm is adjusted for each specific application. In this context, Reactive Search emerged, advocating the integration of machine learning within heuristic searches for solving complex optimization problems. Building on the integration that Reactive Search proposes between machine learning and metaheuristics, the idea arose of using Reinforcement Learning, specifically the Q-learning algorithm with reactive behavior, to select which local search is most appropriate at a given point of the search, succeeding a local search that cannot improve the current solution in the VNS metaheuristic. In this work we propose a reactive implementation using Reinforcement Learning for the self-tuning of the implemented algorithm, applied to the Symmetric Travelling Salesman Problem.
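The scheme described, Q-learning choosing which local-search operator to try next inside VNS, can be sketched as a small bandit-like loop: the state is the operator last applied, the action is the next operator, and the reward is the improvement it yields. Everything below (operator names, the simulated improvements, the parameters) is an illustrative assumption, not the paper's implementation:

```python
import random

# Sketch of Q-learning selecting among local-search operators, in the
# spirit of a reactive VNS. Operator names, rewards, and parameters are
# illustrative placeholders, not taken from the paper.

random.seed(1)
operators = ["two_opt", "swap", "or_opt"]
Q = {(s, a): 0.0 for s in operators for a in operators}
alpha, gamma, epsilon = 0.3, 0.8, 0.1

def choose(state):
    """Epsilon-greedy choice of the next local-search operator."""
    if random.random() < epsilon:
        return random.choice(operators)
    return max(operators, key=lambda a: Q[(state, a)])

def update(state, action, reward, next_state):
    """Standard Q-learning update on the operator-selection table."""
    best_next = max(Q[(next_state, a)] for a in operators)
    Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])

# Simulated tour improvements: pretend "two_opt" is reliably the best move.
improvement = {"two_opt": 1.0, "swap": 0.1, "or_opt": 0.0}
state = "swap"
for _ in range(500):
    action = choose(state)
    update(state, action, improvement[action], action)
    state = action

assert max(operators, key=lambda a: Q[("swap", a)]) == "two_opt"
```

In a real TSP solver the reward would be the actual tour-length reduction produced by running the chosen operator, so the table self-tunes to the instance at hand instead of relying on hand-set operator orderings.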