1.

Efficient implementation of globally-aware network flow control

Lizhong Chen Ruisheng Wang Timothy M. Pinkston 《Journal of Parallel and Distributed Computing》2012

Network flow control mechanisms that are aware of global conditions potentially can achieve higher performance than flow control mechanisms that are only locally aware. Owing to high implementation overhead, globally-aware flow control mechanisms in their purest form are seldom adopted in practice, leading to less efficient simplified implementations. In this paper, we propose an efficient implementation of a globally-aware flow control mechanism, called Critical Bubble Scheme, for k-ary n-cube networks. This scheme achieves near-optimal performance with the same minimal buffer requirements of globally-aware flow control and can be further generalized to implement the general class of buffer occupancy-based network flow control. We prove deadlock freedom of the proposed scheme and exploit its use in handling protocol-induced deadlocks in on-chip environments. We evaluate the proposed scheme using both synthetic traffic and real application loads. Simulation results show that the proposed scheme can reduce the buffer access component of packet latency by as much as 62% over locally-aware flow control, and improve average packet latency by 18.8% and overall execution time by 7.2% in full system simulation.

2.

Experimental evaluation of decentralized cooperative cruise control for heavy-duty vehicle platooning

《Control Engineering Practice》2015

In this paper, we consider the problem of finding decentralized controllers for heavy-duty vehicle (HDV) platooning by establishing empirical results for a qualitative verification of a control design methodology. We present a linear quadratic control framework for the design of a high-level cooperative platooning controller suitable for modern HDVs. A nonlinear low-level dynamical model is utilized, where realistic response delays in certain modes of operation are considered. The controller performance is evaluated through numerical and experimental studies. It is concluded that the proposed controller behaves well in the sense that experiments show that it allows for short time headways to achieve fuel efficiency, without compromising safety. Simulation results indicate that the model mimics real life behavior. Experiment results show that the dynamic behavior of the platooning vehicles depends strongly on the gear switching logic, which is confirmed by the simulation model. Both simulation and experiment results show that the third vehicle never displays a bigger undershoot than its preceding vehicle. The spacing errors stay bounded within 6.8 m in the simulation results and 7.2 m in the experiment results for varying transient responses. Furthermore, a minimum spacing of −0.6 m and −1.9 m during braking is observed in simulations and experiments, respectively. The results indicate that HDV platooning can be conducted at close spacings with standardized sensors and control units that are already present on commercial HDVs today.

3.

Comparative study of two flow control mechanisms in high speed networks

Zhang Xiaolin, Du Haining, et al. 《Journal of Computer Science and Technology》2001,16(5):0-0

4.

The impact of master–slave bridge access mode on the performance of multi-cluster 802.15.4 network

《Computer Networks》2007,51(10):2411-2449

Individual IEEE 802.15.4 clusters with separate coordinators can be interconnected to form larger networks. In this paper, we investigate the performance of an 802.15.4 beacon-enabled network which consists of κ source clusters interconnected to a sink cluster in a master–slave manner. The bridging function is performed by the coordinator of the source cluster, which periodically visits the sink cluster as an ordinary node. The bridge can deliver its data to the sink coordinator either by competing with other nodes in the sink cluster using the CSMA-CA access mechanism, or by using the dedicated GTS slots allocated by the sink coordinator. We compare the performance of these approaches under varying cluster size and packet arrival rate, and also consider both acknowledged and non-acknowledged transmission in the CSMA part of the superframe. We present numerical and simulation results for κ = 1 and κ = 2 and discuss the performance trend as κ increases further. The results for a single source cluster show that under variable and low to moderate network loads, the CSMA approach is more adaptable to traffic conditions than GTS; under moderate to high loads, the use of acknowledged traffic leads to drastic performance deterioration of the CSMA bridge, whereas the GTS bridge is still able to provide reasonable performance. When the number of source clusters increases, the acknowledged CSMA-CA bridge mode shows larger performance deterioration in the inter-cluster traffic than in the local sink traffic. GTS interconnection in the presence of multiple source clusters preserves the intensity of inter-cluster interconnections, but it sacrifices the performance of the local sink traffic. In non-acknowledged mode with multiple source clusters, CSMA-CA interconnection performs in a more balanced way than GTS interconnection, deteriorating inter-cluster traffic and local traffic almost equally. The use of non-acknowledged transfer is preferred in all cases where the requirements of the sensing application allow it.

5.

Booting, browsing and streaming time profiling, and bottleneck analysis on Android-based systems

Ying-Dar Lin Cheng-Yuan Ho Yuan-Cheng Lai Tzu-Hsiung Du Shun-Lee Chang 《Journal of Network and Computer Applications》2013,36(4):1208-1218

Android-based systems perform slowly in three scenarios: booting, browsing, and streaming. Time profiling on Android devices involves three unique constraints: (1) the execution flow of a scenario invokes multiple software layers, (2) these software layers are implemented in different programming languages, and (3) log space is limited. To address the first and second constraints, we adopted a staged approach using different profiling tools applied to different layers and programming languages. To address the last constraint and avoid generating enormous quantities of irrelevant log data, we began profiling scenarios from an individual module, then iteratively profiled an increasing number of modules and layers, and finally consolidated the logs from different layers to identify bottlenecks. Because of this iteration, we called this approach a staged iterative instrumentation approach. To analyze the time required to boot the devices, we conducted experiments using off-the-shelf Android products. We determined that 72% of the booting time was spent initializing the user-space environment, with 44.4% and 39.2% required to start Android services and managers, and preload Java classes and resources, respectively. Results from analyzing browsing performance indicate that networking is the most significant factor, accounting for at least 90% of the delay in browsing. With regard to online streaming, networking and decoding technologies are the two most important factors, occupying 77% of the time required to prepare a 22 MB video file over a Wi-Fi connection. Furthermore, the overhead of this approach is low. For example, the overhead of CPU loading is about 5% in the browsing scenario. We believe that this proposed approach to time profiling represents a major step in the optimization and future development of Android-based devices.

6.

Efficient organic light-emitting diodes with C60 buffer layer

Zhaoyue Lv Zhenbo Deng Denghui Xu Xiufang Li Yong Jia 《Displays》2009,30(1):23-26

Organic light-emitting diodes (OLEDs) with a C₆₀ buffer layer were fabricated. The effect of the C₆₀ buffer layer on the performance of the devices was investigated by inserting the C₆₀ buffer layer at the interface between the electrode and organic layers. The device structures were (1) ITO/C₆₀ (0.0, 0.4, 0.7 and 1.0 nm)/NPB/Alq₃/LiF/Al and (2) ITO/NPB/Alq₃/C₆₀ (0.0, 0.4, 0.7 and 1.0 nm)/LiF/Al. The highest brightness and efficiency of device (1) with a 0.7 nm-thick C₆₀ layer reached 6439 cd/m² at 16 V and 1.80 cd/A at 6.4 V, respectively. The enhancements in brightness and efficiency are attributed to an improved balance of hole and electron injections due to the C₆₀ layer blocking part of the injected holes. In contrast, the brightness and efficiency of the devices with structure (2) were hardly enhanced.

7.

Performance analysis of networks of queues under active queue management scheme

《Simulation Modelling Practice and Theory》2007,15(4):416-425

This paper presents an analysis of networks of queues under a repetitive service blocking mechanism. Nodes are connected according to an arbitrary configuration and each node in the network employs an active queue management (AQM) based queueing policy to guarantee certain quality of service for multiple class external traffic. This buffer management scheme has been implemented using queue thresholds. The use of queue thresholds is a well-known technique for network traffic congestion control. The analysis is based on a queue-by-queue decomposition technique where each queue is modelled as a GE/GE/1/N queue with single server, R (R ⩾ 2) distinct traffic classes and {N = N₁, N₂, … , N_R} buffer threshold values per class under first-come-first-serve (FCFS) service rule. The external traffic is modelled using the generalised exponential (GE) distribution which can capture the bursty property of network traffic. The analytical solution is obtained using the maximum entropy (ME) principle. The forms of the state and blocking probabilities are analytically established at equilibrium via appropriate mean value constraints. The initial numerical results demonstrate the credibility of the proposed analytical solution.
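The per-class queue thresholds described in this abstract can be illustrated with a small sketch. It is not the paper's analytical model: it assumes, purely for illustration, that an arrival of class c is admitted only while the current queue length is below that class's threshold, and that service is FCFS across classes. The class name `ThresholdQueue` and its methods are hypothetical.

```python
from collections import deque


class ThresholdQueue:
    """Single FCFS buffer with one admission threshold per traffic class.

    Assumed semantics (not taken from the paper): a class-c arrival is
    accepted only if the current queue length is below thresholds[c].
    """

    def __init__(self, thresholds):
        self.thresholds = list(thresholds)
        self.buffer = deque()

    def offer(self, traffic_class, packet):
        """Try to enqueue a packet; return True if admitted, False if blocked."""
        if len(self.buffer) < self.thresholds[traffic_class]:
            self.buffer.append((traffic_class, packet))
            return True
        return False

    def serve(self):
        """Remove and return the oldest packet (FCFS), or None if empty."""
        return self.buffer.popleft() if self.buffer else None


# Class 0 may fill the whole buffer (threshold 4); class 1 is cut off at 2.
q = ThresholdQueue(thresholds=[4, 2])
low = [q.offer(1, f"low-{i}") for i in range(3)]
high = [q.offer(0, f"high-{i}") for i in range(3)]
print(low)   # [True, True, False]
print(high)  # [True, True, False]
```

Giving the lower-priority class a smaller threshold reserves the remaining buffer space for the higher-priority class, which is the usual motivation for threshold-based buffer management.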

8.

Predictive functional control of an axis positioning system with an estimator-based internal model

《Control Engineering Practice》2019

This paper deals with an application of the predictive functional control with a state estimator-based internal model (PFC_EBIM). The PFC_EBIM has been shown to be effective in simulation. However, neither detailed experimental validation nor comparison with other controllers has been reported thus far. Here, the PFC_EBIM is implemented in a single-axis positioning system, and a few experimental tests are conducted. Tracking performance of the PFC_EBIM, standard PFC, and P-PI control for both smooth and non-smooth reference signals are compared. The experimental results demonstrate the effectiveness of the PFC_EBIM.

9.

Robust controller design for automated kick handling in managed pressure drilling

《Journal of Process Control》2016

10.

A reactive and dependable transport protocol for wireless mesh networks

Enrico Natalizio Pasquale Pace Francesca Guerriero Antonio Violi 《Journal of Parallel and Distributed Computing》2010

In the last few years, several different mesh network architectures have been conceived by both industry and academia; however, many issues on the deployment of efficient and fair transport protocols are still open. One of these issues is rate adaptation, that is, how to allocate the network resources among multiple flows, while minimizing the performance overhead. In order to address this problem, in this paper, we first define an analytical framework for a very simple topology. The model allows us to study the performance of an adaptive and responsive transport protocol when the effects of the lower layers are ignored. The mathematical approach alone does not represent a feasible solution, but it contributes to determining the strengths and weaknesses of our proposal. The main novelty of the proposed solution is that the congestion control approach is based on a hop-by-hop mechanism, which allows nodes to adapt their transmitting rates in a distributed way and to keep track of dynamic multi-hop network characteristics in a responsive manner. This is in contrast with classical solutions in the literature, which are founded on end-to-end support. To ensure reliability, a coarse-grained end-to-end algorithm is integrated with the proposed hop-by-hop congestion control mechanism to provide packet level reliability at the transport layer. Performance evaluation, via extensive simulation experiments, shows that the proposed protocol achieves a high performance in terms of network throughput.

11.

Piezoelectric fuel injection: Cycle-to-cycle control of tightly spaced injections

Chris A. Satkoski Neha S. Ruikar Scott D. Biggs Gregory M. Shaver 《Control Engineering Practice》2012,20(11):1175-1182

Piezoelectric fuel injectors will require closed-loop control to realize tightly spaced injections. This paper describes an estimation algorithm for cycle-to-cycle determination of an injection flow profile for use as feedback for closed-loop control. A control design-amenable, 2-pulse approximate model is outlined to represent the dynamics of simultaneously controlling the quantity of, and realized dwell time between, injection pulses. A control law is developed to provide an overdamped response of realized dwell time to prevent pulse bleeding. Closed-loop performance is validated with the experimental data. Desired fuel quantity, and very short dwell times of 0.0002 s (two crank angle degrees at 1800 RPM), are realized with tracking error convergence time constants of approximately four engine cycles (0.27 s at 1800 RPM).
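The two parenthetical conversions in this abstract can be checked with a few lines of arithmetic. The sketch below assumes a four-stroke engine, so one engine cycle spans two crankshaft revolutions; that assumption is not stated in the abstract, and the function names are illustrative only.

```python
def seconds_to_crank_degrees(seconds: float, rpm: float) -> float:
    """Crank angle (degrees) swept in `seconds` at a constant speed of `rpm`."""
    revolutions_per_second = rpm / 60.0
    return seconds * revolutions_per_second * 360.0


def engine_cycles_to_seconds(cycles: float, rpm: float, revs_per_cycle: int = 2) -> float:
    """Duration of `cycles` engine cycles; revs_per_cycle=2 assumes a four-stroke engine."""
    seconds_per_revolution = 60.0 / rpm
    return cycles * revs_per_cycle * seconds_per_revolution


dwell_degrees = seconds_to_crank_degrees(0.0002, 1800)
convergence_seconds = engine_cycles_to_seconds(4, 1800)
print(f"{dwell_degrees:.2f} crank angle degrees")  # 2.16 crank angle degrees
print(f"{convergence_seconds:.2f} s")              # 0.27 s
```

Under that assumption, both figures agree with the abstract: a 0.0002 s dwell is roughly two crank angle degrees, and four engine cycles take about 0.27 s at 1800 RPM.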

12.

Performance assessment and tuning for exchange of clinical documents cross healthcare enterprises

《Computer Standards & Interfaces》2016

Background: To integrate electronic health records (EHRs) from diverse document sources across healthcare providers, facilities, or medical institutions, the IHE XDS.b profile can be considered as one of the solutions. In this research, we developed an EHR/OpenXDS system that adopts OpenXDS, open source software that complies with the IHE XDS.b profile, and that achieves EHR interoperability. Objective: We conducted performance testing to investigate the performance and limitations of this EHR/OpenXDS system. Methodology: The performance testing was conducted for three use cases, EHR submission, query, and retrieval, based on the IHE XDS.b profile for EHR sharing. In addition, we monitored the depletion of hardware resources (including CPU usage, memory usage, and network usage) during test case execution to identify the EHR/OpenXDS system's limitations in more detail. Results: In this EHR/OpenXDS system, the maximum affordable workload for EHR submissions was 400 EHR submissions per hour; the DSA CPU usage was 20%, memory usage was 1380 MB, and the network usages were 0.286 KB input and 7.58 KB output per minute; the DPA CPU usage was 1%, memory usage was 1770 MB, and the network usages were 7.75 KB input and 1.54 KB output per minute; the DGA CPU usage was 24%, memory usage was 2130 MB, and the network usages were 1.3 KB input and 0.174 KB output per minute. The maximum affordable workload for EHR queries was 600 EHR queries per hour; the DCA CPU usage was 66%, the memory usage was 1660 MB, and the network usages were 0.230 KB input and 0.251 KB output per minute; the DGA CPU usage was 1%, the memory usage was 1890 MB, and the network usages were 0.273 KB input and 0.22 KB output per minute. The maximum affordable workload for EHR retrievals was 2000 EHR retrievals; the DCA CPU usage was 79%, the memory usage was 1730 MB, and the network usages were 19.55 KB input and 1.12 KB output per minute; the DPA CPU usage was 3.75%, the memory usage was 2310 MB, and the network usages were 0.956 KB input and 19.57 KB output per minute. Discussion and conclusion: From the research results, we suggest that future implementers who deploy the EHR/OpenXDS system consider the following aspects. First, determine the service volume to be provided in the environment and then adjust the hardware resources accordingly. Second, the IHE XDS.b profile currently uses SOAP (Simple Object Access Protocol) web services; it might move to RESTful (representational state transfer) web services, which are more efficient than SOAP web services. Third, concurrent processing capability should be added to the OpenXDS source code to use the hardware more efficiently while processing the ITI-42, ITI-18, and ITI-43 transactions. Fourth, work should continue on adjusting the memory usage of the OpenXDS modules so that memory is used more efficiently, e.g., the memory configuration of the JVM (Java Virtual Machine), Apache Tomcat, and Apache Axis2. Fifth, consider whether hardware monitoring is required in the deployment environment. These research results provide test figures for reference, along with tuning suggestions and future work for continuing to improve the performance of OpenXDS.

13.

Analyzing MPI performance over 10-Gigabit Ethernet

《Journal of Parallel and Distributed Computing》2005,65(10):1253-1260

Recent work with 10-Gigabit Ethernet (10 GbE) network adapters has demonstrated good performance in TCP/IP-based local- and wide-area networks (LANs and WANs). In the present work we present an evaluation of host-based 10 GbE adapters in a system-area network (SAN) in support of a cluster. This evaluation focuses on the performance of the message-passing interface (MPI) when running over a 10 GbE interconnect. We find that MPI over 10 GbE provides communications performance comparable to that of TCP alone and fairly competitive with more exotic technologies such as MPI over Quadrics. The optimization of MPI and MPI-based applications to make use of this performance, however, is a non-trivial task. Consequently, it is difficult for MPI-based applications to realize this performance when running current-generation 10 GbE hardware.

14.

Enhancement of an industrial finite-volume code for large-eddy-type simulation of incompressible high Reynolds number flow using near-wall modelling

Tobias Knopp Xiaoqin Zhang Roland Kessler Gert Lube 《Computer Methods in Applied Mechanics and Engineering》2010,199(13-16):890-902

We present a validation strategy for enhancement of an unstructured industrial finite-volume solver designed for steady RANS problems for large-eddy-type simulation with near-wall modelling of incompressible high Reynolds number flow. Different parts of the projection-based discretisation are investigated to ensure LES capability of the numerical method. Turbulence model parameters are calibrated by using a minimisation of least-squares functionals for first and second order statistics of the basic benchmark problems of decaying homogeneous turbulence and turbulent channel flow. Then the method is applied to the flow over a backward facing step at Re_h = 37,500. Of special interest is the role of the spatial and temporal discretisation error for low order schemes. For wall-bounded flows, present results confirm existing best practice guidelines for mesh design. For free-shear layers, a sensor to quantify the resolution quality of the LES based on the resolved turbulent kinetic energy is presented and applied to the flow over a backward facing step at Re_h = 37,500.

15.

Xen2MX: High-performance communication in virtualized environments

《Journal of Systems and Software》2014

Cloud computing infrastructures provide vast processing power and host a diverse set of computing workloads, ranging from service-oriented deployments to high-performance computing (HPC) applications. As HPC applications scale to a large number of VMs, providing near-native network I/O performance to each peer VM is an important challenge. In this paper we present Xen2MX, a paravirtual interconnection framework over generic Ethernet, binary compatible with Myrinet/MX and wire compatible with MXoE. Xen2MX combines the zero-copy characteristics of Open-MX with Xen's memory sharing techniques. Experimental evaluation of our prototype implementation shows that Xen2MX is able to achieve nearly the same raw performance as Open-MX running in a non-virtualized environment. On the latency front, Xen2MX performs as close as 96% to the case where virtualization layers are not present. Regarding throughput, Xen2MX saturates a 10 Gbps link, achieving 1159 MB/s, compared to 1192 MB/s of the non-virtualized case. Xen2MX also scales efficiently with the number of VMs, saturating the link for even smaller messages when 40 single-core VMs put pressure on the network adapters.

16.

Bivariate quality control using two-stage intelligent monitoring scheme

《Expert systems with applications》2014,41(16):7579-7595

In manufacturing industries, it is well known that process variation is a major source of poor quality products. As such, monitoring and diagnosis of variation is essential towards continuous quality improvement. This becomes more challenging when involving two correlated variables (bivariate), whereby selection of statistical process control (SPC) scheme becomes more critical. Nevertheless, the existing traditional SPC schemes for bivariate quality control (BQC) were mainly designed for rapid detection of unnatural variation with limited capability in avoiding false alarm, that is, imbalanced monitoring performance. Another issue is the difficulty in identifying the source of unnatural variation, that is, lack of diagnosis, especially when dealing with small shifts. In this research, a scheme to address balanced monitoring and accurate diagnosis was investigated. Design consideration involved extensive simulation experiments to select input representation based on raw data and statistical features, artificial neural network recognizer design based on synergistic model, and monitoring–diagnosis approach based on two-stage technique. The study focused on bivariate process for cross correlation function, ρ = 0.1–0.9 and mean shifts, μ = ±0.75–3.00 standard deviations. The proposed two-stage intelligent monitoring scheme (2S-IMS) gave superior performance, namely, average run length, ARL₁ = 3.18–16.75 (for out-of-control process), ARL₀ = 335.01–543.93 (for in-control process) and recognition accuracy, RA = 89.5–98.5%. This scheme was validated in manufacturing of audio video device component. This research has provided a new perspective in realizing balanced monitoring and accurate diagnosis in BQC.

17.

An improved swarm optimized functional link artificial neural network (ISO-FLANN) for classification

Satchidananda Dehuri Rahul Roy Sung-Bae Cho Ashish Ghosh 《Journal of Systems and Software》2012,85(6):1333-1345

A multilayer perceptron (MLP) trained with the back propagation learning algorithm requires considerable computation time. The complexity of the network increases as the number of layers and the number of nodes in each layer increase. Further, it is also very difficult to decide a priori the number of nodes in a layer and the number of layers in the network required for solving a problem. In this paper an improved particle swarm optimization (IPSO) is used to train the functional link artificial neural network (FLANN) for classification and we name it ISO-FLANN. In contrast to MLP, FLANN has less architectural complexity, is easier to train, and may give more insight into the classification problem. Further, we rely on the global search capabilities of IPSO to explore the entire weight space, which is plagued by a host of local optima. Using the functionally expanded features, FLANN overcomes the non-linear nature of problems. We believe that the combined efforts of FLANN and IPSO (IPSO + FLANN = ISO-FLANN), by harnessing their best attributes, can give rise to a robust classifier. An extensive simulation study is presented to show the effectiveness of the proposed classifier. Results are compared with MLP, support vector machine (SVM) with radial basis function (RBF) kernel, FLANN with gradient descent learning and fuzzy swarm net (FSN).
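The idea of functionally expanded features can be sketched briefly. The abstract does not say which basis functions the authors use; the example below assumes a trigonometric expansion, a common choice in the FLANN literature, and a single tanh output node. The function names are hypothetical, and the weights are supplied by hand here rather than found by IPSO.

```python
import math


def trig_expand(features, order=2):
    """Expand each input x into [x, sin(k*pi*x), cos(k*pi*x)] for k = 1..order.

    The trigonometric basis is an assumption made for this sketch.
    """
    expanded = []
    for x in features:
        expanded.append(x)
        for k in range(1, order + 1):
            expanded.append(math.sin(k * math.pi * x))
            expanded.append(math.cos(k * math.pi * x))
    return expanded


def flann_output(features, weights, bias=0.0, order=2):
    """Single-layer output: tanh of a weighted sum over the expanded features."""
    phi = trig_expand(features, order)
    if len(weights) != len(phi):
        raise ValueError("need one weight per expanded feature")
    activation = sum(w * p for w, p in zip(weights, phi)) + bias
    return math.tanh(activation)


sample = [0.5, 0.25]
phi = trig_expand(sample, order=2)
print(len(phi))  # 10: each of the 2 inputs yields 1 + 2*order terms
score = flann_output(sample, weights=[0.1] * len(phi))
print(-1.0 < score < 1.0)  # True: tanh keeps the output in (-1, 1)
```

Because the non-linearity lives in the fixed expansion, only a single layer of weights needs to be learned, which is why a population-based optimizer such as particle swarm optimization can search the weight space directly.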

18.

Design of membrane actuator based on ferromagnetic shape memory alloy composite for synthetic jet applications

《Sensors and actuators. A, Physical》2006,125(2):512-518

Active flow control (AFC) technology has been studied and shown to help aircraft improve aerodynamic performance and reduce jet noise. AFC can be achieved by a synthetic jet actuator injecting high momentum air into the airflow at the appropriate locations on aircraft wings. To produce strong synthetic jet flow at high frequency, a new membrane actuator based on ferromagnetic shape memory alloy (FSMA) composite and hybrid mechanism was designed and constructed. The hybrid mechanism is the stress-induced martensitic phase transformation caused by large force due to large magnetic field gradient, thus enhancing the displacement, as the stiffness of shape memory alloy reduces due to the martensitic transformation. This sequence of events can take place within milliseconds. The high momentum airflow is produced by the oscillation of the circular FSMA composite diaphragm close to its resonance frequency driven by electromagnets. Due to large force and martensitic transformation on the FSMA composite diaphragm, the membrane actuator that we designed can produce 190 m/s synthetic jets at 220 Hz.

19.

High-speed, in-band performance measurement instrumentation for next generation IP networks

Dimitrios P. Pezaros Konstantinos Georgopoulos David Hutchison 《Computer Networks》2010,54(18):3246-3263

Facilitating always-on instrumentation of Internet traffic for the purposes of performance measurement is crucial in order to enable accountability of resource usage and automated network control, management and optimisation. This has proven infeasible to date due to the lack of native measurement mechanisms that can form an integral part of the network’s main forwarding operation. However, the Internet Protocol version 6 (IPv6) specification enables the efficient encoding and processing of optional per-packet information as a native part of the network layer, and this constitutes a strong reason for IPv6 to be adopted as the ubiquitous next generation Internet transport. In this paper we present a very high-speed hardware implementation of in-line measurement, a truly native traffic instrumentation mechanism for the next generation Internet, which facilitates performance measurement of the actual data-carrying traffic at small timescales between two points in the network. This system is designed to operate as part of the routers’ fast path and to incur an absolutely minimal impact on the network operation even while instrumenting traffic between the edges of very high capacity links. Our results show that the implementation can be easily accommodated by current FPGA technology, and real Internet traffic traces verify that the overhead incurred by instrumenting every packet over a 10 Gb/s operational backbone link carrying a typical workload is indeed negligible.

20.

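A congestion avoidance and congestion release scheme for DTN networks

Du Xuehui, Chen Xingyuan, Wang Na, Cao Lifeng 《Application Research of Computers》2013,30(4):1143-1148

The characteristics of DTNs (delay-tolerant networks) and the custody transfer mechanism they adopt can easily exhaust constrained network resources (such as buffer space and bandwidth), causing network congestion and degrading network performance. The traditional TCP congestion control mechanism is not suitable for DTNs. This paper proposes a new congestion avoidance and congestion release scheme for DTNs. For congestion avoidance, based on the determinism of the transmission delay and transmission capacity of DTN links within a sufficiently small time interval, a directed multipath graph of the DTN is built, and link loads such as data sending rate, receiving rate, and bandwidth usage are partitioned and constrained so as to improve the utilization of network resources as much as possible. For congestion release, on the basis of partitioning the storage resources of each node, congestion in the DTN is relieved by combining the conversion of storage resources within a node with the transfer of messages between nodes. Simulation results show that, compared with other DTN congestion control mechanisms, the proposed scheme achieves good network performance in terms of message delivery ratio and network overhead.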