共查询到20条相似文献,搜索用时 281 毫秒
1.
2.
H.264指数哥伦布码解码部件的硬件设计和实现 总被引:5,自引:3,他引:2
提出了一种针对H.264视频编码标准的变长码-指数哥伦布码解码的硬件设计结构,对传统的桶形移位器进行优化,主要采用基于PLA的并行解码算法以达到实时解码,同时辅助使用串行解码算法降低硬件资源消耗,保证在能够对符合H.264标准baseline Profile的码流实时解码的基础上优化了电路资源,给出实现该硬件结构对应的FPGA仿真结果及其ASIC硬件规模. 相似文献
3.
在立方体插值算法的基础上,提出了一种适合于实时应用的快速查找表结构.其中将涉及颜色空间转换和色域扩展的复杂算法使用硬件上的综合查找表完成,使用了计算速度较快的四面体插值算法.同传统的立方体插值算法相比,有效降低了硬件资源消耗并提高了运算速度. 相似文献
4.
《现代电子技术》2020,(4)
为了实现窄带调制信号载波频率的快速准确获取并减少硬件资源消耗,基于FPGA设计一种综合哥兹柔(Goertzel)算法和能量重心估计算法的实时载波频率检测电路。首先,确定频率检测算法实现流程,在Matlab中进行了频谱和能量重心的仿真;接着提出一种低复杂度频率检测系统的硬件实现架构,以8个哥兹柔电路并行运行获取信号频谱,通过查找峰值谱线位置初步估计出信号频率;最后,采用能量重心估计法,精确估计出载波信号的频率。实验结果表明,设计的低复杂度实时频率检测硬件电路在资源消耗上仅为传统FFT方法的1 3,测频相对误差小于0.2%,2 048点频谱获取时间仅为103μs,易于硬件实现,可用于实时准确获取载波频率。 相似文献
5.
6.
在现代数字信号处理领域中,CORDIC算法是一种重要的数学计算方法。该算法采用一种迭代的方式,运算简便,被广泛应用于乘除法、开方以及一些三角函数运算当中。但CORDIC算法需要较高的迭代级数以保证运算精度,在进行FPGA实现时仍然会消耗较多的硬件逻辑资源。为进一步减少CORDIC算法实现时的资源消耗,设计并实现了一种基于折叠变换的CORDIC算法。相比传统的流水结构CORDIC算法,该折叠结构的CORDIC算法消耗的硬件资源大大减少。文中给出了这一方法的实现结构,并给出了仿真结果。 相似文献
7.
讨论了一种基于前导零的WFQ权重比较电路的设计,主要描述了逻辑电路的设计方法、仿真和综合等.和传统的设计方法相比,基于前导零方法的设计不仅具有速度的优势,而且在电路的硬件资源消耗方面也具有较好的表现. 相似文献
8.
9.
10.
提出了一种改进cordic(coordinate rotation digital compute,坐标数字旋转)算法,用于实现NCO(数控振荡器)的设计。该算法能够预先确定所有迭代的旋转方向,相对于传统cordic算法,减少了硬件资源消耗。采用Altra公司CycloneII系列芯片EP2C5AF256A7进行FPGA验证,结果符合设计要求。 相似文献
11.
针对实时图像处理的要求,提出了一种利用FPGA实现多尺度Harris角点提取的方法。通过简化高斯函数,实现了基于FPGA的尺度空间构建。分析了多尺度Harris角点检测算法,改进了响应值计算函数,通过改进算法步骤,在FPGA上并行搜索位置、尺度空间响应值的极大值,进一步提高了多尺度Harris角点检测的速度。硬件处理速度显示,设计适用于实时性要求高的图像处理领域。 相似文献
12.
基于FPGA的三维视频系统实时深度估计 总被引:2,自引:1,他引:1
深度估计是基于视频加深度图像的三维视频系统中前端预处理的核心技术,其主要技术难题包括准确性、实时处理和大分辨率深度图获取等。本文提出一种实时深度估计的硬件实现方案,主要解决处理速度问题,并兼顾了准确性和大分辨率问题。本方案采用单片FPGA实现深度估计,其中采用census变换与SAD(Sum of Absolute Differences)混合的算法进行逐点匹配得到稠密深度图。硬件设计充分利用FPGA的大规模并行能力,并采用流水线设计提高数据通路的数据吞吐量,提升整个设计的时钟频率。实验表明,所提出的方案可实现全高清(1 920×1 080)分辨率视频实时深度估计。为了支持大分辨率图像并能观测距离相机较近的物体深度,本文方案视差搜索范围可以达到240pixels,帧率最高可达69.6fps,达到了实时和高清的处理目的。 相似文献
13.
High-speed real-time digital frequency analysis is one major field of Fast Fourier Transform (FFT) application, such as Synthetic Aperture Radar(SAR) processing and medical imaging. In SAR processing, the image size could be 4 k×4 k in normal and it has become larger over the years. In the view of real-time, extensibility and reusable characteristics, an Field Programmable Gate Array(FPGA) based multi-channel variable-length FFT architecture which adopts radix-2 butterfly algorithm is proposed in this paper. The hardware implementation of FFT is partially reconfigurable architecture. Firstly, the proposed architecture in the paper has flexibility in terms of chip area, speed, resource utilization and power consumption. Secondly, the proposed architecture combines serial and parallel methods in its butterfly computations. Furthermore, on system-level issue, the proposed architecture takes advantage of state processing in serial mode and data processing in parallel mode. In case of sufficient FPGA resources, state processing of serial mode mentioned above is converted to pipeline mode. State processing of pipeline mode achieves high throughput. 相似文献
14.
基于FPGA图像分块解码的系统设计 总被引:1,自引:1,他引:0
讨论了一种硬件实现卫星图像高速解码的系统设计方案。采用图像分块解码的方式,使用现场可编程门阵列(FPGA,Field Programmable Gate Array)的IP核实现片内缓存,减少了外挂RAM,降低了设计的复杂度。利用异步FIFO可以同时读写的特性,实现了码流检测与解码同时进行,并通过多路并行和流水线操作对图像码流进行高速解码。整个设计采用VHDL对算法完成建模和实现,仿真和综合结果表明该方案占用的硬件资源少,解码速度快,实现了图像码流解码的实时性和准确性,而且可移植性强,可以应用到图像处理的很多领域。 相似文献
15.
Object detection represents one of the most important and challenging task in computer vision applications. Boosting-based approaches deal with computational intensive operations and they involve several sequential tasks that make very difficult developing hardware implementations with high parallelism level. This work presents a new hardware architecture able to perform object detection based on a cascade classifier in real-time and resource-constrained systems. As case study, the proposed architecture has been tailored to accomplish the face detection task and integrated within a complete heterogeneous embedded system based on a Xilinx Zynq-7000 FPGA-based System-on-Chip. Experimental results show that, thanks to the proposed parallel processing scheme and the runtime adaptable strategy to slide sub-windows across the input image, the novel design achieves a frame rate up to 125fps for the QVGA resolution, thus significantly outperforming previous works. Such a performance is obtained by using less than 10% of on-chip available logic resources with a power consumption of 377 mW at the 100 MHz clock frequency. 相似文献
16.
17.
Hau T. Ngo Author Vitae Author Vitae Ming Z. Zhang Author Vitae Author Vitae 《Integration, the VLSI Journal》2008,41(4):474-488
A high performance digital architecture for the implementation of a nonlinear image enhancement technique is proposed in this paper. The image enhancement is based on an illuminance-reflectance model which improves the visual quality of digital images and video captured under insufficient or non-uniform lighting conditions. The algorithm shows robust performance with appropriate dynamic range compression, good contrast, accurate and consistent color rendition. The algorithm contains a large number of complex computations and thus it requires specialized hardware implementation for real-time applications. Systolic, pipelined and parallel design techniques are utilized effectively in the proposed FPGA-based architectural design to achieve real-time performance. Approximation techniques are used in the hardware algorithmic design to achieve high throughput. The video enhancement system is implemented using Xilinx's multimedia development board that contains a VirtexII-X2000 FPGA and it is capable of processing approximately 63 Mega-pixels (Mpixels) per second. 相似文献
18.
TING-PANG LIN DER-LAN LO PO-CHIANG LU CHAUR-HEH HSIEH 《International Journal of Electronics》2013,100(6):1165-1176
An architecture based on a systolic array for real-time image template matching is presented. The architecture consists mainly of four elements: a digitizer, a two-dimensional systolic array combined with variable-length shift register arrays, an adder tree, and a comparator. All the elements form a four-stage pipeline. The image data enter the pipe sequentially in the same order as the TV raster scan. The matching computation is, however, performed in a parallel manner. The analyses on time complexity and hardware complexity have shown that real-time performance is achieved. The analyses have also shown that the processing speed is higher and the hardware is simpler when compared to the architecture presented by Chou and Chen. 相似文献
19.
20.
Aho E. Vanne J. Hamalainen T.D. Kuusilinna K. 《IEEE transactions on circuits and systems. I, Regular papers》2005,52(12):2717-2725
Image scaling is a frequent operation in medical image processing. This paper presents how two-dimensional (2-D) image scaling can be accelerated with a new coarse-grained parallel processing method. The method is based on evenly divisible image sizes which is, in practice, the case with most medical images. In the proposed method, the image is divided into slices and all the slices are scaled in parallel. The complexity of the method is examined with two parallel architectures while considering memory consumption and data throughput. Several scaling functions can be handled with these generic architectures including linear, cubic B-spline, cubic, Lagrange, Gaussian, and sinc interpolations. Parallelism can be adjusted independent of the complexity of the computational units. The most promising architecture is implemented as a simulation model and the hardware resources as well as the performance are evaluated. All the significant resources are shown to be linearly proportional to the parallelization factor. With contemporary programmable logic, real-time scaling is achievable with large resolution 2-D images and a good quality interpolation. The proposed block-level scaling is also shown to increase software scaling performance over four times. 相似文献