基于Spark Streaming的实时交通数据处理平台 Real-Time Traffic Data Processing System Based on Spark Streaming期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Spark Streaming的实时交通数据处理平台

引用本文：	谭亮,周静.基于Spark Streaming的实时交通数据处理平台[J].计算机系统应用,2018,27(10):133-139.

作者姓名：	谭亮周静

作者单位：	四川省交通运输发展战略和规划科学研究院, 成都 611130,四川省交通运输发展战略和规划科学研究院, 成都 611130

摘要：	交通大数据是解决城市交通问题的最基本条件，是制定宏观城市交通发展战略规划和进行微观道路交通管理与控制的重要保障.针对于智能交通系统中数据产生快、实时性强、数据量大的特点，本文基于Spark Streaming和Apache Kafka的组合构建了一个实时交通数据处理平台，用于处理通过双基基站采集的数据，采用时间窗口机制从持续的Kafka分布式消息队列中获取数据，并按照规则将数据分类处理后保存到数据库.本文对平台的系统架构和内部结构进行了详细的介绍，并通过实验验证了系统的实时处理能力，完全可以在大规模高并发的数据流下进行应用.
关键词：	大数据流处理系统双基基站数据 Spark Streaming Apache Kafka
收稿时间：	2018/3/9 0:00:00
修稿时间：	2018/4/3 0:00:00
Real-Time Traffic Data Processing System Based on Spark Streaming

TAN Liang and ZHOU Jing.Real-Time Traffic Data Processing System Based on Spark Streaming[J].Computer Systems& Applications,2018,27(10):133-139.

Authors:	TAN Liang and ZHOU Jing

Affiliation:	Sichuan Provincial Institute of Transportation Development Strategy and Planning, Chengdu 611130, China and Sichuan Provincial Institute of Transportation Development Strategy and Planning, Chengdu 611130, China

Abstract:	Traffic big data is the most basic condition for solving urban traffic problems. It is an important guarantee for formulating macro-city traffic development strategy and construction planning. And it is also an important guarantee for carrying out micro-road traffic management and control. In view of the characteristics of intelligent transportation system such as fast data generation, high real-time performance and large amount of data, this paper constructs a real-time traffic data processing platform based on the combination of Spark Streaming and Apache Kafka to process the data collected by dual base stations. Using time window mechanism to get data from Kafka, and process the data to the database according to the rules. In this study, the system architecture and internal structure of the platform are introduced in detail, and the real-time processing capability of the system is verified through experiments, which can be applied under large-scale and high-concurrency data flow.

Keywords:	big data stream processing system double station data Spark Streaming Apache Kafka

	点击此处可从《计算机系统应用》浏览原始摘要信息
	点击此处可从《计算机系统应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏