
A Small Sample Data Prediction Method Based on Global Data Shuffling

Citation:	Lai Jun, Liu Zhen-yu, Liu Sheng-hai. A Small Sample Data Prediction Method Based on Global Data Shuffling[J]. Journal of Guangdong University of Technology, 2021, 38(3): 17-21.

Authors:	Lai Jun, Liu Zhen-yu, Liu Sheng-hai

Affiliation:	School of Information Engineering, Guangdong University of Technology, Guangzhou 510006, China

Funding:	Guangzhou Science and Technology Program (201907010003)

Received:	2020-09-22

Abstract:	Using the Guangzhou license plate auction price data set as the data source, this work studies prediction on a small-sample data set with linear regression combined with k-fold cross-validation. In a small sample, locally atypical data can dominate a single fold and inflate the validation error. To address this, the data are shuffled globally before validation. Experiments confirm that this strategy markedly reduces the validation error. On that basis, further groups of experiments are used to determine suitable parameters, and the results show that the overall average accuracy of the final predictions reaches 95%.

Keywords:	linear regression; k-fold cross-validation; stochastic gradient descent; data shuffling; deep learning
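
The shuffle-then-validate idea described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration rather than the authors' implementation: the function name `k_fold_mse`, the synthetic toy data, the choice of k = 5, and the use of closed-form least squares via `numpy.linalg.lstsq` in place of a model trained by stochastic gradient descent. The sketch compares k-fold validation on contiguous folds with the same procedure applied after one global permutation of the samples.

```python
import numpy as np


def k_fold_mse(X, y, k=5, shuffle=False, seed=0):
    """Mean validation MSE of least-squares linear regression over k folds."""
    n = len(y)
    indices = np.arange(n)
    if shuffle:
        # Global shuffle: permute all sample indices once, before splitting.
        indices = np.random.default_rng(seed).permutation(n)
    folds = np.array_split(indices, k)

    # Append a bias column so the model is y ~ X w + b.
    Xb = np.column_stack([X, np.ones(n)])
    fold_errors = []
    for i in range(k):
        val_idx = folds[i]
        train_idx = np.concatenate([folds[j] for j in range(k) if j != i])
        # Closed-form least squares (a stand-in for an SGD-trained model).
        w, *_ = np.linalg.lstsq(Xb[train_idx], y[train_idx], rcond=None)
        residual = Xb[val_idx] @ w - y[val_idx]
        fold_errors.append(float(np.mean(residual ** 2)))
    return float(np.mean(fold_errors))


# Synthetic, ordered toy data (NOT the license plate data set):
# a linear trend in which one contiguous stretch is shifted upward.
rng = np.random.default_rng(42)
x = np.linspace(0.0, 1.0, 60)
y = 2.0 * x + 1.0 + rng.normal(scale=0.05, size=x.size)
y[24:36] += 0.5  # locally atypical block that fills one contiguous fold

mse_ordered = k_fold_mse(x.reshape(-1, 1), y, k=5, shuffle=False)
mse_shuffled = k_fold_mse(x.reshape(-1, 1), y, k=5, shuffle=True)
print(f"contiguous folds: {mse_ordered:.4f}  shuffled folds: {mse_shuffled:.4f}")
```

Without shuffling, the atypical block falls entirely into one validation fold, so the model for that fold never sees such samples during training. After the global shuffle, those samples are spread across folds. Whether and by how much this lowers the averaged validation error depends on the data, so the toy numbers printed here should not be read as a reproduction of the reported results.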
