首页 | 官方网站   微博 | 高级检索  
     

基于全局数据混洗的小样本数据预测方法
引用本文:赖峻,刘震宇,刘圣海.基于全局数据混洗的小样本数据预测方法[J].广东工业大学学报,2021,38(3):17-21.
作者姓名:赖峻  刘震宇  刘圣海
作者单位:广东工业大学 信息工程学院,广东 广州 510006
基金项目:广州市科技计划资助项目(201907010003)
摘    要:以广州车牌竞拍价格数据集为数据来源, 采用线性回归并结合k折交叉验证, 研究小样本数据集的预测方法。为解决小样本局部特异性数据导致的验证误差增大的问题, 提出验证之前先对数据进行全局混洗的策略。最后通过实验验证了此策略可以明显降低验证误差, 以此为基础, 通过多组实验验证, 确定了合适的参数, 结果表明最终预测值的总平均正确率达到了95%。

关 键 词:线性回归  k折交叉验证  随机梯度下降  数据混洗  深度学习  
收稿时间:2020-09-22

A Small Sample Data Prediction Method Based on Global Data Shuffling
Lai Jun,Liu Zhen-yu,Liu Sheng-hai.A Small Sample Data Prediction Method Based on Global Data Shuffling[J].Journal of Guangdong University of Technology,2021,38(3):17-21.
Authors:Lai Jun  Liu Zhen-yu  Liu Sheng-hai
Affiliation:School of Information Engineering, Guangdong University of Technology, Guangzhou 510006, China
Abstract:Based on the Guangzhou license plate auction price data set, linear regression combined with k-fold cross-validation is used to study the prediction method of a small sample data set. In order to solve the problem of increased verification errors caused by local specific data in a small sample set, a strategy to shuffle the data globally before verification is proposed. Finally, it is verified through experiments that this strategy can significantly reduce the verification error. Based on this, through multiple sets of experimental verification, the appropriate parameters are determined, and the results show that the total average correct rate of the final predicted value has reached 95%.
Keywords:linear regression  k-fold cross-validation  stochastic gradient descent  data shuffling  deep learning  
点击此处可从《广东工业大学学报》浏览原始摘要信息
点击此处可从《广东工业大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号