首页 | 官方网站   微博 | 高级检索  
     

考虑数据不均衡的居民用户负荷曲线分类方法
作者姓名:张慧波  王守相  赵倩宇  任杰  王海
作者单位:天津大学,天津大学,天津大学,国网冀北张家口风光储输新能源有限公司,国网冀北张家口风光储输新能源有限公司
基金项目:河北省省级科技计划资助(高效高可靠低压交直流微电网关键技术及示范应用)
摘    要:由于用户用电行为的多样性和随机性,负荷数据存在分布不均衡的问题,传统负荷曲线分类方法在处理不均衡数据时性能较差,为此,提出一种改进K-means与LSTM(long short term memory network)-CNN(convolutional neural network)分类模型结合的负荷曲线分类方法。首先,为提升K-means算法对不均衡数据的聚类效果,基于密度峰值聚类算法(density peaks clustering,DPC)思想,提出一种相对k近邻密度峰值初始聚类中心选取方法(related k-nearest neighbor density peaks,RKDP),将其作为K-means算法的初始中心进行聚类;其次,为提高RKDP_K-means处理高维负荷数据的性能,采用LSTM自编码器进行特征降维后再聚类(LSTM-auto-encoder RKDP_K-means,LARK)获得精准类别标签;最后,基于LSTM和CNN网络分别提取负荷特征构建负荷曲线分类模型,实现对大规模负荷曲线的分类。算例表明在大规模负荷曲线分类时,相比于LARK算法,本文所提方法轮廓系数指标提升29.7%,效率提升3.46倍,具有良好的负荷曲线分类效果。

关 键 词:负荷曲线分类  不均衡数据  改进K-means  LSTM  CNN
收稿时间:2021/7/2 0:00:00
修稿时间:2021/8/19 0:00:00

Residential user load curve classification method considering data imbalance
Authors:ZHANG Huibo  WANG Shouxiang  ZHAO Qianyu  REN Jie  WANG Hai
Affiliation:Key Laboratory of Smart Grid of Ministry of Education (Tianjin University), Tianjin 300072, China;State Grid Hebei Zhangjiakou Scenic Storage and Transportation New Energy Co., Ltd., Zhangjiakou 075061, China
Abstract:With the development of smart grids, the dimension and volume of load data continue to increase. At the same time, due to the diversity and randomness of users'' electricity consumption behaviors, the imbalance of load data classes is increasingly obvious. Traditional load curve classification technologies have become ineffective to deal with the im-balanced class problem of data. Therefore, an algorithm combing improved K-means with long short term memory net-work (LSTM) and convolutional neural network (CNN) classification model is proposed. Firstly, to improve the classifica-tion accuracy of the K-means on imbalanced data, a method of relative k-nearest neighbor density peaks(RKDP) is proposed to select the initial clustering centre of K-means. Secondly, an auto-encoder based on LSTM (LSTM-AE) is used to extract load characteristics from high dimensional data, and combined with RKDP_K-means to obtain accu-rate load profiles labels. Finally, a classification model based on the LSTM and CNN networks is used to realize the classification of large-scale load profiles. The results show that compared with LARK(LSTM-AE RKDP_K-means) al-gorithm, the Silhouette Coefficient of the proposed method is improved by 29.7%, and the efficiency is improved by 3.46 times.
Keywords:load curve classification  unbalanced data  improved K-means  LSTM  CNN
点击此处可从《》浏览原始摘要信息
点击此处可从《》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号