首页 | 官方网站   微博 | 高级检索  
     

基于字典的域名生成算法生成域名的检测方法
引用本文:张永斌,常文欣,孙连山,张航. 基于字典的域名生成算法生成域名的检测方法[J]. 计算机应用, 2021, 41(9): 2609-2614. DOI: 10.11772/j.issn.1001-9081.2020111837
作者姓名:张永斌  常文欣  孙连山  张航
作者单位:陕西科技大学 电子信息与人工智能学院, 西安 710021
基金项目:陕西省自然科学基础研究计划项目(2019JM-354)。
摘    要:针对基于字典的域名生成算法(DGA)生成域名与良性域名构成十分相似,现有技术难以有效检测的问题,提出一种卷积神经网络(CNN)和长短时记忆(LSTM)网络相结合的网络模型——CL模型.该模型由字符嵌入层、特征提取层及全连接层三部分组成.首先,字符嵌入层对输入域名的字符进行编码;然后,特征提取层将CNN与LSTM串行连接...

关 键 词:域名生成算法  基于字典的域名生成算法  卷积神经网络  长短时记忆网络  域名检测
收稿时间:2020-11-24
修稿时间:2021-02-24

Detection method of domains generated by dictionary-based domain generation algorithm
ZHANG Yongbin,CHANG Wenxin,SUN Lianshan,ZHANG Hang. Detection method of domains generated by dictionary-based domain generation algorithm[J]. Journal of Computer Applications, 2021, 41(9): 2609-2614. DOI: 10.11772/j.issn.1001-9081.2020111837
Authors:ZHANG Yongbin  CHANG Wenxin  SUN Lianshan  ZHANG Hang
Affiliation:School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an Shaanxi 710021, China
Abstract:The composition of domain names generated by the dictionary-based Domain Generation Algorithm (DGA) is very similar to that of benign domain names and it is difficult to effectively detect them with the existing technology. To solve this problem, a detection model was proposed, namely CL (Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) network). The model includes three parts:character embedding layer, feature extraction layer and fully connected layer. Firstly, the characters of the input domain name were encoded by the character embedding layer. Then, the features of the domain name were extracted by connecting CNN and LSTM in serial way through the feature extraction layer. The n-grams features of the domain name were extracted by CNN and the extracted result were sent to LSTM to learn the context features between n-grams. Meanwhile, different combinations of CNNs and LSTMs were used to learn the features of n-grams with different lengths. Finally, the dictionary-based DGA domain names were classified and predicted by the fully connected layer according to the extracted features. Experimental results show that when the CNNs select the convolution kernel sizes of 3 and 4, the proposed model achives the best performance. In the four dictionary-based DGA family experiments, the accuracy of the CL model is improved by 2.20% compared with that of the CNN model. And with the increase of the number of sample families, the CL network model has a better stability.
Keywords:Domain Generation Algorithm (DGA)  dictionary-based DGA  Convolutional Neural Network (CNN)  Long Short-Term Memory (LSTM) network  domain name detection  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号