TY - JOUR
T1 - Towards the development of realistic botnet dataset in the Internet of Things for network forensic analytics
T2 - Bot-IoT dataset
AU - Koroniotis, Nickolaos
AU - Moustafa, Nour
AU - Sitnikova, Elena
AU - Turnbull, Benjamin
PY - 2019/5/22
Y1 - 2019/5/22
N2 - The proliferation of IoT systems, has seen them targeted by malicious third parties. To address this challenge, realistic protection and investigation countermeasures, such as network intrusion detection and network forensic systems, need to be effectively developed. For this purpose, a well-structured and representative dataset is paramount for training and validating the credibility of the systems. Although there are several network datasets, in most cases, not much information is given about the Botnet scenarios that were used. This paper proposes a new dataset, so-called Bot-IoT, which incorporates legitimate and simulated IoT network traffic, along with various types of attacks. We also present a realistic testbed environment for addressing the existing dataset drawbacks of capturing complete network information, accurate labeling, as well as recent and complex attack diversity. Finally, we evaluate the reliability of the BoT-IoT dataset using different statistical and machine learning methods for forensics purposes compared with the benchmark datasets. This work provides the baseline for allowing botnet identification across IoT-specific networks. The Bot-IoT dataset can be accessed at Bot-iot (2018)[1].
AB - The proliferation of IoT systems, has seen them targeted by malicious third parties. To address this challenge, realistic protection and investigation countermeasures, such as network intrusion detection and network forensic systems, need to be effectively developed. For this purpose, a well-structured and representative dataset is paramount for training and validating the credibility of the systems. Although there are several network datasets, in most cases, not much information is given about the Botnet scenarios that were used. This paper proposes a new dataset, so-called Bot-IoT, which incorporates legitimate and simulated IoT network traffic, along with various types of attacks. We also present a realistic testbed environment for addressing the existing dataset drawbacks of capturing complete network information, accurate labeling, as well as recent and complex attack diversity. Finally, we evaluate the reliability of the BoT-IoT dataset using different statistical and machine learning methods for forensics purposes compared with the benchmark datasets. This work provides the baseline for allowing botnet identification across IoT-specific networks. The Bot-IoT dataset can be accessed at Bot-iot (2018)[1].
KW - Bot-IoT dataset
KW - Forensics analytics
KW - Network flow
KW - Network forensics
UR - http://www.scopus.com/inward/record.url?scp=85066442910&partnerID=8YFLogxK
U2 - 10.1016/j.future.2019.05.041
DO - 10.1016/j.future.2019.05.041
M3 - Article
AN - SCOPUS:85066442910
SN - 0167-739X
VL - 100
SP - 779
EP - 796
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -