Introduction
This dataset is part of my PhD research on malware detection and classification using Deep Learning. It contains static analysis data (PE Section Headers of the .text, .code and CODE sections) extracted from the 'pe_sections' elements of Cuckoo Sandbox reports. PE malware examples were downloaded from virusshare.com. PE goodware examples were downloaded from portableapps.com and from Windows 7 x86 directories.
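As a rough illustration of the extraction step, the sketch below pulls the .text, .code and CODE section headers out of a report that has already been loaded as a Python dictionary. The report layout (a 'static' element containing 'pe_sections'), the per-section key names (name, size_of_data, virtual_address, virtual_size, entropy), the hexadecimal string encoding of the integer fields, and the helper names to_int and extract_code_sections are all assumptions made for this example; they are not confirmed by this description and this is not the actual extraction script.

```python
from typing import Any, Dict, List

# Section names kept in this dataset, as listed in the introduction.
TARGET_SECTIONS = {".text", ".code", "CODE"}


def to_int(value: Any) -> int:
    """Convert an int, a decimal string, or a '0x...' hex string to int."""
    if isinstance(value, int):
        return value
    text = str(value).strip()
    return int(text, 16) if text.lower().startswith("0x") else int(text)


def extract_code_sections(report: Dict[str, Any], md5: str, label: int) -> List[Dict[str, Any]]:
    """Return one row per matching section, using the dataset's column names."""
    # Assumed location of the section list inside the report.
    sections = report.get("static", {}).get("pe_sections", [])
    rows = []
    for section in sections:
        if section.get("name") not in TARGET_SECTIONS:
            continue
        rows.append({
            "hash": md5,
            "sizeofdata": to_int(section["size_of_data"]),
            "virtualaddress": to_int(section["virtual_address"]),
            "entropy": float(section["entropy"]),
            "virtualsize": to_int(section["virtual_size"]),
            "malware": label,
        })
    return rows


# Synthetic report fragment for illustration only (not real data).
sample_report = {
    "static": {
        "pe_sections": [
            {"name": ".text", "size_of_data": "0x00000200",
             "virtual_address": "0x00001000", "virtual_size": "0x000001a4",
             "entropy": 6.1},
            {"name": ".rsrc", "size_of_data": "0x00000400",
             "virtual_address": "0x00003000", "virtual_size": "0x00000310",
             "entropy": 3.2},
        ]
    }
}

rows = extract_code_sections(sample_report, md5="0" * 32, label=0)
```

With the synthetic report above, only the .text entry is kept and the .rsrc entry is skipped.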
Features
Column name: hash
Description: MD5 hash of the example
Content: 32-character hexadecimal string
Column name: sizeofdata
Description: The size of the section on disk
Content: Integer
Column name: virtualaddress
Description: Memory address of the first byte of the section relative to the image base
Content: Integer
Column name: entropy
Description: Calculated entropy of the section
Content: Float
Column name: virtualsize
Description: The size of the section when loaded into memory
Content: Integer
Column name: malware
Description: Class label
Content: 0 (Goodware) or 1 (Malware)
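The column descriptions above can be turned into a small loader that converts each field to its documented type and rejects malformed rows. This is a minimal sketch that assumes the data is distributed as a comma-separated file with a header row using the column names listed above; the file layout, the helper names parse_row and load_rows, and the two sample rows are hypothetical and only serve to make the example runnable.

```python
import csv
import io
import re
from typing import Dict, List, TextIO, Union

Row = Dict[str, Union[str, int, float]]

# An MD5 digest written in hexadecimal is exactly 32 characters long.
MD5_PATTERN = re.compile(r"[0-9a-fA-F]{32}")


def parse_row(raw: Dict[str, str]) -> Row:
    """Convert one CSV record to the types given in the feature list."""
    row: Row = {
        "hash": raw["hash"].strip(),
        "sizeofdata": int(raw["sizeofdata"]),
        "virtualaddress": int(raw["virtualaddress"]),
        "entropy": float(raw["entropy"]),
        "virtualsize": int(raw["virtualsize"]),
        "malware": int(raw["malware"]),
    }
    if not MD5_PATTERN.fullmatch(str(row["hash"])):
        raise ValueError(f"not an MD5 hex digest: {row['hash']!r}")
    if row["malware"] not in (0, 1):
        raise ValueError(f"class must be 0 or 1, got {row['malware']!r}")
    return row


def load_rows(handle: TextIO) -> List[Row]:
    """Read every record from an open CSV file (header row assumed)."""
    return [parse_row(record) for record in csv.DictReader(handle)]


# Synthetic rows for illustration only (not taken from the dataset).
SAMPLE_CSV = (
    "hash,sizeofdata,virtualaddress,entropy,virtualsize,malware\n"
    "0123456789abcdef0123456789abcdef,512,4096,6.1,420,0\n"
    "fedcba9876543210fedcba9876543210,1024,4096,7.8,1000,1\n"
)

samples = load_rows(io.StringIO(SAMPLE_CSV))
malware_count = sum(1 for row in samples if row["malware"] == 1)
```

To read a real file, pass an open handle instead of the in-memory sample, for example `load_rows(open(path, newline=""))`, where `path` points to wherever the downloaded file was saved.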
Acknowledgements
Thank you Cuckoo Sandbox for developing such an amazing dynamic analysis environment!
Thank you VirusShare! Because sharing is caring!
Citations
Please refer to http://dx.doi.org/10.21227/2czh-es14