Skip to content

johorun/MalSensor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 

Repository files navigation

README

MalSensor

Fast and Robust Windows Malware Classification.

Setup

  • IDA Pro: >= 7.5

Dataset

The dataset MalwareBazaar MalwareDrift and label files MalwareBazaar_Labels.csv MalwareBazaar_Labels.csv used in this paper and code come from the following paper.

[FSE2021] A Comprehensive Study on Learning-Based PE Malware Family Classification Methods.

Dataset:https://github.com/MHunt-er/Benchmarking-Malware-Family-Classification

Useage

Disassembly

Put the auto_script.py static_analysis.py in the IDA Pro working directory.

Run python auto_script.py -d Malware_path -i IDA_path


Generate function call graph gexf for MalwareBazaar

Run python APIextract.py to deduplicate the imp_API.txt file and generate APIlist.txt file

Put the .info files of malware generated by disassembly

Run python cg.py to generate the .gexf files representing the function call graph of each sample


Generate function call graph gexf for MalwareDrift

Run python APIextract.py to deduplicate the imp_API.txt file and generate APIlist.txt file

Put the .info files of malware generated by disassembly

Run python cg.py to generate the .gexf files representing the function call graph of each sample

Run python rename.py to rename the .gexf files according to the sha256 value of thier raw files

Run transform.py to move the .gexf files to the gexf_pre or gexf_post directory according to the MalwareDrift_Labels.csv


FeatureExtraction

Run python FeatureExtraction.py -d Gexf_path -o Output_path -c Centrality_type (For MalwareDrift, Gexf_path is the upper directory of gexf_pre and gexf_post)

Then, run python transtomulti.py to generate the feature csv file with sample feature vectors and label information according to the centrality.


Classification

Run python Classification.py -d features_csv_path -o Output_path -t Centrality_type

About

A fast and lightweight PE malware detector based on program behavior.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages