Machine-Learning-approach-for-Malware-Detection icon indicating copy to clipboard operation
Machine-Learning-approach-for-Malware-Detection copied to clipboard

A Machine Learning approach for classifying a file as Malicious or Legitimate

Machine-Learning-approach-for-Malware-Detection

A Machine Learning approach for classifying a file as Malicious or Legitimate.

This approach tries out 6 different classification algorithms before deciding which one to use for prediction by comparing their results. Different Machine Learning models tried are, Linear Regression, RandomForest, DecisionTree, Adaboost, Gaussian, Gradient Boosting.

In order to test the model on an unseen file, it's required to extract the characteristics of the given file. Python's pefile.PE library is used to construct and build the feature vector and a ML model is used to predict the class for the given file based on the already trained model.

Dependencies

  • pandas pip install pandas
  • numpy pip install numpy
  • pickle pip install pickle
  • scipy pip install scipy
  • scikit pip install -U scikit-learn