CNS_final_proj_EDOS_Detection_Fooling

Preprocess

python preprocess.py

preprocess.py will generate two file, train and test, which are the inputs of libSVM.
Details
- For the row written in float or int, I did normalization.
- For the row written in words, such as attack categories, I applied one hot to represent each category.

python preprocess_csv.py

preprocess_csv.py generates two csv files, train.csv and test.csv under preprocess directory.
Details
- Map attack category to multiclass label
- Drop rows containing NA (seems none)
- Min-max normalization
- One hot encoding

The current implementation of analyze.py takes one or two .csv files in the format of UNSW_NB15 as input.

For one file, comment the comparison section. Running analyze.py outputs analysis that contains the average and standard deviation of each continous data category.
For two files, running analyze.py outputs analysis that contains the aforementioned information for both files, and also the difference between the two (both by value and by value / std).

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
analysis		analysis
data		data
preprocess		preprocess
svm		svm
README.md		README.md
get-pip.py		get-pip.py
parse.py		parse.py
preprocess.py		preprocess.py
preprocess_csv.py		preprocess_csv.py
rf.py		rf.py
rf_adv_model_prediction		rf_adv_model_prediction
rf_adv_model_prediction.csv		rf_adv_model_prediction.csv
rf_detection_fooling_case		rf_detection_fooling_case
rf_detection_fooling_case.csv		rf_detection_fooling_case.csv
rf_svm.py		rf_svm.py
svm.py		svm.py
svm_adv_model_prediction		svm_adv_model_prediction
svm_adv_model_prediction.csv		svm_adv_model_prediction.csv
svm_rf.py		svm_rf.py