nsl-kdd icon indicating copy to clipboard operation
nsl-kdd copied to clipboard

PySpark solution to the NSL-KDD dataset: https://www.unb.ca/cic/datasets/nsl.html

Results 2 nsl-kdd issues
Sort by recently updated
recently updated
newest added

`# Gaussian Mixture clustering from pyspark.ml.clustering import GaussianMixture t0 = time() gm = GaussianMixture(k=8, maxIter=150, seed=seed, featuresCol="pca_features", predictionCol="cluster", probabilityCol="gm_prob") gm_pipeline = Pipeline(stages=[pca_slicer, pca, gm]) gm_model = gm_pipeline.fit(scaled_train_df) gm_train_df = gm_model.transform(scaled_train_df).cache()...

Epoch [1/30], Loss: 0.0000 Accuracy on test set: 81.23% 0.009939057285945933 Epoch [2/30], Loss: 0.0245 Accuracy on test set: 76.59% 0.009757729755661013 Epoch [3/30], Loss: 0.0623 Accuracy on test set: 77.08% 0.009460482294732423...