UFC-Predictions icon indicating copy to clipboard operation
UFC-Predictions copied to clipboard

Slight improvement to accuracy score

Open apavlo89 opened this issue 3 years ago • 3 comments

Hello there,

I think I increased prediction accuracy (using 80%-20% split) ever so slightly using TPOT (no oversampling applied yet).

try this:

# Average CV score on the training set was: 0.6958245897228948
exported_pipeline = make_pipeline(
    make_union(
        make_pipeline(
            StackingEstimator(estimator=BernoulliNB(alpha=0.001, fit_prior=False)),
            ZeroCount()
        ),
        FunctionTransformer(copy)
    ),
    StackingEstimator(estimator=SGDClassifier(alpha=0.01, eta0=0.1, fit_intercept=False, l1_ratio=0.0, learning_rate="constant", loss="perceptron", penalty="elasticnet", power_t=0.1)),
    MaxAbsScaler(),
    XGBClassifier(learning_rate=0.1, max_depth=2, min_child_weight=19, n_estimators=100, n_jobs=1, subsample=0.4, verbosity=0)
)
# Fix random state for all the steps in exported pipeline
set_param_recursive(exported_pipeline.steps, 'random_state', 2)

exported_pipeline.fit(training_features, training_target)
results = exported_pipeline.predict(testing_features)

apavlo89 avatar Apr 07 '21 11:04 apavlo89

The accuracy score might increase even more as the optimisation code is still running on my unis cluster computer :P . I'll be updating main post with updated best

apavlo89 avatar Apr 07 '21 11:04 apavlo89

I also created a new variable called B_finish_decision_ratio (also for R_) where I sum all the decision finish columns / finish columns. This improves the score. Keep in mind when its 0/0 i just turn it into a 0 value

apavlo89 avatar Apr 07 '21 12:04 apavlo89

The accuracy score might increase even more as the optimisation code is still running on my unis cluster computer :P . I'll be updating main post with updated best

What accuracy did you end up with?

MathemAddict avatar Jan 03 '22 16:01 MathemAddict