ADBench icon indicating copy to clipboard operation
ADBench copied to clipboard

ADBench with custom data

Open Sreyashi-Bhattacharjee opened this issue 3 years ago • 1 comments

Hello, I am trying to replicate the demo notebook but using a different open source data. I am getting keyerror. I have tried changing the data as well and have reduced the number of rows as well. Please help.

Attaching screenshots

ad`b_2 adb_1

Sreyashi-Bhattacharjee avatar Sep 22 '22 06:09 Sreyashi-Bhattacharjee

I sincerely apologize for my late reply. We have released the latest version of ADBench, which now support the implementation of either customized dataset or algorithm. After installing ADBench and upgrading to its latest version, you can run AD algorithms (or your customized algorithm) on the customized dataset. See the following codes:

# customized model on ADBench's datasets
from adbench.run import RunPipeline
from adbench.baseline.Customized.run import Customized

# notice that you should specify the corresponding category of your customized AD algorithm
# for example, here we use Logistic Regression as customized clf, which belongs to the supervised algorithm
# for your own algorithm, you can realize the same usage as other baselines by modifying the fit.py, model.py, and run.py files in the adbench/baseline/Customized
pipeline = RunPipeline(suffix='ADBench', parallel='supervise', realistic_synthetic_mode=None, noise_type=None)
results = pipeline.run(clf=Customized)

# customized model on customized dataset
import numpy as np
dataset = {}
dataset['X'] = np.random.randn(1000, 20)
dataset['y'] = np.random.choice([0, 1], 1000)
results = pipeline.run(dataset=dataset, clf=Customized)

Please see the guidance for further details.

Minqi824 avatar Jul 28 '23 03:07 Minqi824