AMLSim icon indicating copy to clipboard operation
AMLSim copied to clipboard

Output CSV file Label not syncronized

Open Atif-ub opened this issue 5 years ago • 3 comments

Output files explained in wiki/Input-and-Output-Files are not aligned with the files generated. it doesnot say anything about the LOG.csv generated by run_AMLSim.sh script.

The Log.csv file generated by run_AMLSim.sh is quite unclear when we try to match it with accounts.csv

Can you help in understanding the output log file. and which file we can use to train the predict for ML algo.

LOG.csv

step type amount nameOrig oldbalanceOrg newbalanceOrig nameDest oldbalanceDest newbalanceDest isFraud alertID
1 CASH-OUT 70.86 0 0.0 0.0 1 0.0 70.86 0 -1
1 CASH-OUT 30.19 0 0.0 0.0 14 0.0 30.19 0 -1
2 CASH-OUT 54.1 0 0.0 0.0 0 0.0 54.1 0 -1

Atif-ub avatar Mar 06 '19 10:03 Atif-ub

samplelog.csv is so very confusing- tx.csv and cash_tx.csv are quite understandable . and all files have almost the same number of records.

which is the final set of files that I can use for further processing in ML. ? (train predict use cases)

Atif-ub avatar Mar 06 '19 10:03 Atif-ub

Sorry, we will add an explanation about log.csv. I will let you know when it finishes. Thank you for your feedback.

hkanezashi avatar Mar 06 '19 13:03 hkanezashi

I added a description of log.csv from the simulator: https://github.com/IBM/AMLSim/wiki/Input-and-Output-Files#simulator-log-csv This log file is not the input file of ML processing. We intend to construct a transaction network from this log file and then extract features for ML from the network.

hkanezashi avatar Mar 09 '19 13:03 hkanezashi