AMLSim
AMLSim copied to clipboard
Output CSV file Label not syncronized
Output files explained in wiki/Input-and-Output-Files are not aligned with the files generated. it doesnot say anything about the LOG.csv generated by run_AMLSim.sh script.
The Log.csv file generated by run_AMLSim.sh is quite unclear when we try to match it with accounts.csv
Can you help in understanding the output log file. and which file we can use to train the predict for ML algo.
LOG.csv
step | type | amount | nameOrig | oldbalanceOrg | newbalanceOrig | nameDest | oldbalanceDest | newbalanceDest | isFraud | alertID |
---|---|---|---|---|---|---|---|---|---|---|
1 | CASH-OUT | 70.86 | 0 | 0.0 | 0.0 | 1 | 0.0 | 70.86 | 0 | -1 |
1 | CASH-OUT | 30.19 | 0 | 0.0 | 0.0 | 14 | 0.0 | 30.19 | 0 | -1 |
2 | CASH-OUT | 54.1 | 0 | 0.0 | 0.0 | 0 | 0.0 | 54.1 | 0 | -1 |
samplelog.csv is so very confusing- tx.csv and cash_tx.csv are quite understandable . and all files have almost the same number of records.
which is the final set of files that I can use for further processing in ML. ? (train predict use cases)
Sorry, we will add an explanation about log.csv. I will let you know when it finishes. Thank you for your feedback.
I added a description of log.csv from the simulator: https://github.com/IBM/AMLSim/wiki/Input-and-Output-Files#simulator-log-csv This log file is not the input file of ML processing. We intend to construct a transaction network from this log file and then extract features for ML from the network.