Antti "Andy" Törrönen
Antti "Andy" Törrönen
Code: IDataView trainData = DataFrame.LoadCsv(TrainDatasetPath, separator: ';', header: true, guessRows: 100); Gives exception: DataFrame already contains a column called Target20 (Parameter 'column') Suggestion: It would be nice if LoadCsv would...
**System Information (please complete the following information):** - OS & Version: Windows 11 - ML.NET Version: ML.NET v1.5.5 - .NET Version: NET6.0 **Describe the bug** Out-of-memory errors on FastTree. There...
I have: - IDataView trainingData - IDataView testData I want to combine trainingData and testData `IDataView combinedData = trainingData + testData;` Finally, I want to retrain the model with it....
I filter data from a dataview to get all items within a specific time period. It seems slow compared to filtering with LINQ from objects in memory. Is there a...
I am running the new PFI API (main branch with #5934) for a FastTreeBinary loaded model created by AutoML API. **Main question:** I receive items like "Slot 48416" from `MLContext.BinaryClassification.PermutationFeatureImportanceNonCalibrated().`...
Creating pipelines for big datasets can be complex. Datasets may have hundreds or thousands of columns. Type of columns may vary. Even if the type is text, some may be...
**System Information (please complete the following information):** - OS & Version: Windows 11 - ML.NET Version: ML.NET 1.6.0 - .NET Version: .NET 6.0 **Describe the bug** Dataset may include long...
Per my understanding: 1) Default values for trainers are set based on experience from a wide range of dataset. Therefore, they should be a good starting point. 2) AutoML start...
**System Information:** - OS & Version: Windows 11 - ML.NET Version: 1.6.0 (custom build with autoML search space edited) - .NET Version: .NET 5.0 **Describe the bug** I am running...
I have about 80-160 GB CSV files (2000 - 10 000 features, including string, floats and ints) I am trying to run with Autokeras. However, after .fit() I see first...