Antti "Andy" Törrönen

Results 57 comments of Antti "Andy" Törrönen

For Microsoft.ML.LightGBM class library adding GPU support went fairly easily. However, the benefits might not make it worth to do it so only very brief notes below. Maybe better to...

Current TransformInference may be unnecessarily complex. So maybe it could be best to rewrite transform inference to support the new tuners. 1. Support adding and removing "experts" 2. Support testing...

If anyone has same problem renamed header names can put in the parameter. This solves my issue. I do not know if LoadCsv should have this functionality inbuilt or not...

@LittleLittleCloud This is very useful. I have a follow-up question. Q: The sample has `var platt = context.BinaryClassification.Calibrators.Platt().Fit(**trainData**);` It is just a syntax sample so is it best to use...

One way to make file reading faster is to used [binary IDV files](https://docs.microsoft.com/en-us/dotnet/api/microsoft.ml.binaryloadersavercatalog.saveasbinary?view=ml-dotnet). However, this also fails: `var combinedDataView = mlContext.Data.LoadFromBinary(new MultiFileSource(TrainDataPath + ".idv", TestDataPath + ".idv"));` throws `'binary loader...

@luisquintanilla I dont recall trying wildcard. I only tried with list of filenames as parameters: https://github.com/dotnet/machinelearning/issues/6134#issuecomment-1073554230 I will put it on my task list to test, but the error message...

@luisquintanilla yes, you are correct, it is DataFrame.

@michaelgsharp I am thinking about something like a dictionary or hashset to select items quickly. For example, I might want get metrics for observations from each city separately: one test...

@michaelgsharp Our code is converting them to .ToDataFrame(-1) after reading the IDV file. I believe it puts them to RAM. However, these errors happen later, except maybe the last one....

I think FastTree needs lots of virtual memory in Model Builder too (e.g. https://github.com/dotnet/machinelearning-modelbuilder/issues/1875 ) If I had less than 500 Gb virtual memory the servicehub in model builder crashed...