machinelearning
machinelearning copied to clipboard
ML.NET is an open source and cross-platform machine learning framework for .NET.
LLM tokenizers are a crucial component in Large Language Models (LLMs) like GPT-3 or BERT. They are responsible for the tokenization process, which involves breaking down natural language text into...
The most significant changes include the removal of the null check for "args" in the "Main" method, the addition of a "try-catch" block to handle exceptions during the execution of...
This is a portion of the work from https://github.com/dotnet/machinelearning/pull/6749 This moves the tests forward to net8.0, and cleans up RemoteExecutor (which I noticed in @tarekgh's PR). I minimized the product...
**Is your feature request related to a problem? Please describe.** No **Describe the solution you'd like** Add the option to modify the idataview (such as in preFeaturizer) but a "postFeaturizer"...
**System Information (please complete the following information):** - OS & Version: either Windows 10 or 11 - ML.NET Version: versions 1.7 and 3.0 - .NET Version: .netframework 4.7 **Describe the...
fix #7135 Describe the bug Accessing data by column after adding columns to a DataFrame returns error data ```` var df = DataFrame.LoadCsvFromString("a1,a2\n1,2\n3,4"); var dc0 = DataFrameColumn.Create("a0", new int[] {...
Hi, thanks for the effort put into the Microsoft.ML.Tokenizers! I'm the author of the last performance improvements in `SharpToken` library. Since MLTokenizers are faster now than SharpToken I looked into...
**Is your feature request related to a problem? Please describe.** Currently when you use `LoadCsv` or `LoadCsvFromString` without supplying data types for each column, the code will try to guess...
**System Information (please complete the following information):** - Model Builder Version: ml.net CLI 16.1.1 - Visual Studio Version 8.7.6 for macs **Describe the bug** Tried to change model input to...
**Is your feature request related to a problem? Please describe.** I need a way to visualize how my model is learning during training, which is a comparison between training loss...