Gani Nazirov issues

Results 23 issues of


                                            Gani Nazirov

[WIP] ONNX conversion

Changes needed to convert DeBerta to ONNX

ONNX model of ToKey outputs 1 based key, while 0 based expected

Repro `from nimbusml.datasets import get_dataset from nimbusml.preprocessing import OnnxRunner, ToKey iris_df = get_dataset("iris").as_df() iris_df = iris_df.drop(['Label'], axis=1) transform = ToKey()

upgrade to ML.NET 1.5.1

DatasetTransformer to work with predictor models

Currently if you DatasetTransformer with predictor model it outputs all the hidden fields. It needs to ouput only Score and optionally PredictedLabel if its classifier for ex, Probabilities if available.

Set of new timeseries transforms

New transforms added: LagLeadOperator SimpleRollingWindow AnalyticalRollingWindow ShortDrop ForecastingPivot

Support TreeFeaturizer transform

Currently TreeFeaturizer is generated as a python user class, but it doesnt work, also no samples/tests. The changes to support it would require supporting PredictorModel class in GraphRunner parsing logic...

ONNX model for NgramFeaturizer doesnt output actual tokens

Repro ` from nimbusml.datasets import get_dataset from nimbusml import FileDataStream from nimbusml.preprocessing import OnnxRunner from nimbusml.feature_extraction.text import NGramFeaturizer from nimbusml.feature_extraction.text.extractor import Ngram path = get_dataset("wiki_detox_train").as_filepath() data = FileDataStream.read_csv(path, sep='\t') transformer...

Gani Nazirov