MachineLearningNotebooks

Enforce consistency between datasets API and Models API

Open edgBR opened this issue 3 years ago • 0 comments

Currently, neither the AzureML Python SDK nor the CLI supports filtering datasets by tags.

When running:

from azureml.core import Dataset, Workspace
ws = Workspace.from_config()
Dataset.get_all(ws)

You get only the latest version of each dataset.

When you run:

from azureml.core import Model, Workspace
ws = Workspace.from_config()
Model.list(ws)

You get all versions of every model, and you can also filter by tags with Model.list(ws, tags=['key', ['key2', 'key2 value']]).

The behaviour of the Model API is what I expect, but the Dataset API is inconsistent with it: it offers neither the version listing nor the tag filter.

In fact, in the REST API it is already possible to use the tags property:

https://learn.microsoft.com/en-us/rest/api/azureml/2022-06-01-preview/data-containers/list?tabs=HTTP
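
Until something like this exists in the SDK, a client-side filter is a possible stopgap. The sketch below is only an illustration: `matches_tags` and `filter_datasets_by_tags` are hypothetical helpers, not part of the SDK, and they assume each dataset object exposes a `tags` dictionary. The filter format copies the one accepted by Model.list. The demo uses stand-in objects so it runs without a workspace.

```python
from types import SimpleNamespace


def matches_tags(item_tags, tag_filters):
    """Return True if item_tags satisfies every filter.

    Each filter is either 'key' (the tag must exist) or
    ['key', 'value'] (the tag must have exactly that value),
    the same shape that Model.list accepts for its tags argument.
    """
    item_tags = item_tags or {}
    for tag_filter in tag_filters:
        if isinstance(tag_filter, str):
            if tag_filter not in item_tags:
                return False
        else:
            key, value = tag_filter
            if item_tags.get(key) != value:
                return False
    return True


def filter_datasets_by_tags(datasets, tag_filters):
    """Hypothetical helper: keep entries of a name -> dataset mapping
    whose .tags attribute (assumed to be a dict) matches all filters."""
    return {
        name: dataset
        for name, dataset in datasets.items()
        if matches_tags(getattr(dataset, "tags", None), tag_filters)
    }


# Stand-in objects so the sketch runs without an Azure workspace.
demo = {
    "train": SimpleNamespace(tags={"stage": "prod", "owner": "team-a"}),
    "test": SimpleNamespace(tags={"stage": "dev"}),
    "raw": SimpleNamespace(tags={}),
}

prod_only = filter_datasets_by_tags(demo, [["stage", "prod"]])
has_stage = filter_datasets_by_tags(demo, ["stage"])
```

With a real workspace this would presumably be called as filter_datasets_by_tags(Dataset.get_all(ws), ['key', ['key2', 'key2 value']]). It still only sees the latest version of each dataset, so it does not replace proper support in the API.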

BR E

edgBR avatar Sep 20 '22 14:09 edgBR