MachineLearningNotebooks
MachineLearningNotebooks copied to clipboard
Enforce consistency between datasets API and Models API
Currently AzureML Python SDK as well as the CLI does not support to filter datasets by using tags.
When doing :
from azureml.core import Run, Datastore, Workspace
from azureml.core import Dataset
ws = Workspace.from_config()
Dataset.get_all(ws)
You get only the latest version of a dataset.
When you do:
from azureml.core import Run, Datastore, Workspace, Model
from azureml.core import Dataset
ws = Workspace.from_config()
Model.list(ws)
You get all of the models versions and you can actually use Model.list(ws, tags=['key', ['key2', 'key2 value']]).
The behaviour with the Model API is what I expect. But the one with datasets is inconsistent.
In fact in the REST API it is possible to use the tags property:
https://learn.microsoft.com/en-us/rest/api/azureml/2022-06-01-preview/data-containers/list?tabs=HTTP
BR E