azure-docs icon indicating copy to clipboard operation
azure-docs copied to clipboard

AML Pipeline: azureml:dataset_name:@latest does not return latest registered dataset

Open thegitofdaniel opened this issue 3 years ago • 1 comments

I have submited a pipeline run using yaml + CLI2 command "az ml job create -f filename".

The pipeline gets created. However, the wrong version of dataset is selected if I use "azureml:dataset_name:@latest".

In the yaml I declare:

inputs:
  training_data: 
      type: mltable 
      path: azureml:greenfield_slp_combined_input@latest

I should get version 5 of the dataset, instead the pipeline gets version 1.

Not being able to get the latest version of the dataset is quite a big issue.

On a separate topic, "@latest" works fine when I try to select the latest environment.


Document Details

Do not edit this section. It is required for learn.microsoft.com ➟ GitHub issue linking.

thegitofdaniel avatar Dec 15 '22 09:12 thegitofdaniel

@thegitofdaniel

Thanks for your feedback! We will investigate and update as appropriate.

Hi @thegitofdaniel We have assigned this to content author @ynpandey to check and share his valuable insights on this.

Naveenommi-MSFT avatar Dec 19 '22 06:12 Naveenommi-MSFT

Please notice: it worked when I had a V2 datasets. It didn't get the latest only when I had a V1 dataset. Backward compatibility should be guaranteed.

thegitofdaniel avatar Dec 19 '22 12:12 thegitofdaniel

Thank you Daniel for the feedback! It's a known issue and we are currently working on it. This issue should be resolved by end of June.

SturgeonMi avatar May 22 '23 03:05 SturgeonMi