Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot use curated environment from azureml registry in command job #38594

Open
IvanHahan opened this issue Nov 18, 2024 · 1 comment
Open

Cannot use curated environment from azureml registry in command job #38594

IvanHahan opened this issue Nov 18, 2024 · 1 comment
Assignees
Labels
Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Machine Learning needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team.

Comments

@IvanHahan
Copy link

IvanHahan commented Nov 18, 2024

  • Package Name: azure ml
  • Operating System: linux

Describe the bug
When I call az ml job create --file <my_file>.yml, specifying one of the curated environments acft-mmdetection-image-gpu I get the error:

(UserError) No environment exists for name: acft-mmdetection-image-gpu, version: 46, label: 
Code: UserError
Message: No environment exists for name: acft-mmdetection-image-gpu, version: 46, label: 

az ml package is unable to find this existing environment acft-mmdetection-image-gpu under azureml registry.

When I perform az ml environment list, I'm unable to see acft-mmdetection-image-gpu environment in the list.
I can see this environment when I execute az ml environment list --registry azureml.

I also tried setting my environment in the job.yml file as environment: azureml://registries/azureml/environments/acft-mmdetection-image-gpu/versions/46, but it doesn't work either. same error appears.

What I can do is creating a custom environment that is inherited from the specified environment above and use it. But what should I do to use the original acft-mmdetection-image-gpu environment in my job?

To Reproduce

My job.yml file:

$schema: https://azuremlschemas.azureedge.net/latest/commandJob.schema.json
command: >-
  echo ""
environment: azureml:acft-mmdetection-image-gpu:46
compute: A100
resources:
  docker_args: '--ipc=host'
  instance_count: 1

Steps to reproduce the behavior:

  1. call az ml job create --file job.yml

Expected behavior
The training job is created

@github-actions github-actions bot added Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Machine Learning needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team. labels Nov 18, 2024
Copy link

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @Azure/azure-ml-sdk @azureml-github.

@achauhan-scc achauhan-scc self-assigned this Nov 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Machine Learning needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team.
Projects
None yet
Development

No branches or pull requests

2 participants