AzureMLException: Message: Failed to flush task queue within 300.0 seconds | dreambooth finetune text to image #3313

Open
jyravi opened this issue Jul 25, 2024 · 0 comments
jyravi commented Jul 25, 2024

Operating System

Linux

Version Information

Running the sample on a Compute Instance within the Azure ML workspace.
The sample is the one provided in azureml-examples.
Pipeline component: diffusers_text_to_image_dreambooth_pipeline

Steps to reproduce

  1. Ran the notebook as-is from azureml-examples: "https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/system/finetune/text-to-image/diffusers-dreambooth-dog-text-to-image.ipynb"

Expected behavior

The model is fine-tuned successfully.

Actual behavior

The child job "text_to_image_dreambooth_finetune" failed with the following error:

Encountered an internal ACFT error. Error Message/Code: AzureMLException:
Message: Failed to flush task queue within 300.0 seconds. Please set AZUREML_ARTIFACTS_DEFAULT_TIMEOUT environment variable to increase the timeout(in seconds)
InnerException None
ErrorResponse
{
"error": {
"code": "UserError",
"message": "Failed to flush task queue within 300.0 seconds. Please set AZUREML_ARTIFACTS_DEFAULT_TIMEOUT environment variable to increase the timeout(in seconds) ",
"inner_error": {
"code": "ResourceExhausted",
"inner_error": {
"code": "Timeout"
}
}
}
}. Traceback: File "swallow_all_exceptions_decorator.py", line 68, in wrapper
return func(*args, **kwargs)
File "[Non-AutoML file]", line 1097, in [Non-AutoML function]
File "finetune_runner.py", line 252, in finetune_runner
mlflow.log_artifacts(component_args.output_dir, SettingParameters.DEFAULT_OUTPUT_DIR)
File "[Non-AutoML file]", line 1096, in [Non-AutoML function]
File "[Non-AutoML file]", line 1242, in [Non-AutoML function]
File "[Non-AutoML file]", line 570, in [Non-AutoML function]
File "[Non-AutoML file]", line 88, in [Non-AutoML function]
File "[Non-AutoML file]", line 97, in [Non-AutoML function]
File "[Non-AutoML file]", line 29, in [Non-AutoML function]
File "[Non-AutoML file]", line 55, in [Non-AutoML function]
File "[Non-AutoML file]", line 135, in [Non-AutoML function]
. Additional information: [Hidden as it may contain PII].

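Additional information

The error message suggests increasing the timeout by setting the AZUREML_ARTIFACTS_DEFAULT_TIMEOUT environment variable. On a compute instance, I am not sure where that variable should be set or which file to change to raise the default timeout.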
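
For reference, a minimal sketch of what setting the variable looks like in Python. The variable name comes from the error message; the helper name `set_artifact_timeout` and the value of 1800 seconds are assumptions chosen for illustration. The traceback shows the timeout being raised from `mlflow.log_artifacts` inside `finetune_runner.py`, so the variable presumably needs to be visible to the process that runs the fine-tune job, not only to the notebook kernel on the compute instance. How to pass it into the pipeline component is not confirmed here.

```python
import os

# Variable name taken from the error message in this issue.
TIMEOUT_VAR = "AZUREML_ARTIFACTS_DEFAULT_TIMEOUT"


def set_artifact_timeout(seconds: int = 1800) -> dict:
    """Set the artifact upload timeout for the current process.

    Hypothetical helper: 1800 is an arbitrary example value, not a
    recommended setting. Returns the variable as a dict so the same
    mapping can be handed to whatever launches the job.
    """
    if seconds <= 0:
        raise ValueError("timeout must be a positive number of seconds")
    # Environment variables are strings, so the number is converted.
    os.environ[TIMEOUT_VAR] = str(seconds)
    return {TIMEOUT_VAR: os.environ[TIMEOUT_VAR]}


env_vars = set_artifact_timeout(1800)
print(env_vars)
```

Setting the value this way only affects the current Python process and any child processes it starts afterwards, which is why the returned dict may be more useful than the side effect when the upload happens inside a remote job.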

@jyravi jyravi added the bug label Jul 25, 2024