Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix "Deadlock in Scheduler While Handling High Concurrency DAGs" #45849

Open
1 of 2 tasks
shreyashdwivedi36 opened this issue Jan 21, 2025 · 1 comment
Open
1 of 2 tasks
Labels
area:core area:Scheduler including HA (high availability) scheduler kind:bug This is a clearly a bug needs-triage label for new issues that we didn't triage yet pending-response

Comments

@shreyashdwivedi36
Copy link

Apache Airflow version

2.10.4

If "Other Airflow 2 version" selected, which one?

No response

What happened?

The scheduler stops processing tasks due to a deadlock in the resource allocation mechanism. Logs show repeated warnings about locking mechanisms and no progress in DAG processing.

What you think should happen instead?

The scheduler should efficiently handle high concurrency workloads without encountering deadlocks. Tasks should progress as expected, and the system should avoid resource contention issues.

How to reproduce

  1. Deploy Apache Airflow 2.7.0 with a high-concurrency configuration.
  2. Submit a complex DAG with multiple tasks requiring concurrent execution.
  3. Observe the scheduler logs for deadlock warnings and stalled task processing.

Operating System

Ubuntu 20.04

Versions of Apache Airflow Providers

Run the following command in your environment where Apache Airflow is installed:

pip freeze | grep apache-airflow-providers

Deployment

Docker-Compose

Deployment details

  1. Airflow Deployment Type: Deployed using Docker Compose with the official Airflow Docker image.
  2. Environment Configuration: Includes PostgreSQL as the metadata database and Redis as the Celery backend.
  3. Customization: Custom configuration for logging and email alerts added to airflow.cfg.
  4. Version of Deployment Tools:
  • Docker: 20.10.14
  • Docker Compose: 2.12.2

Anything else?

Frequency of the Problem: The issue occurs consistently under high concurrency scenarios.

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@shreyashdwivedi36 shreyashdwivedi36 added area:core kind:bug This is a clearly a bug needs-triage label for new issues that we didn't triage yet labels Jan 21, 2025
@dosubot dosubot bot added the area:Scheduler including HA (high availability) scheduler label Jan 21, 2025
@potiuk
Copy link
Member

potiuk commented Jan 21, 2025

You report is confusing and ambiguous. You need to add the logs and dag example to make it more concrete and reproducible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:core area:Scheduler including HA (high availability) scheduler kind:bug This is a clearly a bug needs-triage label for new issues that we didn't triage yet pending-response
Projects
None yet
Development

No branches or pull requests

2 participants