Conversation

@ashb (Member) commented Sep 15, 2020

Here it is. The "big" one. Smaller than I thought, but still a very fundamental change!

Depends upon #10949; please ignore the first commit when reviewing.

This PR is still in draft -- there are one or two TODOs left in the code that will need to be fixed before final merge, and while they are important, there is still enough here to start reviewing.

  • Test that my forward-port of the changes actually works again. (I wanted to get the code up for review, so there may be some transcription errors breaking things.)
  • DAG SLAs need to happen via the parsing process
  • Adding back dagrun_timeout support
  • dag_run.verify_integrity is slow, and we don't want to call it every time, just when the dag structure changes (which we can know now thanks to DAG Serialization)
  • Add a savepoint in verify_integrity (to avoid a rollback killing the whole transaction and releasing the locks; see the sketch after this list).
  • Produce benchmark figures against this branch, not a 4 month old version of master branch.
  • Unit test the hell out of this.
  • Add a config option to disable all locking (a.k.a. escape hatch) in case of unforeseen MySQL performance issues.
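
For the savepoint item above, a minimal sketch of the idea with SQLAlchemy (illustrative only, not the PR's actual code; session and dag_run are assumed to exist):

    from sqlalchemy.exc import SQLAlchemyError

    try:
        # begin_nested() emits a SAVEPOINT: if verify_integrity fails,
        # only this nested block is rolled back; the outer transaction
        # (and the row locks it holds) survives.
        with session.begin_nested():
            dag_run.verify_integrity(session=session)
    except SQLAlchemyError:
        pass  # the savepoint has been rolled back; locks are still held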

This PR implements scheduler HA as proposed in AIP-15. The high level design is as follows:

  • Move all scheduling decisions into SchedulerJob (requiring DAG serialization in the scheduler)
  • Use row-level locks to ensure schedulers don't stomp on each other
    (SELECT ... FOR UPDATE)
  • Use SKIP LOCKED for better performance when multiple schedulers are
    running. (MySQL < 8 and MariaDB don't support this.)
  • Scheduling decisions are not tied to the parsing speed, but can
    operate just on the database
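
As a rough illustration of the locking pattern (a SQLAlchemy sketch, not the PR's actual code; an open session is assumed):

    from airflow.models import DagRun
    from airflow.utils.state import State

    # Emits: SELECT ... FROM dag_run WHERE state = 'running'
    #        FOR UPDATE SKIP LOCKED
    # Rows already locked by another scheduler are silently skipped,
    # so two schedulers never work on the same DagRun.
    runs = (
        session.query(DagRun)
        .filter(DagRun.state == State.RUNNING)
        .with_for_update(skip_locked=True)
        .all()
    )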

DagFileProcessorProcess:

Previously this component was responsible for more than just parsing the
DAG files, as its name might imply. It was also responsible for creating
DagRuns, and for making scheduling decisions about TIs, moving them from
"None" to "scheduled" state.

This commit changes it so that the DagFileProcessorProcess will now
update the SerializedDAG row for this DAG, and make no scheduling
decisions itself.

To make the scheduler's job easier (so that it can make as many
decisions as possible without having to load the possibly-large
SerializedDAG row) we store/update some columns on the DagModel table:

  • next_dagrun: The execution_date of the next dag run that should be created (or
    None)
  • next_dagrun_create_after: The earliest point at which the next dag
    run can be created

Pre-computing these values (and updating them every time the DAG is
parsed) reduces the overall load on the DB, as many decisions can be taken
by selecting just these two columns/the small DagModel row.

In the case of max_active_runs being reached, or of @once DAGs, these
columns will be set to null, meaning "don't create any dag runs".
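
To illustrate how those columns might be derived at parse time (a condensed sketch; following_schedule is the existing DAG helper, and the real rules cover more cases):

    def update_schedule_columns(dag_model, dag, last_execution_date):
        # An @once DAG that has already run has nothing left to
        # schedule: null both columns ("don't create any dag runs").
        if dag.schedule_interval == "@once" and last_execution_date:
            dag_model.next_dagrun = None
            dag_model.next_dagrun_create_after = None
            return
        base = last_execution_date or dag.start_date
        # execution_date of the next run to create ...
        dag_model.next_dagrun = dag.following_schedule(base)
        # ... which only becomes creatable once its data interval has
        # fully elapsed, i.e. one schedule interval later.
        dag_model.next_dagrun_create_after = dag.following_schedule(
            dag_model.next_dagrun
        )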

SchedulerJob:

The SchedulerJob used to only queue/send tasks to the executor after
they were parsed, and returned from the DagFileProcessorProcess.

This PR breaks the link between parsing and enqueuing of tasks. Instead
of looking at DAGs as they are parsed, we now:

  • store a new datetime column, last_scheduling_decision, on the DagRun
    table, signifying when a scheduler last examined a DagRun
  • Each time around the loop the scheduler will get (and lock) the next
    n DagRuns via DagRun.next_dagruns_to_examine, prioritising DagRuns
    which haven't been touched by a scheduler in the longest period
  • SimpleTaskInstance etc. have been almost entirely removed now, as we
    use the serialized versions
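
For illustration, the shape of that prioritised query (a sketch, not the real method body):

    from airflow.models import DagRun
    from airflow.utils.state import State

    def next_dagruns_to_examine(session, max_runs):
        # Oldest scheduling decision first; never-examined runs (NULL)
        # sort to the front. SKIP LOCKED stops schedulers from blocking
        # on each other's rows.
        return (
            session.query(DagRun)
            .filter(DagRun.state == State.RUNNING)
            .order_by(DagRun.last_scheduling_decision.asc().nullsfirst())
            .limit(max_runs)
            .with_for_update(skip_locked=True)
            .all()
        )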

Part of #9630



@boring-cyborg boring-cyborg bot added area:docs area:Scheduler including HA (high availability) scheduler area:serialization labels Sep 15, 2020
@mik-laj mik-laj added the AIP-15 label Sep 15, 2020
@potiuk (Member) commented Sep 15, 2020

Cool. Looking at it

@ryw (Member) commented Sep 15, 2020

We're working on a short doc that will provide an easy way to test the components/PRs of AIP-15 together, plus some preliminary benchmarks for performance improvement; hopefully we'll have that out tomorrow.

Member:

Suggested change:
- :rtype: int or None
+ :rtype: Optional[None]

Member Author:

Doc typing uses a different syntax to type comments -- see https://www.sphinx-doc.org/en/master/usage/restructuredtext/domains.html#info-field-lists:

Multiple types in a type field will be linked automatically if separated by the word “or”:

:type an_arg: int or None
:vartype a_var: str or int
:rtype: float or str

Member:

Interesting, TIL 👌

Member:

This function returns a tuple or None, so the description in the docstring is incorrect:
int or None != Optional[Tuple[int, int]]

@ashb (Member Author) commented Sep 16, 2020

Process note: I intend to keep changes as fixups/separate commits even beyond rebasing/following master, hopefully to make it easier to re-review this in future. (I.e. I will use git rebase -i origin/master --no-autosquash.)

Member:

Can you tell us a little more about it? How can serializing the same object to the database produce different results? Is it by adding the next_dagrun, next_dagrun_create_after fields to the DAG in the sync_to_db method?

Member:

@ashb can you please take a look here?

Member Author:

This change was a bit of a hack here, and we'll tidy it up.

Since the scheduler now requires DAG serialization to operate, we need a way for dag bags to load dags from given files only, and not from the DB, but then also a way to make them write things back to the DB.

We'll pull this out into a separate PR, essentially removing the STORE_SERIALIZED_DAGS option entirely, and then bulk_sync_to_db can just always write to the DB.

@ashb force-pushed the scheduler-ha branch 2 times, most recently from 50782a6 to 74dba8a on September 16, 2020 at 14:56
@ashb (Member Author) commented Sep 17, 2020

One of the parts of this PR: it moves the creation of DAG runs out of the DAG file parser process and into the scheduler -- if you have a large number of dag files (1000, for instance), all with the same schedule_interval, then the time to parse each one sequentially can introduce significant delay in creating the DAG runs after the period ticks over.
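
(Rough worked example, not from the PR itself: at ~1 s of parse time per file with two parser processes, the last of those 1000 DagRuns would only be created around 1000 / 2 × 1 s = 500 s after the interval boundary; creating runs directly from the pre-computed DagModel columns removes that dependency on parse order.)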

@XD-DENG (Member) left a comment:

I don't have time to do a thorough check yet, but I have a few minor suggestions. FYI

@ashb (Member Author) commented Sep 18, 2020

I'm working on updating/adding to the tests -- this fixup commit is going to be a bit large, I'm afraid (at +444/-406 so far, about 60% through) -- as many tests have been re-organized between TestDagFileProcessor and TestSchedulerJob to reflect the location of the new code (for example, test_dag_with_system_exit doesn't make sense in TestSchedulerJob anymore, as the scheduler doesn't parse the dags at all).

@ashb (Member Author) commented Sep 18, 2020

As discussed with @potiuk, I'll extend this PR to add a config option to disable all locking (a.k.a. an escape hatch) in case of unforeseen MySQL performance issues. Locking (SELECT ... FOR UPDATE) will be on by default, so out of the box on MySQL 8+ and Postgres you can Just Run More Schedulers.
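
In airflow.cfg that would look something like this (the option name, use_row_level_locking under [scheduler], is the one that eventually landed; treat it as indicative only at this draft stage):

    [scheduler]
    # Escape hatch: disable SELECT ... FOR UPDATE row locking if the
    # database (e.g. older MySQL) has performance problems with it.
    # Only safe when running a single scheduler.
    use_row_level_locking = False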

@potiuk (Member) commented Sep 18, 2020

Fantastic! I will have some time over the weekend to finally look through it myself :).

@github-actions bot commented Oct 9, 2020

The Workflow run is cancelling this PR. It has some failed jobs matching ^Pylint$,^Static checks$,^Build docs$,^Spell check docs$,^Backport packages$,^Checks: Helm tests$,^Test OpenAPI*.


@ashb (Member Author) commented Oct 9, 2020

We're trying to get this PR green before merging -- although it doesn't strictly matter, it feels like an important one to be able to say "yes, this is all good" (and we know it's possible to get it green).

@ashb (Member Author) commented Oct 9, 2020

We got bored waiting for the GitHub Actions lottery (build timeouts, 137/OOM kill errors). We did have a green build just before though: https://github.com/apache/airflow/actions/runs/297696260

@ashb ashb merged commit 73b9163 into apache:master Oct 9, 2020
@potiuk (Member) commented Oct 9, 2020

🎉

@raphaelauv (Contributor):

@ashb does this mean the LocalExecutor is now thread-safe in its use of pools?

https://airflow.apache.org/docs/stable/concepts.html#pools

Pools are not thread-safe; in the case of more than one scheduler in LocalExecutor mode, you can't ensure the non-scheduling of a task even if the pool is full.

@ashb (Member Author) commented Nov 29, 2020

@raphaelauv That comment should be removed now -- multiple schedulers use a "mutex" (actually a lock on some DB rows) to avoid that exact problem.

(Also Airflow doesn't use threads, but processes. Technicality though)

@raphaelauv (Contributor):

@ashb thanks, I will open a PR.

@raphaelauv (Contributor):

@ashb One more question: how do we customise the number of threads (which are in practice processes, because of the GIL) of the scheduler?

There is no max_threads any more in http://apache-airflow-docs.s3-website.eu-central-1.amazonaws.com/docs/apache-airflow/latest/configurations-ref.html#scheduler

Thank you
