Airflow sensors

Apache Airflow sensors are a special kind of operator designed to wait for something to happen. When a sensor runs, it checks whether a certain condition is met before it is marked successful and lets its downstream tasks execute.

In this guide, you’ll learn how sensors are used in Airflow, best practices for implementing sensors in production, and how to use deferrable versions of sensors.

Tip

Sensors are used to wait for a condition to be met before executing downstream tasks. Many sensors come with a deferrable mode, which allows them to release their worker slot while waiting for the condition to be met, increasing the efficiency of your DAGs. See Deferrable operators for more information.

If you want a DAG to run based on messages in a messaging queue, consider using event-driven scheduling instead of sensors.

Assumed knowledge

To get the most out of this guide, you should have an understanding of:

Basic Airflow concepts. See Introduction to Apache Airflow.
Basic Python. See the Python Documentation.

Sensor basics

Sensors are a type of operator that checks if a condition is met at a specific interval. If the condition is met, the task is marked successful and the DAG can move to downstream tasks. If the condition isn’t met, the sensor waits for another interval before checking again.

All sensors inherit from the BaseSensorOperator and have the following parameters:

mode: How the sensor operates. There are two modes:
- poke: This is the default mode. When using poke, the sensor occupies a worker slot for the entire execution time and sleeps between pokes. This mode is best if you expect a short runtime for the sensor.
- reschedule: When using this mode, if the condition is not met, the sensor releases its worker slot and reschedules the next check for a later time. This mode is best if you expect a long runtime for the sensor, because it is less resource intensive and frees up workers for other tasks.
poke_interval: The time in seconds that the sensor waits before checking the condition again. The default is 60 seconds.
exponential_backoff: When set to True, the sensor waits exponentially longer between pokes.
timeout: The maximum amount of time in seconds that the sensor checks the condition. If the condition is not met within the specified period, the task fails.
soft_fail: If set to True, the task is marked as skipped if the condition is not met by the timeout.
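
To make the interaction between these parameters concrete, the following is a minimal sketch of a poke-mode loop in plain Python. It is a simplified model and not Airflow's implementation: run_sensor, SensorTimeout, and the clock and sleep arguments are hypothetical names introduced only for illustration.

```python
import time
from typing import Callable


class SensorTimeout(Exception):
    """Hypothetical stand-in for the error raised when a sensor times out."""


def run_sensor(
    poke: Callable[[], bool],
    poke_interval: float = 60.0,
    timeout: float = 7 * 24 * 60 * 60,  # seven days, the default named in this guide
    soft_fail: bool = False,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Check poke() repeatedly until it returns True or the timeout elapses."""
    started = clock()
    while True:
        if poke():
            return "success"
        if clock() - started > timeout:
            # soft_fail turns a timeout into a skip instead of a failure
            if soft_fail:
                return "skipped"
            raise SensorTimeout(f"condition not met within {timeout} seconds")
        # in poke mode the worker slot stays occupied while sleeping
        sleep(poke_interval)


if __name__ == "__main__":
    # the condition is met on the third check
    answers = iter([False, False, True])
    print(run_sensor(lambda: next(answers), poke_interval=0.01, timeout=5))
```

The clock and sleep arguments exist only so that the loop can be exercised without real waiting. In reschedule mode the sleep step would instead release the worker slot and schedule the next check.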

Different types of sensors have different implementation details.

Commonly used sensors

Many Airflow provider packages contain sensors that wait for various criteria in different source systems. The following are some of the most commonly used sensors:

@task.sensor decorator: Allows you to turn any Python function that returns a PokeReturnValue into an instance of the BaseSensorOperator class. This way of creating a sensor is useful when checking for complex logic or if you are connecting to a tool via an API that has no specific sensor available.
S3KeySensor: Waits for a key (file) to appear in an Amazon S3 bucket. This sensor is useful if you want your DAG to process files from Amazon S3 as they arrive.
DateTimeSensor: Waits for a specified date and time. This sensor is useful if you want different tasks within the same DAG to run at different times.
ExternalTaskSensor: Waits for an Airflow task to be completed. This sensor is useful if you want to implement cross-DAG dependencies in the same Airflow environment.
HttpSensor: Waits for an API to be available. This sensor is useful if you want to ensure your API requests are successful.
SqlSensor: Waits for data to be present in a SQL table. This sensor is useful if you want your DAG to process data as it arrives in your database.

To review the available Airflow sensors, go to the Airflow Registry.

Example implementation

The following example DAG shows how you might use the SqlSensor sensor:

Taskflow

from airflow.decorators import task, dag
from airflow.providers.common.sql.sensors.sql import SqlSensor

from typing import Any, Dict
from pendulum import datetime


def _success_criteria(record):
    # the sensor succeeds when this returns a truthy value
    return record


def _failure_criteria(record):
    # the sensor fails when this returns True
    return not record


@dag(
    description="DAG in charge of processing partner data",
    start_date=datetime(2021, 1, 1),
    schedule="@daily",
    catchup=False,
)
def partner():
    waiting_for_partner = SqlSensor(
        task_id="waiting_for_partner",
        conn_id="postgres",
        sql="sql/CHECK_PARTNER.sql",
        parameters={"name": "partner_a"},
        success=_success_criteria,
        failure=_failure_criteria,
        fail_on_empty=False,
        poke_interval=20,
        mode="reschedule",
        timeout=60 * 5,
    )

    @task
    def validation() -> Dict[str, Any]:
        return {"partner_name": "partner_a", "partner_validation": True}

    @task
    def storing():
        print("storing")

    waiting_for_partner >> validation() >> storing()


partner()

Traditional

from airflow import DAG
from airflow.operators.python import PythonOperator
from airflow.providers.common.sql.sensors.sql import SqlSensor

from typing import Any, Dict
from pendulum import datetime


def _success_criteria(record):
    # the sensor succeeds when this returns a truthy value
    return record


def _failure_criteria(record):
    # the sensor fails when this returns True
    return not record


with DAG(
    dag_id="partner",
    description="DAG in charge of processing partner data",
    start_date=datetime(2021, 1, 1),
    schedule="@daily",
    catchup=False,
):
    waiting_for_partner = SqlSensor(
        task_id="waiting_for_partner",
        conn_id="postgres",
        sql="sql/CHECK_PARTNER.sql",
        parameters={"name": "partner_a"},
        success=_success_criteria,
        failure=_failure_criteria,
        fail_on_empty=False,
        poke_interval=20,
        mode="reschedule",
        timeout=60 * 5,
    )

    def validation_function() -> Dict[str, Any]:
        return {"partner_name": "partner_a", "partner_validation": True}

    validation = PythonOperator(
        task_id="validation", python_callable=validation_function
    )

    def storing_function():
        print("storing")

    storing = PythonOperator(task_id="storing", python_callable=storing_function)

    waiting_for_partner >> validation >> storing

This DAG waits for data to be available in a Postgres database before running the validation and storing tasks. The SqlSensor runs a SQL query and is marked successful when that query returns data, specifically when the result is not in the set (0, '0', '', None). The SqlSensor task in the example DAG (waiting_for_partner) runs the CHECK_PARTNER.sql script every 20 seconds (the poke_interval) until data is returned. The mode is set to reschedule, so the task does not occupy a worker slot between checks. The timeout is set to 5 minutes, and the task fails if the data doesn’t arrive within that time. When the SqlSensor criteria are met, the DAG moves to the downstream tasks.
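
The success logic described above can be sketched in plain Python. This is a simplified model written for this guide, not the provider's source code: evaluate_records and SensorFailure are hypothetical names, and the sketch assumes that the check and the success and failure callables are applied to the first cell of the first returned row.

```python
from typing import Any, Callable, Optional, Sequence


class SensorFailure(Exception):
    """Hypothetical stand-in for the error that fails the sensor task."""


def evaluate_records(
    records: Sequence[Sequence[Any]],
    success: Optional[Callable[[Any], Any]] = None,
    failure: Optional[Callable[[Any], Any]] = None,
    fail_on_empty: bool = False,
) -> bool:
    """Return True when the sensor condition is met for one query result."""
    if not records:
        # no rows yet: fail right away or keep waiting, depending on fail_on_empty
        if fail_on_empty:
            raise SensorFailure("query returned no rows")
        return False

    first_cell = records[0][0]  # assumption: only the first cell is inspected

    if failure is not None and failure(first_cell):
        raise SensorFailure(f"failure criteria met for {first_cell!r}")
    if success is not None:
        return bool(success(first_cell))

    # default rule stated in this guide
    return first_cell not in (0, "0", "", None)


if __name__ == "__main__":
    # same callables as in the example DAG
    print(evaluate_records([("partner_a",)], success=lambda r: r, failure=lambda r: not r))
```

With the callables from the example DAG, an empty result keeps the sensor waiting because fail_on_empty is False, while a returned row whose first cell is falsy would satisfy the failure criteria in this model.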

Sensor decorator / PythonSensor

If no sensor exists for your use case, you can create your own using either the @task.sensor decorator or the PythonSensor. The @task.sensor decorator turns a Python function that returns a PokeReturnValue into an instance of the BaseSensorOperator. The PythonSensor takes a python_callable that returns True or False.

The following DAG shows how to use either the sensor decorator or the PythonSensor to create the same custom sensor:

Taskflow

"""
### Create a custom sensor using the @task.sensor decorator

This DAG showcases how to create a custom sensor using the @task.sensor decorator
to check the availability of an API.
"""

from airflow.decorators import dag, task
from pendulum import datetime
import requests

# importing the PokeReturnValue
from airflow.sensors.base import PokeReturnValue


@dag(start_date=datetime(2022, 12, 1), schedule="@daily", catchup=False)
def sensor_decorator():
    # supply inputs to the BaseSensorOperator parameters in the decorator
    @task.sensor(poke_interval=30, timeout=3600, mode="poke")
    def check_dog_availability() -> PokeReturnValue:
        r = requests.get("https://random.dog/woof.json")
        print(r.status_code)

        # set the condition to True if the API response was 200
        if r.status_code == 200:
            condition_met = True
            operator_return_value = r.json()
        else:
            condition_met = False
            operator_return_value = None
            print(f"Woof URL returned the status code {r.status_code}")

        # the function has to return a PokeReturnValue
        # if is_done=True the sensor will exit successfully, if
        # is_done=False, the sensor will either poke or be rescheduled
        return PokeReturnValue(is_done=condition_met, xcom_value=operator_return_value)

    # print the URL to the picture
    @task
    def print_dog_picture_url(url):
        print(url)

    print_dog_picture_url(check_dog_availability())


sensor_decorator()

Here, the @task.sensor decorates the check_dog_availability() function, which checks if a given API returns a 200 status code. If the API returns a 200 status code, the sensor task is marked as successful. If any other status code is returned, the sensor pokes again after the poke_interval has passed.

The optional xcom_value parameter in PokeReturnValue defines what data is pushed to XCom once is_done=True. You can use the data that was pushed to XCom in any downstream task.

Traditional

"""
### Create a custom sensor using the PythonSensor

This DAG showcases how to create a custom sensor using the PythonSensor
to check the availability of an API.
"""

from airflow.decorators import dag, task
from pendulum import datetime
import requests
from airflow.sensors.python import PythonSensor


def check_dog_availability_func(**context):
    r = requests.get("https://random.dog/woof.json")
    print(r.status_code)

    # return True if the API response was 200
    if r.status_code == 200:
        operator_return_value = r.json()
        # pushing the link to the dog picture to XCom
        context["ti"].xcom_push(key="return_value", value=operator_return_value)
        return True
    else:
        print(f"Woof URL returned the status code {r.status_code}")
        return False


@dag(
    start_date=datetime(2022, 12, 1),
    schedule=None,
    catchup=False,
    tags=["sensor"],
)
def pythonsensor_example():
    # turn any Python function into a sensor
    check_dog_availability = PythonSensor(
        task_id="check_dog_availability",
        poke_interval=10,
        timeout=3600,
        mode="reschedule",
        python_callable=check_dog_availability_func,
    )

    # click the link in the logs for a cute picture :)
    @task
    def print_dog_picture_url(url):
        print(url)

    print_dog_picture_url(check_dog_availability.output)


pythonsensor_example()

Here, the PythonSensor uses the check_dog_availability_func to check if a given API returns a 200 status code. If the API returns a 200 status code, the API response is pushed to XCom and the function returns True, causing the sensor task to be marked as successful. If any other status code is returned, the check_dog_availability_func returns False and the sensor pokes again after the poke_interval has passed.

Sensor best practices

When using sensors, keep the following in mind to avoid potential performance issues:

Always define a meaningful timeout parameter for your sensor. The default for this parameter is seven days, which is a long time for your sensor to be running. When you implement a sensor, consider your use case and how long you expect the sensor to wait, and then set the sensor’s timeout accordingly.
Whenever possible and especially for long-running sensors, use deferrable mode. If no deferrable mode is available, use the reschedule mode. Both of these options help your sensor to not constantly occupy a worker slot. This helps avoid deadlocks in Airflow where sensors take all of the available worker slots.
If your poke_interval is very short (less than about 5 minutes), use the poke mode. Using reschedule mode in this case can overload your scheduler.
Define a meaningful poke_interval based on your use case. There is no need for a task to check a condition every 60 seconds (the default) if you know the total amount of wait time will be 30 minutes.
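
As a rough illustration of the last two points, this sketch estimates how many checks a sensor performs over an expected wait and applies the five-minute rule of thumb from the list above. Both helper functions and the threshold constant are hypothetical and are not part of Airflow. The sketch also ignores deferrable mode, which this guide recommends whenever it is available.

```python
import math

# rule of thumb from this guide, not an Airflow constant
RESCHEDULE_THRESHOLD_S = 5 * 60


def approx_checks(expected_wait_s: float, poke_interval_s: float) -> int:
    """Approximate number of condition checks during the expected wait."""
    if poke_interval_s <= 0:
        raise ValueError("poke_interval_s must be positive")
    return math.ceil(expected_wait_s / poke_interval_s)


def suggest_mode(poke_interval_s: float) -> str:
    """Suggest poke for short intervals and reschedule for longer ones."""
    return "poke" if poke_interval_s < RESCHEDULE_THRESHOLD_S else "reschedule"


if __name__ == "__main__":
    wait = 30 * 60  # the 30 minute wait from the example above
    for interval in (60, 300):
        print(interval, approx_checks(wait, interval), suggest_mode(interval))
```

For a 30 minute wait, the default 60 second interval results in about 30 checks, while a 5 minute interval needs about 6.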

Sensor failure modes

Sensors offer several parameters that define their behavior when an exception is raised within the sensor:

soft_fail=True: If an exception is raised within the task, it is marked as skipped, affecting downstream tasks according to their defined trigger rules.
silent_fail=True: If an exception is raised in the poke method that is not one of: AirflowSensorTimeout, AirflowTaskTimeout, AirflowSkipException or AirflowFailException, the sensor will log the error but continue its execution.
never_fail=True: If the poke method raises any exception, the sensor task is skipped. This parameter is mutually exclusive with soft_fail.
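
The rules above can be read as a small decision table. The sketch below encodes them as worded in this guide. It is not Airflow's source code, and exact behavior can differ between Airflow versions. The exception classes are local stand-ins so that the block runs without Airflow installed, outcome_on_exception is a hypothetical helper, and the order in which the flags are checked when several are set is an assumption.

```python
# local stand-ins for the Airflow exceptions named above
class AirflowSensorTimeout(Exception): ...
class AirflowTaskTimeout(Exception): ...
class AirflowSkipException(Exception): ...
class AirflowFailException(Exception): ...


# exceptions that silent_fail does not swallow, per the list above
NOT_SILENCED = (
    AirflowSensorTimeout,
    AirflowTaskTimeout,
    AirflowSkipException,
    AirflowFailException,
)


def outcome_on_exception(
    exc: Exception,
    soft_fail: bool = False,
    silent_fail: bool = False,
    never_fail: bool = False,
) -> str:
    """Return what happens to the sensor task when poke raises exc."""
    if soft_fail and never_fail:
        raise ValueError("soft_fail and never_fail are mutually exclusive")
    if isinstance(exc, AirflowSkipException):
        return "skipped"
    # assumption: silent_fail is evaluated before the skip flags
    if silent_fail and not isinstance(exc, NOT_SILENCED):
        return "log_and_continue"
    if soft_fail or never_fail:
        return "skipped"
    return "failed"


if __name__ == "__main__":
    print(outcome_on_exception(ValueError("boom"), silent_fail=True))
    print(outcome_on_exception(AirflowSensorTimeout(), soft_fail=True))
```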

Deferrable operators

Deferrable operators (sometimes referred to as asynchronous operators) eliminate the problem of having any operator or sensor using a full worker slot for the entire time they run. Many operators have a deferrable parameter that can be set to True to make the operator deferrable. For the sensors where this parameter is not available, deferrable versions exist in open source Airflow and in the Astronomer Providers package. Astronomer recommends using these in most cases to reduce resource costs.
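
The idea behind deferral can be illustrated with a conceptual sketch in plain asyncio. It does not use the Airflow API: wait_for_condition and run_triggerer are hypothetical names, and the condition is simulated by counting checks. The point is that many waits can share a single event loop instead of each holding its own worker slot.

```python
import asyncio
from typing import List, Tuple


async def wait_for_condition(
    name: str, ready_after_checks: int, poke_interval: float
) -> Tuple[str, int]:
    """Simulated wait: the condition is met on the given check."""
    checks = 0
    while True:
        checks += 1
        if checks >= ready_after_checks:
            return name, checks
        # yields control so that other waits can make progress
        await asyncio.sleep(poke_interval)


async def run_triggerer(waits: List[Tuple[str, int, float]]) -> List[Tuple[str, int]]:
    """Run all waits concurrently in a single event loop."""
    return await asyncio.gather(*(wait_for_condition(*w) for w in waits))


if __name__ == "__main__":
    results = asyncio.run(run_triggerer([("a", 3, 0.01), ("b", 5, 0.01)]))
    print(results)
```

Because the waits run concurrently, the total time is governed by the longest wait, not by the sum of all waits.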

For DAG authors, using deferrable sensors is no different from using regular sensors. All you need to do is run a triggerer process in Airflow and do one of the following:

Set the Airflow config operators.default_deferrable to True to set all sensors with a deferrable parameter to be deferrable by default.
Set the deferrable parameter to True on individual sensor instances you want to run in deferrable mode.
Replace the name of a sensor with its deferrable counterpart if no deferrable parameter is available.

For more details, see Deferrable operators.