How to use Blueprint on Astro to write Apache Airflow® Dags in a no-code interface

Public Preview
The no-code UI for Blueprint on Astro is in Public Preview.

Blueprint on Astro lets you build data pipelines from templates provided by your data engineering team. You browse the template library, configure each template using form fields, and connect them into a pipeline through a drag-and-drop interface in the Astro IDE.

Astro IDE Blueprint tab with the visual workflow graph, library panel, and node configuration.

This tutorial walks you through exploring the Blueprint onboarding project and building a new ETL pipeline from three templates to aggregate moon merch sales revenue. No knowledge of Airflow or Python is required.

Step 1a: Sign up for a free trial of Astro

If you don’t have an Astro account yet, sign up for a free trial of Astro, which gives you access to the Astro IDE, including the Blueprint interface. If you already have an Astro account, skip to Step 1b.

  1. In the onboarding flow, after giving your Organization and Workspace names, select Start with a template.

Astro onboarding flow with the "Start with a template" option selected.

  2. Choose the Blueprint template and click Continue.

Astro onboarding flow with the "Blueprint" template selected.

You enter the Astro IDE, where the Blueprint tab opens the no-code interface for defining your pipeline. Continue with Step 2.

Astro IDE with the Blueprint tab selected, the Dag list, and New DAG.

Step 1b: Add the tutorial project to the Astro IDE

If you already have an Astro account, you can add the blueprint tutorial project by navigating to the Astro IDE (1) and then clicking the “Build DAGs visually with Blueprint templates” option.

Astro workspace with Astro IDE in the sidebar and the Build DAGs visually with Blueprint templates card.

Step 2: Explore an existing blueprint pipeline

There are eight pre-existing pipelines in the onboarding project, each using one or more of 11 blueprint templates to accomplish different data engineering tasks.

  1. Click one of the existing pipelines, for example moon_missions_country_stats, to open the drag-and-drop view. It shows the library of blueprint templates (1), the workflow graph (2), and the DAG Properties button (3), where you can modify the schedule (4) on which the pipeline runs.

Astro IDE Blueprint editor with the template library, workflow canvas, DAG Properties, and schedule field.

  2. Click any node in the workflow graph to open a panel on the right where you can change the template's configuration. For example, the quality checks task uses the sql data quality check blueprint to make sure the source data has at least 10 rows.

Astro IDE Blueprint canvas with the quality checks node selected and DAG Properties for min rows and column checks.
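Under the hood, a row-count check like this one boils down to comparing a table's row count against a threshold. The sketch below shows the general idea in plain Python with SQLite; the table name and helper function are illustrative assumptions, not Blueprint's actual implementation:

```python
import sqlite3

def check_min_rows(conn, table: str, min_rows: int) -> None:
    """Raise if the table has fewer than min_rows rows."""
    count = conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    if count < min_rows:
        raise ValueError(f"{table} has {count} rows, expected at least {min_rows}")

# An in-memory table standing in for the source data
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE moon_merch_sales (id INTEGER)")
conn.executemany("INSERT INTO moon_merch_sales VALUES (?)", [(i,) for i in range(12)])
check_min_rows(conn, "moon_merch_sales", 10)  # passes silently: 12 >= 10
```

In a pipeline, a failing check raises an error, which stops downstream tasks from running.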

Step 3: Run the pipeline

To run any pipeline in the Astro IDE, you first need to start a test Deployment.

  1. Click Start Test Deployment in the top right corner of the IDE. The test Deployment might take a few minutes to start.

Astro IDE with the Start Test Deployment button highlighted.

  2. Once the test Deployment has spun up, you can run the Dag. Click the Test tab (1), select the Dag from the dropdown menu (2), and click Run DAG (3) to start a run.

Astro IDE with the Test tab selected, the Dag dropdown menu, and the Run DAG button.

  3. Click + X TASKS on any blueprint node to expand the individual Airflow tasks the blueprint contains and watch them complete. To see a task's output, click the task and then Task Logs in the bar at the bottom of the Astro IDE.

Astro IDE Blueprint view with expanded tasks, a selected task, and Task Logs with tabular output.

If you make any changes to a blueprint pipeline, you need to click Sync to Test to deploy your changes to the test Deployment before running the changed pipeline.

Step 4: Create a new pipeline

Now it is time to use Blueprint to build your own pipeline.

  1. Back on the Blueprint tab, click Blueprint in the breadcrumb to return to the overview of all Blueprint Dags in this project.

Astro IDE Blueprint tab with the Blueprint breadcrumb, workflow canvas, library, and Task Logs panel.

  2. Click + New DAG, give your Dag a name (an ID unique within the project, for example my_first_blueprint_dag), and click Generate DAG.

Astro IDE Blueprint Dag list with New DAG and the onboarding project Dags.

  3. You now have an empty canvas. Drag the extract and aggregate blueprint onto the canvas and configure it in the form on the right. For example, change the GRAIN from month to quarter, which changes how the blueprint aggregates moon merch sales.

Astro IDE Blueprint canvas with extract and aggregate on the canvas, library, and configuration form with source CSV and grain.
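To see what the grain setting changes conceptually: aggregating at a quarterly grain sums revenue per quarter instead of per month. The minimal stdlib Python sketch below illustrates the idea; the row layout and function name are made up for illustration, not the blueprint's internals:

```python
from collections import defaultdict
from datetime import date

# Toy sales rows standing in for the source CSV (columns are illustrative)
sales = [
    (date(2024, 1, 15), 120.0),
    (date(2024, 2, 3), 80.0),
    (date(2024, 5, 20), 200.0),
    (date(2024, 11, 1), 50.0),
]

def aggregate_revenue(rows, grain="quarter"):
    """Sum revenue per period; grain is 'month' or 'quarter'."""
    totals = defaultdict(float)
    for day, revenue in rows:
        if grain == "quarter":
            period = f"{day.year}-Q{(day.month - 1) // 3 + 1}"
        else:
            period = f"{day.year}-{day.month:02d}"
        totals[period] += revenue
    return dict(totals)

print(aggregate_revenue(sales, grain="quarter"))
# {'2024-Q1': 200.0, '2024-Q2': 200.0, '2024-Q4': 50.0}
```

Switching the grain back to month would instead produce one total per calendar month.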

To change a blueprint's form options, or to create entirely new blueprints, see the How to write blueprint templates tutorial. Any action that can be defined in Python code can be part of a blueprint.

  4. Next, add a second blueprint to the pipeline to perform a data quality check. Drag the row count check blueprint from the library onto the canvas and enter the minimum number of rows you expect. At a quarterly grain for one year of data, that is 4.

Astro IDE Blueprint canvas with row count check selected, library, and min rows in the configuration form.

  5. Set the dependency between the blueprints by hovering over the bottom edge of the extract and aggregate node, then clicking and dragging your cursor to the bottom edge of the row count check node. A blue line appears, indicating that the aggregation blueprint runs before the quality check.

Screen recording of drawing a dependency edge from extract and aggregate to row count check on the Blueprint canvas.
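The dependency edge you just drew expresses an ordering constraint: all tasks of the upstream blueprint must finish before the downstream one starts. Conceptually that is a topological ordering, which can be illustrated with Python's standard library (the node names match this tutorial's pipeline; this is not Blueprint's internal code):

```python
from graphlib import TopologicalSorter

# Each key runs only after all of its listed predecessors have finished
dependencies = {
    "extract_and_aggregate": set(),
    "row_count_check": {"extract_and_aggregate"},
}

order = list(TopologicalSorter(dependencies).static_order())
print(order)  # ['extract_and_aggregate', 'row_count_check']
```

Airflow resolves the same kind of ordering for every Dag run, however many nodes and edges the graph has.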

  6. Lastly, add a third blueprint that prints the results after the data quality check passes. Drag the print results blueprint from the library (1) onto the canvas, add a dependency (2), and fill in the source variable field with merch_revenue_by_period, the target variable of the first blueprint.

Astro IDE Blueprint canvas with print results, three connected nodes, library, and source variable in the configuration form.
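Matching the print results blueprint's source variable to the first blueprint's target variable is what lets data flow between templates: one writes a named result, the other reads it. A toy Python sketch of that handshake (the shared dict and function names are illustrative assumptions, not how Blueprint implements it):

```python
# A dict standing in for the pipeline's shared variable store
variables = {}

def extract_and_aggregate():
    # Writes its output to its target variable
    variables["merch_revenue_by_period"] = {"2024-Q1": 200.0, "2024-Q2": 200.0}

def print_results(source_variable):
    # Reads from its configured source variable
    for period, revenue in sorted(variables[source_variable].items()):
        print(f"{period}: {revenue}")

extract_and_aggregate()
print_results("merch_revenue_by_period")
```

If the source variable name doesn't match any upstream target variable, the downstream blueprint has nothing to read, so the names must agree exactly.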

  7. Click Sync to Test to deploy your changes to the test Deployment. After the sync process has finished, you can run your pipeline!

The test Deployment is a fully functional Airflow environment. You can access the regular Airflow UI of your Deployment by clicking on the dropdown arrow next to the Sync to Test button and selecting Open Airflow.

Astro IDE with the Sync to Test menu open showing Test Deployment Details, Open Airflow, and Stop Test Deployment.

Conclusion

Congratulations! You created an Airflow Dag that processes data, performs a data quality check, and prints the results, all without writing any Python code! A good next step is to send the How to write blueprint templates tutorial to your data engineering team so they can write more blueprints for you to use.