Airflow 101 Architecture of Apache Airflow

Unlock the power of Apache Airflow with our comprehensive guide to its essential components. Dive into the architecture, functionality, and operational insights to streamline your workflow management process.

What You’ll Learn:

  1. Architecture Overview:
    • Understand the top-level architecture of Apache Airflow and the roles of its four key components.
    • Explore the multi-node setup and how different components interact across nodes.
  2. Web Server:
    • Experience the stateless web application that empowers users with job management capabilities.
    • Learn how to interact with Airflow’s meta-database and monitoring features via the web server interface.
  3. Scheduler:
    • Discover the multi-threaded Python process responsible for scheduling and task execution.
    • Gain insights into how the scheduler interacts with the database and manages task dependencies.
  4. Database:
    • Learn about Airflow’s database usage for storing job details, variables, and connections.
    • Explore recommended database options and their advantages, with a focus on PostgreSQL.
  5. Executor:
    • Understand the role of the executor in task execution and its interaction with the scheduler and workers.
    • Explore different executor types, including the default sequential executor and the recommended Celery executor for production environments.

Please fill in the form to watch the webinar

Note: By filling and submitting this form you understand and agree that the use of Qubole’s website is subject to the General Website Terms of Use. Additional details regarding Qubole’s collection and use of your personal information, including information about access, retention, rectification, deletion, security, cross-border transfers and other topics, is available in the Privacy Policy. If you have any questions regarding the webform language, please contact [email protected].