Explain what Airflow is, its purpose, core concepts, architecture, and user interface
Create a Directed Acyclic Graph (DAG) to define workflows, set up tasks, and establish dependencies between them
Use Bash, Python, and SQLExecuteQueryOperator; leverage XComs for inter-task communication; and manage variables and conditional branching with TaskGroups
Implement catchup, backfill, and CRON expressions to schedule tasks effectively
Successfully complete practical exercises to reinforce theoretical knowledge and apply it to real-world scenarios
Basic knowledge in Python
Basic knowledge of SQL
Basic knowledge of Command line usage
Alyssa is a Lead Data Engineer with over eight years of experience at Samsung R&D Institute Philippines (SRPH). A graduate of BS in Computer Science from the University of the Philippines Los Baños, Alyssa initially aspiring to be a Frontend Developer. Upon joining SRPH, she was accepted as a Backend Developer, where she gained proficiency in Spring and Java programming. Later, she was moved to a new Big Data project, sparking her journey of exploration and learning in various areas of Big Data Analytics, ultimately leading her to fall in love with Data Engineering. This experience allowed her to explore numerous Big Data technologies, with Python and the Hadoop framework as her primary tools, and gain expertise in cloud services like AWS and GCP for building robust data platforms. With five years of experience using Apache Airflow, she has excelled in workflow orchestration and automation. Today, she is part of a team at SRPH, driving innovation and delivering impactful data.