Airflow 是Airbnb公司开源的,是用 Python 实现的任务管理、调度、监控工作流的平台。因依调度赖于 crontab ,Airflow 目前只支持在安装在 Linux 系统平台。Airflow 可以用做 data pipeline(ETL)、 消息队列等监控。

airflow官网 airflow-github

安装airflow参考链接

airflow-web前端

删除一些失败记录

登录到airflow的数据库后执行SQL语句

SELECT * FROM `dag_run` WHERE dag_id='adv_data_16min' and state='failed'

DELETE FROM `dag_run` WHERE dag_id='adv_data_16min' and state='failed'

SELECT * FROM `dag_stats` WHERE dag_id='adv_data_16min' and state='failed'

DELETE FROM `dag_stats` WHERE dag_id='adv_data_16min' and state='failed'