介绍一下airflow中的表用途

alembic_version		  #
celery_taskmeta       #
celery_tasksetmeta    #
chart                 #
connection            #
dag                   # dag 任务名的存放表
dag_pickle            # 
dag_run               # 
dag_stats             # airflow-web显示所需信息 
import_error          #
job                   #
known_event           #
known_event_type      #
kombu_message         #
kombu_queue           #
log                   # 所以dag日志
sla_miss              #
slot_pool             #
task_fail             # 记录失败的task信息……
task_instance         # 记录成功的task执行的 开始时间,结束时间,执行时间
users                 # airflow认证用户表
variable              #
xcom                  #

删除一个废弃的dag

## 首先删除py脚本文件,很重要

set @dag_id = 'BAD_DAG';
delete from airflow.xcom where dag_id = @dag_id;
delete from airflow.task_instance where dag_id = @dag_id;
delete from airflow.sla_miss where dag_id = @dag_id;
delete from airflow.log where dag_id = @dag_id;
delete from airflow.job where dag_id = @dag_id;
delete from airflow.dag_run where dag_id = @dag_id;
delete from airflow.dag where dag_id = @dag_id;

删除失败的dag统计,让展示更悦目

## web展示的时候使用的中间表,如dag_stats,
这个可以在F12时候看到http://ip:8080/admin/airflow/dag_stats

## 分别统计失败dag_id的次数
SELECT dag_id,count(id) FROM `dag_run` where state = 'failed' GROUP BY dag_id;

## 统计的结果会保存在 dag_stats,具体什么时候更新表,暂时还不太清楚