介绍一下airflow中的表用途
alembic_version #
celery_taskmeta #
celery_tasksetmeta #
chart #
connection #
dag # dag 任务名的存放表
dag_pickle #
dag_run #
dag_stats # airflow-web显示所需信息
import_error #
job #
known_event #
known_event_type #
kombu_message #
kombu_queue #
log # 所以dag日志
sla_miss #
slot_pool #
task_fail # 记录失败的task信息……
task_instance # 记录成功的task执行的 开始时间,结束时间,执行时间
users # airflow认证用户表
variable #
xcom #
删除一个废弃的dag
## 首先删除py脚本文件,很重要
set @dag_id = 'BAD_DAG';
delete from airflow.xcom where dag_id = @dag_id;
delete from airflow.task_instance where dag_id = @dag_id;
delete from airflow.sla_miss where dag_id = @dag_id;
delete from airflow.log where dag_id = @dag_id;
delete from airflow.job where dag_id = @dag_id;
delete from airflow.dag_run where dag_id = @dag_id;
delete from airflow.dag where dag_id = @dag_id;
删除失败的dag统计,让展示更悦目
## web展示的时候使用的中间表,如dag_stats,
这个可以在F12时候看到http://ip:8080/admin/airflow/dag_stats
## 分别统计失败dag_id的次数
SELECT dag_id,count(id) FROM `dag_run` where state = 'failed' GROUP BY dag_id;
## 统计的结果会保存在 dag_stats,具体什么时候更新表,暂时还不太清楚