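The YOLOv4 release reports that, relative to YOLOv3, average precision (AP) on COCO and frames per second (FPS) improved by 10% and 12% respectively, which is a large part of why YOLOv4 is so popular.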
I. Data processing
1.1 Bag of freebies
Techniques that improve model performance without increasing inference latency.
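1.1.1 Distortion
Photometric distortion: changing the brightness, contrast, saturation, and noise of an image.
Geometric distortion: random scaling, cropping, flipping, and rotation.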
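The two kinds of distortion can be sketched in a few lines of NumPy. This is a minimal illustration, not the YOLOv4 implementation: the function names, the float image range [0, 1], and the (x_min, y_min, x_max, y_max) box layout are all assumptions made for the example, and saturation, scaling, cropping, and rotation are left out for brevity.

```python
import numpy as np

def photometric_distort(image, brightness=0.0, contrast=1.0, noise_std=0.0, rng=None):
    """Adjust brightness and contrast and add Gaussian noise to a float image in [0, 1]."""
    if rng is None:
        rng = np.random.default_rng()
    # Contrast scales the distance from mid-gray; brightness is a plain offset.
    out = (image - 0.5) * contrast + 0.5 + brightness
    if noise_std > 0:
        out = out + rng.normal(0.0, noise_std, size=image.shape)
    return np.clip(out, 0.0, 1.0)

def horizontal_flip(image, boxes):
    """Flip an HxWxC image left-right and mirror its bounding boxes.

    Geometric distortions move pixels, so the box coordinates must move too.
    """
    width = image.shape[1]
    boxes = np.asarray(boxes, dtype=float)
    new_boxes = boxes.copy()
    new_boxes[:, 0] = width - boxes[:, 2]  # new x_min comes from old x_max
    new_boxes[:, 2] = width - boxes[:, 0]  # new x_max comes from old x_min
    return image[:, ::-1], new_boxes
```

Note the asymmetry: photometric changes leave the labels untouched, while every geometric change has to be applied to the boxes as well.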
1.1.2 Image occlusion
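Random Erasing: a data augmentation technique that replaces a region of the image with random values or with the mean pixel value of the training set. It is usually implemented by varying the proportion of the image that is erased and the aspect ratio of the erased region. Functionally, it acts as a regularizer that keeps the model from memorizing the training data and overfitting.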
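A minimal sketch of Random Erasing, assuming a float image in [0, 1]. The function name and the default ranges for the erased area and aspect ratio are illustrative choices for this example, not values taken from the text.

```python
import numpy as np

def random_erase(image, area_range=(0.02, 0.2), aspect_range=(0.3, 3.3),
                 rng=None, max_tries=10):
    """Replace one random rectangle of an HxW(xC) image with uniform random values."""
    if rng is None:
        rng = np.random.default_rng()
    h, w = image.shape[:2]
    out = image.copy()
    for _ in range(max_tries):
        # Sample the erased area (as a fraction of the image) and its aspect ratio.
        area = rng.uniform(*area_range) * h * w
        aspect = rng.uniform(*aspect_range)
        erase_h = int(round(np.sqrt(area * aspect)))
        erase_w = int(round(np.sqrt(area / aspect)))
        if 0 < erase_h <= h and 0 < erase_w <= w:
            top = rng.integers(0, h - erase_h + 1)
            left = rng.integers(0, w - erase_w + 1)
            noise_shape = (erase_h, erase_w) + image.shape[2:]
            out[top:top + erase_h, left:left + erase_w] = rng.uniform(0.0, 1.0, size=noise_shape)
            return out
    return out  # no rectangle fit within max_tries; return the image unchanged
```

Filling with the dataset mean instead of random values only changes the right-hand side of the assignment.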
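Cutout: masks out a square region during training. The masked region is hidden only from the first layer of the CNN. This is very similar to Random Erasing, except that the occluded area is filled with a constant value. The goal is the same: reduce overfitting.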
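A minimal Cutout sketch under the same assumptions as before (NumPy image, hypothetical function name). The square is centered on a random pixel and clipped at the image border, so the masked area can be smaller than `size * size` near the edges.

```python
import numpy as np

def cutout(image, size, fill=0.0, rng=None):
    """Mask a square of side `size`, centered at a random pixel, with a constant value."""
    if rng is None:
        rng = np.random.default_rng()
    h, w = image.shape[:2]
    cy, cx = rng.integers(0, h), rng.integers(0, w)
    # Clip the square to the image so centers near the border stay valid.
    y0, y1 = max(0, cy - size // 2), min(h, cy + size // 2)
    x0, x1 = max(0, cx - size // 2), min(w, cx + size // 2)
    out = image.copy()
    out[y0:y1, x0:x1] = fill
    return out
```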
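GridMask: regions of the image are hidden in a grid pattern. As with Hide and Seek, this forces the model to learn the individual parts that make up an object.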
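One simple way to build such a grid pattern is sketched below. It is a simplification for illustration: the parameter names (`unit`, `keep_ratio`, `offset`) are assumptions, and the random rotation of the grid that some implementations apply is omitted.

```python
import numpy as np

def grid_mask(image, unit=8, keep_ratio=0.5, offset=(0, 0)):
    """Zero out one square per grid cell, producing a regular pattern of holes."""
    h, w = image.shape[:2]
    keep = int(round(unit * keep_ratio))  # pixels kept per cell along each axis
    ys = (np.arange(h) + offset[0]) % unit
    xs = (np.arange(w) + offset[1]) % unit
    # A pixel is dropped when it falls in the trailing part of its cell on both axes.
    drop = (ys[:, None] >= keep) & (xs[None, :] >= keep)
    out = image.copy()
    out[drop] = 0
    return out
```

With `unit=8` and `keep_ratio=0.5`, each 8x8 cell loses a 4x4 square, so a quarter of the image is hidden.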
MixUp: a convex combination of a pair of images and their labels.
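The convex combination can be written directly. This sketch assumes classification-style label vectors; for detection, the box lists of both images would have to be carried along instead. The function names and the Beta(alpha, alpha) sampling helper are illustrative.

```python
import numpy as np

def mixup(image_a, label_a, image_b, label_b, lam):
    """Blend two images and their label vectors with the same weight lam in [0, 1]."""
    image = lam * image_a + (1.0 - lam) * image_b
    label = lam * label_a + (1.0 - lam) * label_b
    return image, label

def sample_lambda(alpha=1.0, rng=None):
    """Draw the mixing weight from a Beta(alpha, alpha) distribution."""
    if rng is None:
        rng = np.random.default_rng()
    return float(rng.beta(alpha, alpha))
```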
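Hide and Seek: divides the image into a grid of S×S patches and hides each patch with some probability (p_hide). This lets the model learn what the object looks like as a whole, rather than learning only what a single part of the object looks like.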
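A minimal sketch of the patch-hiding step. `p_hide` comes from the description above; the function name, the `grid` and `fill` parameters, and returning the boolean mask alongside the image are choices made for this example.

```python
import numpy as np

def hide_and_seek(image, grid=4, p_hide=0.5, fill=0.0, rng=None):
    """Split the image into grid x grid patches and hide each with probability p_hide."""
    if rng is None:
        rng = np.random.default_rng()
    h, w = image.shape[:2]
    ys = np.linspace(0, h, grid + 1).astype(int)  # patch boundaries along height
    xs = np.linspace(0, w, grid + 1).astype(int)  # patch boundaries along width
    hidden = rng.random((grid, grid)) < p_hide    # independent coin flip per patch
    out = image.copy()
    for i in range(grid):
        for j in range(grid):
            if hidden[i, j]:
                out[ys[i]:ys[i + 1], xs[j]:xs[j + 1]] = fill
    return out, hidden
```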
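Mosaic data augmentation: combines 4 training images into 1 at certain ratios. Mosaic is the first new data augmentation technique introduced in YOLOv4. It lets the model learn to recognize objects at a smaller scale than normal. It also encourages the model to localize different types of objects in different parts of the frame.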
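The image-composition part of Mosaic can be sketched as follows. This is a simplified illustration: it assumes every source image is at least as large as the output canvas, crops rather than rescales each image into its quadrant, and does not handle bounding boxes, which in a real pipeline would need to be shifted and clipped to the same quadrants. The function name and parameters are hypothetical.

```python
import numpy as np

def mosaic(images, out_size=(64, 64), center_range=(0.3, 0.7), rng=None):
    """Tile four images around a random center point on one canvas."""
    if len(images) != 4:
        raise ValueError("mosaic expects exactly four images")
    if rng is None:
        rng = np.random.default_rng()
    out_h, out_w = out_size
    # The random center decides how much of the canvas each image receives.
    cy = int(rng.uniform(*center_range) * out_h)
    cx = int(rng.uniform(*center_range) * out_w)
    canvas = np.zeros((out_h, out_w) + images[0].shape[2:], dtype=images[0].dtype)
    regions = [
        (0, cy, 0, cx),          # top-left
        (0, cy, cx, out_w),      # top-right
        (cy, out_h, 0, cx),      # bottom-left
        (cy, out_h, cx, out_w),  # bottom-right
    ]
    for img, (y0, y1, x0, x1) in zip(images, regions):
        canvas[y0:y1, x0:x1] = img[:y1 - y0, :x1 - x0]
    return canvas, (cy, cx)
```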
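CutMix: combines images by cutting a region from one image and pasting it onto the augmented image. Cutting regions out forces the model to base its predictions on a larger set of robust features. Consider the Hide and Seek example above: without cutouts, the model relies heavily on the dog's head to make its prediction, which becomes a problem if we want to correctly recognize a dog whose head is hidden behind a bush. In CutMix, the cut region is replaced with a patch from a second image, together with that image's ground-truth label. The ratio of each image is set during image generation (for example 0.4/0.6). In the figure below, you can see how the CutMix authors show that this technique works better than plain MixUp and Cutout.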
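A minimal CutMix sketch for two same-sized images with classification-style label vectors. The names are illustrative, and recomputing the label weight from the area actually pasted (after clipping at the border) is one common way to keep image and label ratios consistent.

```python
import numpy as np

def cutmix(image_a, label_a, image_b, label_b, lam, rng=None):
    """Paste a random rectangle from image_b into image_a and mix labels by area."""
    if rng is None:
        rng = np.random.default_rng()
    h, w = image_a.shape[:2]
    # Side lengths chosen so the patch covers about (1 - lam) of the image.
    cut_h = int(h * np.sqrt(1.0 - lam))
    cut_w = int(w * np.sqrt(1.0 - lam))
    cy, cx = rng.integers(0, h), rng.integers(0, w)
    y0, y1 = np.clip([cy - cut_h // 2, cy + cut_h // 2], 0, h)
    x0, x1 = np.clip([cx - cut_w // 2, cx + cut_w // 2], 0, w)
    out = image_a.copy()
    out[y0:y1, x0:x1] = image_b[y0:y1, x0:x1]
    # The patch may have been clipped, so derive the label weight from its real area.
    lam_adj = 1.0 - ((y1 - y0) * (x1 - x0)) / (h * w)
    label = lam_adj * label_a + (1.0 - lam_adj) * label_b
    return out, label
```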
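Class label smoothing: not an image processing technique, but an intuitive change to the class labels. Normally, the correct classification of a bounding box is represented as a one-hot vector of classes [0,0,0,1,0,0,…], and the loss is computed against that representation. However, when a model is overly sure of itself, with a prediction close to 1.0, it is often wrong, overfit, and in some way overlooking the complexities of the other predictions. Following this intuition, it is more reasonable to encode the class label so that it reflects some degree of uncertainty. The authors choose 0.9, so [0,0,0,0.9,0,…] represents the correct class.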
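As a small worked example, the smoothed target can be built as below. Spreading the removed probability mass evenly over the other classes is an assumption of this sketch (one common variant); the text above only specifies the 0.9 on the correct class.

```python
def smooth_labels(true_class, num_classes, eps=0.1):
    """Return a soft target: 1 - eps on the true class, eps shared by the other classes."""
    off_value = eps / (num_classes - 1)
    return [1.0 - eps if k == true_class else off_value for k in range(num_classes)]

# With 6 classes and eps = 0.1, the true class gets 0.9 and each other class gets 0.02.
target = smooth_labels(true_class=3, num_classes=6)
```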
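Self-Adversarial Training (SAT): this technique uses the current state of the model to expose its vulnerabilities by transforming the input image. First, the image goes through a normal training step. Then, instead of backpropagating into the weights, the loss signal is used to alter the image in the way that would be most harmful to the model. Later in training, the model is forced to confront this particularly hard example and learn from it. Of the techniques covered here, this one is probably the least intuitive and the closest to the modeling side.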
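The two-stage idea can be illustrated on a toy model. The sketch below uses a single-sample logistic regression and a sign-of-gradient perturbation purely as a stand-in; it is not how YOLOv4 implements SAT, and every name and hyperparameter here (`sat_step`, `epsilon`, `lr`) is an assumption made for the example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss_and_grads(w, x, y):
    """Binary cross-entropy of a logistic model, with gradients w.r.t. weights and input."""
    p = sigmoid(x @ w)
    loss = -(y * np.log(p) + (1 - y) * np.log(1 - p))
    grad_w = (p - y) * x  # used for ordinary training
    grad_x = (p - y) * w  # used to alter the input instead of the weights
    return loss, grad_w, grad_x

def sat_step(w, x, y, epsilon=0.1, lr=0.5):
    # Stage 1: backpropagate to the input and push it in the loss-increasing direction.
    _, _, grad_x = loss_and_grads(w, x, y)
    x_adv = x + epsilon * np.sign(grad_x)
    # Stage 2: an ordinary gradient step on the weights, using the altered input.
    _, grad_w, _ = loss_and_grads(w, x_adv, y)
    return w - lr * grad_w, x_adv
```

The weights are frozen in stage 1 and the input is frozen in stage 2, which is the alternation the paragraph above describes.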
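II. YOLO network model
1. Backbone feature extraction network: DarkNet53 => CSPDarkNet53
2. Feature pyramid: SPP, PAN
3. Classification and regression layer:
4. Training tricks: Mosaic data augmentation, label smoothing, CIoU, cosine annealing learning rate decay
5. Activation function: Mish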
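Three of the items in the list above have compact closed forms: the Mish activation, the cosine annealing schedule, and the CIoU score used for box regression (the loss is typically 1 - CIoU). The sketch below writes them out in plain Python; the function names, the (x1, y1, x2, y2) box layout, and the small `eps` guards are assumptions made for this example.

```python
import math

def mish(x):
    """Mish activation: x * tanh(softplus(x))."""
    softplus = x if x > 20 else math.log1p(math.exp(x))  # avoid overflow for large x
    return x * math.tanh(softplus)

def cosine_lr(step, total_steps, lr_max, lr_min=0.0):
    """Cosine annealing: decay from lr_max to lr_min over total_steps."""
    return lr_min + 0.5 * (lr_max - lr_min) * (1 + math.cos(math.pi * step / total_steps))

def ciou(box_a, box_b, eps=1e-9):
    """Complete IoU of two (x1, y1, x2, y2) boxes: IoU - center term - aspect term."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    iou = inter / (union + eps)
    # Squared distance between box centers, normalized by the enclosing box diagonal.
    rho2 = ((ax1 + ax2) - (bx1 + bx2)) ** 2 / 4 + ((ay1 + ay2) - (by1 + by2)) ** 2 / 4
    c2 = (max(ax2, bx2) - min(ax1, bx1)) ** 2 + (max(ay2, by2) - min(ay1, by1)) ** 2 + eps
    # Aspect-ratio consistency term and its trade-off weight.
    v = (4 / math.pi ** 2) * (
        math.atan((bx2 - bx1) / (by2 - by1 + eps)) - math.atan((ax2 - ax1) / (ay2 - ay1 + eps))
    ) ** 2
    alpha = v / (1 - iou + v + eps)
    return iou - rho2 / c2 - alpha * v
```

Unlike plain IoU, CIoU stays informative for boxes that do not overlap, because the center-distance term still changes as the boxes move closer.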
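The network model consists of convolutional layers, pooling layers, and fully connected layers.
Evaluation metrics:
Precision
Recall
Detection speed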
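Precision and recall follow directly from the counts of true positives (TP), false positives (FP), and false negatives (FN). The helper below is a minimal sketch with a hypothetical name; deciding which detections count as true positives (for example by an IoU threshold against the ground truth) is outside its scope.

```python
def precision_recall(tp, fp, fn):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    return precision, recall

# Example: 8 correct detections, 2 spurious detections, 4 missed objects.
precision, recall = precision_recall(tp=8, fp=2, fn=4)
```

In this example precision is 8/10 and recall is 8/12: the detector is right most of the time it fires, but misses a third of the objects.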
__________________________________________________________________________________________________
Layer (type) Output Shape Param # Connected to
==================================================================================================
input_1 (InputLayer) (None, 416, 416, 3) 0
__________________________________________________________________________________________________
conv2d_1 (Conv2D) (None, 416, 416, 32) 864 input_1[0][0]
__________________________________________________________________________________________________
batch_normalization_1 (BatchNor (None, 416, 416, 32) 128 conv2d_1[0][0]
__________________________________________________________________________________________________
mish_1 (Mish) (None, 416, 416, 32) 0 batch_normalization_1[0][0]
__________________________________________________________________________________________________
zero_padding2d_1 (ZeroPadding2D (None, 417, 417, 32) 0 mish_1[0][0]
__________________________________________________________________________________________________
conv2d_2 (Conv2D) (None, 208, 208, 64) 18432 zero_padding2d_1[0][0]
__________________________________________________________________________________________________
batch_normalization_2 (BatchNor (None, 208, 208, 64) 256 conv2d_2[0][0]
__________________________________________________________________________________________________
mish_2 (Mish) (None, 208, 208, 64) 0 batch_normalization_2[0][0]
__________________________________________________________________________________________________
conv2d_4 (Conv2D) (None, 208, 208, 64) 4096 mish_2[0][0]
__________________________________________________________________________________________________
batch_normalization_4 (BatchNor (None, 208, 208, 64) 256 conv2d_4[0][0]
__________________________________________________________________________________________________
mish_4 (Mish) (None, 208, 208, 64) 0 batch_normalization_4[0][0]
__________________________________________________________________________________________________
conv2d_5 (Conv2D) (None, 208, 208, 32) 2048 mish_4[0][0]
__________________________________________________________________________________________________
batch_normalization_5 (BatchNor (None, 208, 208, 32) 128 conv2d_5[0][0]
__________________________________________________________________________________________________
mish_5 (Mish) (None, 208, 208, 32) 0 batch_normalization_5[0][0]
__________________________________________________________________________________________________
conv2d_6 (Conv2D) (None, 208, 208, 64) 18432 mish_5[0][0]
__________________________________________________________________________________________________
batch_normalization_6 (BatchNor (None, 208, 208, 64) 256 conv2d_6[0][0]
__________________________________________________________________________________________________
mish_6 (Mish) (None, 208, 208, 64) 0 batch_normalization_6[0][0]
__________________________________________________________________________________________________
add_1 (Add) (None, 208, 208, 64) 0 mish_4[0][0]
mish_6[0][0]
__________________________________________________________________________________________________
conv2d_7 (Conv2D) (None, 208, 208, 64) 4096 add_1[0][0]
__________________________________________________________________________________________________
conv2d_3 (Conv2D) (None, 208, 208, 64) 4096 mish_2[0][0]
__________________________________________________________________________________________________
batch_normalization_7 (BatchNor (None, 208, 208, 64) 256 conv2d_7[0][0]
__________________________________________________________________________________________________
batch_normalization_3 (BatchNor (None, 208, 208, 64) 256 conv2d_3[0][0]
__________________________________________________________________________________________________
mish_7 (Mish) (None, 208, 208, 64) 0 batch_normalization_7[0][0]
__________________________________________________________________________________________________
mish_3 (Mish) (None, 208, 208, 64) 0 batch_normalization_3[0][0]
__________________________________________________________________________________________________
concatenate_1 (Concatenate) (None, 208, 208, 128 0 mish_7[0][0]
mish_3[0][0]
__________________________________________________________________________________________________
conv2d_8 (Conv2D) (None, 208, 208, 64) 8192 concatenate_1[0][0]
__________________________________________________________________________________________________
batch_normalization_8 (BatchNor (None, 208, 208, 64) 256 conv2d_8[0][0]
__________________________________________________________________________________________________
mish_8 (Mish) (None, 208, 208, 64) 0 batch_normalization_8[0][0]
__________________________________________________________________________________________________
zero_padding2d_2 (ZeroPadding2D (None, 209, 209, 64) 0 mish_8[0][0]
__________________________________________________________________________________________________
conv2d_9 (Conv2D) (None, 104, 104, 128 73728 zero_padding2d_2[0][0]
__________________________________________________________________________________________________
batch_normalization_9 (BatchNor (None, 104, 104, 128 512 conv2d_9[0][0]
__________________________________________________________________________________________________
mish_9 (Mish) (None, 104, 104, 128 0 batch_normalization_9[0][0]
__________________________________________________________________________________________________
conv2d_11 (Conv2D) (None, 104, 104, 64) 8192 mish_9[0][0]
__________________________________________________________________________________________________
batch_normalization_11 (BatchNo (None, 104, 104, 64) 256 conv2d_11[0][0]
__________________________________________________________________________________________________
mish_11 (Mish) (None, 104, 104, 64) 0 batch_normalization_11[0][0]
__________________________________________________________________________________________________
conv2d_12 (Conv2D) (None, 104, 104, 64) 4096 mish_11[0][0]
__________________________________________________________________________________________________
batch_normalization_12 (BatchNo (None, 104, 104, 64) 256 conv2d_12[0][0]
__________________________________________________________________________________________________
mish_12 (Mish) (None, 104, 104, 64) 0 batch_normalization_12[0][0]
__________________________________________________________________________________________________
conv2d_13 (Conv2D) (None, 104, 104, 64) 36864 mish_12[0][0]
__________________________________________________________________________________________________
batch_normalization_13 (BatchNo (None, 104, 104, 64) 256 conv2d_13[0][0]
__________________________________________________________________________________________________
mish_13 (Mish) (None, 104, 104, 64) 0 batch_normalization_13[0][0]
__________________________________________________________________________________________________
add_2 (Add) (None, 104, 104, 64) 0 mish_11[0][0]
mish_13[0][0]
__________________________________________________________________________________________________
conv2d_14 (Conv2D) (None, 104, 104, 64) 4096 add_2[0][0]
__________________________________________________________________________________________________
batch_normalization_14 (BatchNo (None, 104, 104, 64) 256 conv2d_14[0][0]
__________________________________________________________________________________________________
mish_14 (Mish) (None, 104, 104, 64) 0 batch_normalization_14[0][0]
__________________________________________________________________________________________________
conv2d_15 (Conv2D) (None, 104, 104, 64) 36864 mish_14[0][0]
__________________________________________________________________________________________________
batch_normalization_15 (BatchNo (None, 104, 104, 64) 256 conv2d_15[0][0]
__________________________________________________________________________________________________
mish_15 (Mish) (None, 104, 104, 64) 0 batch_normalization_15[0][0]
__________________________________________________________________________________________________
add_3 (Add) (None, 104, 104, 64) 0 add_2[0][0]
mish_15[0][0]
__________________________________________________________________________________________________
conv2d_16 (Conv2D) (None, 104, 104, 64) 4096 add_3[0][0]
__________________________________________________________________________________________________
conv2d_10 (Conv2D) (None, 104, 104, 64) 8192 mish_9[0][0]
__________________________________________________________________________________________________
batch_normalization_16 (BatchNo (None, 104, 104, 64) 256 conv2d_16[0][0]
__________________________________________________________________________________________________
batch_normalization_10 (BatchNo (None, 104, 104, 64) 256 conv2d_10[0][0]
__________________________________________________________________________________________________
mish_16 (Mish) (None, 104, 104, 64) 0 batch_normalization_16[0][0]
__________________________________________________________________________________________________
mish_10 (Mish) (None, 104, 104, 64) 0 batch_normalization_10[0][0]
__________________________________________________________________________________________________
concatenate_2 (Concatenate) (None, 104, 104, 128 0 mish_16[0][0]
mish_10[0][0]
__________________________________________________________________________________________________
conv2d_17 (Conv2D) (None, 104, 104, 128 16384 concatenate_2[0][0]
__________________________________________________________________________________________________
batch_normalization_17 (BatchNo (None, 104, 104, 128 512 conv2d_17[0][0]
__________________________________________________________________________________________________
mish_17 (Mish) (None, 104, 104, 128 0 batch_normalization_17[0][0]
__________________________________________________________________________________________________
zero_padding2d_3 (ZeroPadding2D (None, 105, 105, 128 0 mish_17[0][0]
__________________________________________________________________________________________________
conv2d_18 (Conv2D) (None, 52, 52, 256) 294912 zero_padding2d_3[0][0]
__________________________________________________________________________________________________
batch_normalization_18 (BatchNo (None, 52, 52, 256) 1024 conv2d_18[0][0]
__________________________________________________________________________________________________
mish_18 (Mish) (None, 52, 52, 256) 0 batch_normalization_18[0][0]
__________________________________________________________________________________________________
conv2d_20 (Conv2D) (None, 52, 52, 128) 32768 mish_18[0][0]
__________________________________________________________________________________________________
batch_normalization_20 (BatchNo (None, 52, 52, 128) 512 conv2d_20[0][0]
__________________________________________________________________________________________________
mish_20 (Mish) (None, 52, 52, 128) 0 batch_normalization_20[0][0]
__________________________________________________________________________________________________
conv2d_21 (Conv2D) (None, 52, 52, 128) 16384 mish_20[0][0]
__________________________________________________________________________________________________
batch_normalization_21 (BatchNo (None, 52, 52, 128) 512 conv2d_21[0][0]
__________________________________________________________________________________________________
mish_21 (Mish) (None, 52, 52, 128) 0 batch_normalization_21[0][0]
__________________________________________________________________________________________________
conv2d_22 (Conv2D) (None, 52, 52, 128) 147456 mish_21[0][0]
__________________________________________________________________________________________________
batch_normalization_22 (BatchNo (None, 52, 52, 128) 512 conv2d_22[0][0]
__________________________________________________________________________________________________
mish_22 (Mish) (None, 52, 52, 128) 0 batch_normalization_22[0][0]
__________________________________________________________________________________________________
add_4 (Add) (None, 52, 52, 128) 0 mish_20[0][0]
mish_22[0][0]
__________________________________________________________________________________________________
conv2d_23 (Conv2D) (None, 52, 52, 128) 16384 add_4[0][0]
__________________________________________________________________________________________________
batch_normalization_23 (BatchNo (None, 52, 52, 128) 512 conv2d_23[0][0]
__________________________________________________________________________________________________
mish_23 (Mish) (None, 52, 52, 128) 0 batch_normalization_23[0][0]
__________________________________________________________________________________________________
conv2d_24 (Conv2D) (None, 52, 52, 128) 147456 mish_23[0][0]
__________________________________________________________________________________________________
batch_normalization_24 (BatchNo (None, 52, 52, 128) 512 conv2d_24[0][0]
__________________________________________________________________________________________________
mish_24 (Mish) (None, 52, 52, 128) 0 batch_normalization_24[0][0]
__________________________________________________________________________________________________
add_5 (Add) (None, 52, 52, 128) 0 add_4[0][0]
mish_24[0][0]
__________________________________________________________________________________________________
conv2d_25 (Conv2D) (None, 52, 52, 128) 16384 add_5[0][0]
__________________________________________________________________________________________________
batch_normalization_25 (BatchNo (None, 52, 52, 128) 512 conv2d_25[0][0]
__________________________________________________________________________________________________
mish_25 (Mish) (None, 52, 52, 128) 0 batch_normalization_25[0][0]
__________________________________________________________________________________________________
conv2d_26 (Conv2D) (None, 52, 52, 128) 147456 mish_25[0][0]
__________________________________________________________________________________________________
batch_normalization_26 (BatchNo (None, 52, 52, 128) 512 conv2d_26[0][0]
__________________________________________________________________________________________________
mish_26 (Mish) (None, 52, 52, 128) 0 batch_normalization_26[0][0]
__________________________________________________________________________________________________
add_6 (Add) (None, 52, 52, 128) 0 add_5[0][0]
mish_26[0][0]
__________________________________________________________________________________________________
conv2d_27 (Conv2D) (None, 52, 52, 128) 16384 add_6[0][0]
__________________________________________________________________________________________________
batch_normalization_27 (BatchNo (None, 52, 52, 128) 512 conv2d_27[0][0]
__________________________________________________________________________________________________
mish_27 (Mish) (None, 52, 52, 128) 0 batch_normalization_27[0][0]
__________________________________________________________________________________________________
conv2d_28 (Conv2D) (None, 52, 52, 128) 147456 mish_27[0][0]
__________________________________________________________________________________________________
batch_normalization_28 (BatchNo (None, 52, 52, 128) 512 conv2d_28[0][0]
__________________________________________________________________________________________________
mish_28 (Mish) (None, 52, 52, 128) 0 batch_normalization_28[0][0]
__________________________________________________________________________________________________
add_7 (Add) (None, 52, 52, 128) 0 add_6[0][0]
mish_28[0][0]
__________________________________________________________________________________________________
conv2d_29 (Conv2D) (None, 52, 52, 128) 16384 add_7[0][0]
__________________________________________________________________________________________________
batch_normalization_29 (BatchNo (None, 52, 52, 128) 512 conv2d_29[0][0]
__________________________________________________________________________________________________
mish_29 (Mish) (None, 52, 52, 128) 0 batch_normalization_29[0][0]
__________________________________________________________________________________________________
conv2d_30 (Conv2D) (None, 52, 52, 128) 147456 mish_29[0][0]
__________________________________________________________________________________________________
batch_normalization_30 (BatchNo (None, 52, 52, 128) 512 conv2d_30[0][0]
__________________________________________________________________________________________________
mish_30 (Mish) (None, 52, 52, 128) 0 batch_normalization_30[0][0]
__________________________________________________________________________________________________
add_8 (Add) (None, 52, 52, 128) 0 add_7[0][0]
mish_30[0][0]
__________________________________________________________________________________________________
conv2d_31 (Conv2D) (None, 52, 52, 128) 16384 add_8[0][0]
__________________________________________________________________________________________________
batch_normalization_31 (BatchNo (None, 52, 52, 128) 512 conv2d_31[0][0]
__________________________________________________________________________________________________
mish_31 (Mish) (None, 52, 52, 128) 0 batch_normalization_31[0][0]
__________________________________________________________________________________________________
conv2d_32 (Conv2D) (None, 52, 52, 128) 147456 mish_31[0][0]
__________________________________________________________________________________________________
batch_normalization_32 (BatchNo (None, 52, 52, 128) 512 conv2d_32[0][0]
__________________________________________________________________________________________________
mish_32 (Mish) (None, 52, 52, 128) 0 batch_normalization_32[0][0]
__________________________________________________________________________________________________
add_9 (Add) (None, 52, 52, 128) 0 add_8[0][0]
mish_32[0][0]
__________________________________________________________________________________________________
conv2d_33 (Conv2D) (None, 52, 52, 128) 16384 add_9[0][0]
__________________________________________________________________________________________________
batch_normalization_33 (BatchNo (None, 52, 52, 128) 512 conv2d_33[0][0]
__________________________________________________________________________________________________
mish_33 (Mish) (None, 52, 52, 128) 0 batch_normalization_33[0][0]
__________________________________________________________________________________________________
conv2d_34 (Conv2D) (None, 52, 52, 128) 147456 mish_33[0][0]
__________________________________________________________________________________________________
batch_normalization_34 (BatchNo (None, 52, 52, 128) 512 conv2d_34[0][0]
__________________________________________________________________________________________________
mish_34 (Mish) (None, 52, 52, 128) 0 batch_normalization_34[0][0]
__________________________________________________________________________________________________
add_10 (Add) (None, 52, 52, 128) 0 add_9[0][0]
mish_34[0][0]
__________________________________________________________________________________________________
conv2d_35 (Conv2D) (None, 52, 52, 128) 16384 add_10[0][0]
__________________________________________________________________________________________________
batch_normalization_35 (BatchNo (None, 52, 52, 128) 512 conv2d_35[0][0]
__________________________________________________________________________________________________
mish_35 (Mish) (None, 52, 52, 128) 0 batch_normalization_35[0][0]
__________________________________________________________________________________________________
conv2d_36 (Conv2D) (None, 52, 52, 128) 147456 mish_35[0][0]
__________________________________________________________________________________________________
batch_normalization_36 (BatchNo (None, 52, 52, 128) 512 conv2d_36[0][0]
__________________________________________________________________________________________________
mish_36 (Mish) (None, 52, 52, 128) 0 batch_normalization_36[0][0]
__________________________________________________________________________________________________
add_11 (Add) (None, 52, 52, 128) 0 add_10[0][0]
mish_36[0][0]
__________________________________________________________________________________________________
conv2d_37 (Conv2D) (None, 52, 52, 128) 16384 add_11[0][0]
__________________________________________________________________________________________________
conv2d_19 (Conv2D) (None, 52, 52, 128) 32768 mish_18[0][0]
__________________________________________________________________________________________________
batch_normalization_37 (BatchNo (None, 52, 52, 128) 512 conv2d_37[0][0]
__________________________________________________________________________________________________
batch_normalization_19 (BatchNo (None, 52, 52, 128) 512 conv2d_19[0][0]
__________________________________________________________________________________________________
mish_37 (Mish) (None, 52, 52, 128) 0 batch_normalization_37[0][0]
__________________________________________________________________________________________________
mish_19 (Mish) (None, 52, 52, 128) 0 batch_normalization_19[0][0]
__________________________________________________________________________________________________
concatenate_3 (Concatenate) (None, 52, 52, 256) 0 mish_37[0][0]
mish_19[0][0]
__________________________________________________________________________________________________
conv2d_38 (Conv2D) (None, 52, 52, 256) 65536 concatenate_3[0][0]
__________________________________________________________________________________________________
batch_normalization_38 (BatchNo (None, 52, 52, 256) 1024 conv2d_38[0][0]
__________________________________________________________________________________________________
mish_38 (Mish) (None, 52, 52, 256) 0 batch_normalization_38[0][0]
__________________________________________________________________________________________________
zero_padding2d_4 (ZeroPadding2D (None, 53, 53, 256) 0 mish_38[0][0]
__________________________________________________________________________________________________
conv2d_39 (Conv2D) (None, 26, 26, 512) 1179648 zero_padding2d_4[0][0]
__________________________________________________________________________________________________
batch_normalization_39 (BatchNo (None, 26, 26, 512) 2048 conv2d_39[0][0]
__________________________________________________________________________________________________
mish_39 (Mish) (None, 26, 26, 512) 0 batch_normalization_39[0][0]
__________________________________________________________________________________________________
conv2d_41 (Conv2D) (None, 26, 26, 256) 131072 mish_39[0][0]
__________________________________________________________________________________________________
batch_normalization_41 (BatchNo (None, 26, 26, 256) 1024 conv2d_41[0][0]
__________________________________________________________________________________________________
mish_41 (Mish) (None, 26, 26, 256) 0 batch_normalization_41[0][0]
__________________________________________________________________________________________________
conv2d_42 (Conv2D) (None, 26, 26, 256) 65536 mish_41[0][0]
__________________________________________________________________________________________________
batch_normalization_42 (BatchNo (None, 26, 26, 256) 1024 conv2d_42[0][0]
__________________________________________________________________________________________________
mish_42 (Mish) (None, 26, 26, 256) 0 batch_normalization_42[0][0]
__________________________________________________________________________________________________
conv2d_43 (Conv2D) (None, 26, 26, 256) 589824 mish_42[0][0]
__________________________________________________________________________________________________
batch_normalization_43 (BatchNo (None, 26, 26, 256) 1024 conv2d_43[0][0]
__________________________________________________________________________________________________
mish_43 (Mish) (None, 26, 26, 256) 0 batch_normalization_43[0][0]
__________________________________________________________________________________________________
add_12 (Add) (None, 26, 26, 256) 0 mish_41[0][0]
mish_43[0][0]
__________________________________________________________________________________________________
conv2d_44 (Conv2D) (None, 26, 26, 256) 65536 add_12[0][0]
__________________________________________________________________________________________________
batch_normalization_44 (BatchNo (None, 26, 26, 256) 1024 conv2d_44[0][0]
__________________________________________________________________________________________________
mish_44 (Mish) (None, 26, 26, 256) 0 batch_normalization_44[0][0]
__________________________________________________________________________________________________
conv2d_45 (Conv2D) (None, 26, 26, 256) 589824 mish_44[0][0]
__________________________________________________________________________________________________
batch_normalization_45 (BatchNo (None, 26, 26, 256) 1024 conv2d_45[0][0]
__________________________________________________________________________________________________
mish_45 (Mish) (None, 26, 26, 256) 0 batch_normalization_45[0][0]
__________________________________________________________________________________________________
add_13 (Add) (None, 26, 26, 256) 0 add_12[0][0]
mish_45[0][0]
__________________________________________________________________________________________________
conv2d_46 (Conv2D) (None, 26, 26, 256) 65536 add_13[0][0]
__________________________________________________________________________________________________
batch_normalization_46 (BatchNo (None, 26, 26, 256) 1024 conv2d_46[0][0]
__________________________________________________________________________________________________
mish_46 (Mish) (None, 26, 26, 256) 0 batch_normalization_46[0][0]
__________________________________________________________________________________________________
conv2d_47 (Conv2D) (None, 26, 26, 256) 589824 mish_46[0][0]
__________________________________________________________________________________________________
batch_normalization_47 (BatchNo (None, 26, 26, 256) 1024 conv2d_47[0][0]
__________________________________________________________________________________________________
mish_47 (Mish) (None, 26, 26, 256) 0 batch_normalization_47[0][0]
__________________________________________________________________________________________________
add_14 (Add) (None, 26, 26, 256) 0 add_13[0][0]
mish_47[0][0]
__________________________________________________________________________________________________
conv2d_48 (Conv2D) (None, 26, 26, 256) 65536 add_14[0][0]
__________________________________________________________________________________________________
batch_normalization_48 (BatchNo (None, 26, 26, 256) 1024 conv2d_48[0][0]
__________________________________________________________________________________________________
mish_48 (Mish) (None, 26, 26, 256) 0 batch_normalization_48[0][0]
__________________________________________________________________________________________________
conv2d_49 (Conv2D) (None, 26, 26, 256) 589824 mish_48[0][0]
__________________________________________________________________________________________________
batch_normalization_49 (BatchNo (None, 26, 26, 256) 1024 conv2d_49[0][0]
__________________________________________________________________________________________________
mish_49 (Mish) (None, 26, 26, 256) 0 batch_normalization_49[0][0]
__________________________________________________________________________________________________
add_15 (Add) (None, 26, 26, 256) 0 add_14[0][0]
mish_49[0][0]
__________________________________________________________________________________________________
conv2d_50 (Conv2D) (None, 26, 26, 256) 65536 add_15[0][0]
__________________________________________________________________________________________________
batch_normalization_50 (BatchNo (None, 26, 26, 256) 1024 conv2d_50[0][0]
__________________________________________________________________________________________________
mish_50 (Mish) (None, 26, 26, 256) 0 batch_normalization_50[0][0]
__________________________________________________________________________________________________
conv2d_51 (Conv2D) (None, 26, 26, 256) 589824 mish_50[0][0]
__________________________________________________________________________________________________
batch_normalization_51 (BatchNo (None, 26, 26, 256) 1024 conv2d_51[0][0]
__________________________________________________________________________________________________
mish_51 (Mish) (None, 26, 26, 256) 0 batch_normalization_51[0][0]
__________________________________________________________________________________________________
add_16 (Add) (None, 26, 26, 256) 0 add_15[0][0]
mish_51[0][0]
__________________________________________________________________________________________________
conv2d_52 (Conv2D) (None, 26, 26, 256) 65536 add_16[0][0]
__________________________________________________________________________________________________
batch_normalization_52 (BatchNo (None, 26, 26, 256) 1024 conv2d_52[0][0]
__________________________________________________________________________________________________
mish_52 (Mish) (None, 26, 26, 256) 0 batch_normalization_52[0][0]
__________________________________________________________________________________________________
conv2d_53 (Conv2D) (None, 26, 26, 256) 589824 mish_52[0][0]
__________________________________________________________________________________________________
batch_normalization_53 (BatchNo (None, 26, 26, 256) 1024 conv2d_53[0][0]
__________________________________________________________________________________________________
mish_53 (Mish) (None, 26, 26, 256) 0 batch_normalization_53[0][0]
__________________________________________________________________________________________________
add_17 (Add) (None, 26, 26, 256) 0 add_16[0][0]
mish_53[0][0]
__________________________________________________________________________________________________
conv2d_54 (Conv2D) (None, 26, 26, 256) 65536 add_17[0][0]
__________________________________________________________________________________________________
batch_normalization_54 (BatchNo (None, 26, 26, 256) 1024 conv2d_54[0][0]
__________________________________________________________________________________________________
mish_54 (Mish) (None, 26, 26, 256) 0 batch_normalization_54[0][0]
__________________________________________________________________________________________________
conv2d_55 (Conv2D) (None, 26, 26, 256) 589824 mish_54[0][0]
__________________________________________________________________________________________________
batch_normalization_55 (BatchNo (None, 26, 26, 256) 1024 conv2d_55[0][0]
__________________________________________________________________________________________________
mish_55 (Mish) (None, 26, 26, 256) 0 batch_normalization_55[0][0]
__________________________________________________________________________________________________
add_18 (Add) (None, 26, 26, 256) 0 add_17[0][0]
mish_55[0][0]
__________________________________________________________________________________________________
conv2d_56 (Conv2D) (None, 26, 26, 256) 65536 add_18[0][0]
__________________________________________________________________________________________________
batch_normalization_56 (BatchNo (None, 26, 26, 256) 1024 conv2d_56[0][0]
__________________________________________________________________________________________________
mish_56 (Mish) (None, 26, 26, 256) 0 batch_normalization_56[0][0]
__________________________________________________________________________________________________
conv2d_57 (Conv2D) (None, 26, 26, 256) 589824 mish_56[0][0]
__________________________________________________________________________________________________
batch_normalization_57 (BatchNo (None, 26, 26, 256) 1024 conv2d_57[0][0]
__________________________________________________________________________________________________
mish_57 (Mish) (None, 26, 26, 256) 0 batch_normalization_57[0][0]
__________________________________________________________________________________________________
add_19 (Add) (None, 26, 26, 256) 0 add_18[0][0]
mish_57[0][0]
__________________________________________________________________________________________________
conv2d_58 (Conv2D) (None, 26, 26, 256) 65536 add_19[0][0]
__________________________________________________________________________________________________
conv2d_40 (Conv2D) (None, 26, 26, 256) 131072 mish_39[0][0]
__________________________________________________________________________________________________
batch_normalization_58 (BatchNo (None, 26, 26, 256) 1024 conv2d_58[0][0]
__________________________________________________________________________________________________
batch_normalization_40 (BatchNo (None, 26, 26, 256) 1024 conv2d_40[0][0]
__________________________________________________________________________________________________
mish_58 (Mish) (None, 26, 26, 256) 0 batch_normalization_58[0][0]
__________________________________________________________________________________________________
mish_40 (Mish) (None, 26, 26, 256) 0 batch_normalization_40[0][0]
__________________________________________________________________________________________________
concatenate_4 (Concatenate) (None, 26, 26, 512) 0 mish_58[0][0]
mish_40[0][0]
__________________________________________________________________________________________________
conv2d_59 (Conv2D) (None, 26, 26, 512) 262144 concatenate_4[0][0]
__________________________________________________________________________________________________
batch_normalization_59 (BatchNo (None, 26, 26, 512) 2048 conv2d_59[0][0]
__________________________________________________________________________________________________
mish_59 (Mish) (None, 26, 26, 512) 0 batch_normalization_59[0][0]
__________________________________________________________________________________________________
zero_padding2d_5 (ZeroPadding2D (None, 27, 27, 512) 0 mish_59[0][0]
__________________________________________________________________________________________________
conv2d_60 (Conv2D) (None, 13, 13, 1024) 4718592 zero_padding2d_5[0][0]
__________________________________________________________________________________________________
batch_normalization_60 (BatchNo (None, 13, 13, 1024) 4096 conv2d_60[0][0]
__________________________________________________________________________________________________
mish_60 (Mish) (None, 13, 13, 1024) 0 batch_normalization_60[0][0]
__________________________________________________________________________________________________
conv2d_62 (Conv2D) (None, 13, 13, 512) 524288 mish_60[0][0]
__________________________________________________________________________________________________
batch_normalization_62 (BatchNo (None, 13, 13, 512) 2048 conv2d_62[0][0]
__________________________________________________________________________________________________
mish_62 (Mish) (None, 13, 13, 512) 0 batch_normalization_62[0][0]
__________________________________________________________________________________________________
conv2d_63 (Conv2D) (None, 13, 13, 512) 262144 mish_62[0][0]
__________________________________________________________________________________________________
batch_normalization_63 (BatchNo (None, 13, 13, 512) 2048 conv2d_63[0][0]
__________________________________________________________________________________________________
mish_63 (Mish) (None, 13, 13, 512) 0 batch_normalization_63[0][0]
__________________________________________________________________________________________________
conv2d_64 (Conv2D) (None, 13, 13, 512) 2359296 mish_63[0][0]
__________________________________________________________________________________________________
batch_normalization_64 (BatchNo (None, 13, 13, 512) 2048 conv2d_64[0][0]
__________________________________________________________________________________________________
mish_64 (Mish) (None, 13, 13, 512) 0 batch_normalization_64[0][0]
__________________________________________________________________________________________________
add_20 (Add) (None, 13, 13, 512) 0 mish_62[0][0]
mish_64[0][0]
__________________________________________________________________________________________________
conv2d_65 (Conv2D) (None, 13, 13, 512) 262144 add_20[0][0]
__________________________________________________________________________________________________
batch_normalization_65 (BatchNo (None, 13, 13, 512) 2048 conv2d_65[0][0]
__________________________________________________________________________________________________
mish_65 (Mish) (None, 13, 13, 512) 0 batch_normalization_65[0][0]
__________________________________________________________________________________________________
conv2d_66 (Conv2D) (None, 13, 13, 512) 2359296 mish_65[0][0]
__________________________________________________________________________________________________
batch_normalization_66 (BatchNo (None, 13, 13, 512) 2048 conv2d_66[0][0]
__________________________________________________________________________________________________
mish_66 (Mish) (None, 13, 13, 512) 0 batch_normalization_66[0][0]
__________________________________________________________________________________________________
add_21 (Add) (None, 13, 13, 512) 0 add_20[0][0]
mish_66[0][0]
__________________________________________________________________________________________________
conv2d_67 (Conv2D) (None, 13, 13, 512) 262144 add_21[0][0]
__________________________________________________________________________________________________
batch_normalization_67 (BatchNo (None, 13, 13, 512) 2048 conv2d_67[0][0]
__________________________________________________________________________________________________
mish_67 (Mish) (None, 13, 13, 512) 0 batch_normalization_67[0][0]
__________________________________________________________________________________________________
conv2d_68 (Conv2D) (None, 13, 13, 512) 2359296 mish_67[0][0]
__________________________________________________________________________________________________
batch_normalization_68 (BatchNo (None, 13, 13, 512) 2048 conv2d_68[0][0]
__________________________________________________________________________________________________
mish_68 (Mish) (None, 13, 13, 512) 0 batch_normalization_68[0][0]
__________________________________________________________________________________________________
add_22 (Add) (None, 13, 13, 512) 0 add_21[0][0]
mish_68[0][0]
__________________________________________________________________________________________________
conv2d_69 (Conv2D) (None, 13, 13, 512) 262144 add_22[0][0]
__________________________________________________________________________________________________
batch_normalization_69 (BatchNo (None, 13, 13, 512) 2048 conv2d_69[0][0]
__________________________________________________________________________________________________
mish_69 (Mish) (None, 13, 13, 512) 0 batch_normalization_69[0][0]
__________________________________________________________________________________________________
conv2d_70 (Conv2D) (None, 13, 13, 512) 2359296 mish_69[0][0]
__________________________________________________________________________________________________
batch_normalization_70 (BatchNo (None, 13, 13, 512) 2048 conv2d_70[0][0]
__________________________________________________________________________________________________
mish_70 (Mish) (None, 13, 13, 512) 0 batch_normalization_70[0][0]
__________________________________________________________________________________________________
add_23 (Add) (None, 13, 13, 512) 0 add_22[0][0]
mish_70[0][0]
__________________________________________________________________________________________________
conv2d_71 (Conv2D) (None, 13, 13, 512) 262144 add_23[0][0]
__________________________________________________________________________________________________
conv2d_61 (Conv2D) (None, 13, 13, 512) 524288 mish_60[0][0]
__________________________________________________________________________________________________
batch_normalization_71 (BatchNo (None, 13, 13, 512) 2048 conv2d_71[0][0]
__________________________________________________________________________________________________
batch_normalization_61 (BatchNo (None, 13, 13, 512) 2048 conv2d_61[0][0]
__________________________________________________________________________________________________
mish_71 (Mish) (None, 13, 13, 512) 0 batch_normalization_71[0][0]
__________________________________________________________________________________________________
mish_61 (Mish) (None, 13, 13, 512) 0 batch_normalization_61[0][0]
__________________________________________________________________________________________________
concatenate_5 (Concatenate) (None, 13, 13, 1024) 0 mish_71[0][0]
mish_61[0][0]
__________________________________________________________________________________________________
conv2d_72 (Conv2D) (None, 13, 13, 1024) 1048576 concatenate_5[0][0]
__________________________________________________________________________________________________
batch_normalization_72 (BatchNo (None, 13, 13, 1024) 4096 conv2d_72[0][0]
__________________________________________________________________________________________________
mish_72 (Mish) (None, 13, 13, 1024) 0 batch_normalization_72[0][0]
__________________________________________________________________________________________________
conv2d_73 (Conv2D) (None, 13, 13, 512) 524288 mish_72[0][0]
__________________________________________________________________________________________________
batch_normalization_73 (BatchNo (None, 13, 13, 512) 2048 conv2d_73[0][0]
__________________________________________________________________________________________________
leaky_re_lu_1 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_73[0][0]
__________________________________________________________________________________________________
conv2d_74 (Conv2D) (None, 13, 13, 1024) 4718592 leaky_re_lu_1[0][0]
__________________________________________________________________________________________________
batch_normalization_74 (BatchNo (None, 13, 13, 1024) 4096 conv2d_74[0][0]
__________________________________________________________________________________________________
leaky_re_lu_2 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_74[0][0]
__________________________________________________________________________________________________
conv2d_75 (Conv2D) (None, 13, 13, 512) 524288 leaky_re_lu_2[0][0]
__________________________________________________________________________________________________
batch_normalization_75 (BatchNo (None, 13, 13, 512) 2048 conv2d_75[0][0]
__________________________________________________________________________________________________
leaky_re_lu_3 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_75[0][0]
__________________________________________________________________________________________________
max_pooling2d_1 (MaxPooling2D) (None, 13, 13, 512) 0 leaky_re_lu_3[0][0]
__________________________________________________________________________________________________
max_pooling2d_2 (MaxPooling2D) (None, 13, 13, 512) 0 leaky_re_lu_3[0][0]
__________________________________________________________________________________________________
max_pooling2d_3 (MaxPooling2D) (None, 13, 13, 512) 0 leaky_re_lu_3[0][0]
__________________________________________________________________________________________________
concatenate_6 (Concatenate) (None, 13, 13, 2048) 0 max_pooling2d_1[0][0]
max_pooling2d_2[0][0]
max_pooling2d_3[0][0]
leaky_re_lu_3[0][0]
__________________________________________________________________________________________________
conv2d_76 (Conv2D) (None, 13, 13, 512) 1048576 concatenate_6[0][0]
__________________________________________________________________________________________________
batch_normalization_76 (BatchNo (None, 13, 13, 512) 2048 conv2d_76[0][0]
__________________________________________________________________________________________________
leaky_re_lu_4 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_76[0][0]
__________________________________________________________________________________________________
conv2d_77 (Conv2D) (None, 13, 13, 1024) 4718592 leaky_re_lu_4[0][0]
__________________________________________________________________________________________________
batch_normalization_77 (BatchNo (None, 13, 13, 1024) 4096 conv2d_77[0][0]
__________________________________________________________________________________________________
leaky_re_lu_5 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_77[0][0]
__________________________________________________________________________________________________
conv2d_78 (Conv2D) (None, 13, 13, 512) 524288 leaky_re_lu_5[0][0]
__________________________________________________________________________________________________
batch_normalization_78 (BatchNo (None, 13, 13, 512) 2048 conv2d_78[0][0]
__________________________________________________________________________________________________
leaky_re_lu_6 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_78[0][0]
__________________________________________________________________________________________________
conv2d_79 (Conv2D) (None, 13, 13, 256) 131072 leaky_re_lu_6[0][0]
__________________________________________________________________________________________________
conv2d_80 (Conv2D) (None, 26, 26, 256) 131072 mish_59[0][0]
__________________________________________________________________________________________________
batch_normalization_79 (BatchNo (None, 13, 13, 256) 1024 conv2d_79[0][0]
__________________________________________________________________________________________________
batch_normalization_80 (BatchNo (None, 26, 26, 256) 1024 conv2d_80[0][0]
__________________________________________________________________________________________________
leaky_re_lu_7 (LeakyReLU) (None, 13, 13, 256) 0 batch_normalization_79[0][0]
__________________________________________________________________________________________________
leaky_re_lu_8 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_80[0][0]
__________________________________________________________________________________________________
up_sampling2d_1 (UpSampling2D) (None, 26, 26, 256) 0 leaky_re_lu_7[0][0]
__________________________________________________________________________________________________
concatenate_7 (Concatenate) (None, 26, 26, 512) 0 leaky_re_lu_8[0][0]
up_sampling2d_1[0][0]
__________________________________________________________________________________________________
conv2d_81 (Conv2D) (None, 26, 26, 256) 131072 concatenate_7[0][0]
__________________________________________________________________________________________________
batch_normalization_81 (BatchNo (None, 26, 26, 256) 1024 conv2d_81[0][0]
__________________________________________________________________________________________________
leaky_re_lu_9 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_81[0][0]
__________________________________________________________________________________________________
conv2d_82 (Conv2D) (None, 26, 26, 512) 1179648 leaky_re_lu_9[0][0]
__________________________________________________________________________________________________
batch_normalization_82 (BatchNo (None, 26, 26, 512) 2048 conv2d_82[0][0]
__________________________________________________________________________________________________
leaky_re_lu_10 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_82[0][0]
__________________________________________________________________________________________________
conv2d_83 (Conv2D) (None, 26, 26, 256) 131072 leaky_re_lu_10[0][0]
__________________________________________________________________________________________________
batch_normalization_83 (BatchNo (None, 26, 26, 256) 1024 conv2d_83[0][0]
__________________________________________________________________________________________________
leaky_re_lu_11 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_83[0][0]
__________________________________________________________________________________________________
conv2d_84 (Conv2D) (None, 26, 26, 512) 1179648 leaky_re_lu_11[0][0]
__________________________________________________________________________________________________
batch_normalization_84 (BatchNo (None, 26, 26, 512) 2048 conv2d_84[0][0]
__________________________________________________________________________________________________
leaky_re_lu_12 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_84[0][0]
__________________________________________________________________________________________________
conv2d_85 (Conv2D) (None, 26, 26, 256) 131072 leaky_re_lu_12[0][0]
__________________________________________________________________________________________________
batch_normalization_85 (BatchNo (None, 26, 26, 256) 1024 conv2d_85[0][0]
__________________________________________________________________________________________________
leaky_re_lu_13 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_85[0][0]
__________________________________________________________________________________________________
conv2d_86 (Conv2D) (None, 26, 26, 128) 32768 leaky_re_lu_13[0][0]
__________________________________________________________________________________________________
conv2d_87 (Conv2D) (None, 52, 52, 128) 32768 mish_38[0][0]
__________________________________________________________________________________________________
batch_normalization_86 (BatchNo (None, 26, 26, 128) 512 conv2d_86[0][0]
__________________________________________________________________________________________________
batch_normalization_87 (BatchNo (None, 52, 52, 128) 512 conv2d_87[0][0]
__________________________________________________________________________________________________
leaky_re_lu_14 (LeakyReLU) (None, 26, 26, 128) 0 batch_normalization_86[0][0]
__________________________________________________________________________________________________
leaky_re_lu_15 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_87[0][0]
__________________________________________________________________________________________________
up_sampling2d_2 (UpSampling2D) (None, 52, 52, 128) 0 leaky_re_lu_14[0][0]
__________________________________________________________________________________________________
concatenate_8 (Concatenate) (None, 52, 52, 256) 0 leaky_re_lu_15[0][0]
up_sampling2d_2[0][0]
__________________________________________________________________________________________________
conv2d_88 (Conv2D) (None, 52, 52, 128) 32768 concatenate_8[0][0]
__________________________________________________________________________________________________
batch_normalization_88 (BatchNo (None, 52, 52, 128) 512 conv2d_88[0][0]
__________________________________________________________________________________________________
leaky_re_lu_16 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_88[0][0]
__________________________________________________________________________________________________
conv2d_89 (Conv2D) (None, 52, 52, 256) 294912 leaky_re_lu_16[0][0]
__________________________________________________________________________________________________
batch_normalization_89 (BatchNo (None, 52, 52, 256) 1024 conv2d_89[0][0]
__________________________________________________________________________________________________
leaky_re_lu_17 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_89[0][0]
__________________________________________________________________________________________________
conv2d_90 (Conv2D) (None, 52, 52, 128) 32768 leaky_re_lu_17[0][0]
__________________________________________________________________________________________________
batch_normalization_90 (BatchNo (None, 52, 52, 128) 512 conv2d_90[0][0]
__________________________________________________________________________________________________
leaky_re_lu_18 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_90[0][0]
__________________________________________________________________________________________________
conv2d_91 (Conv2D) (None, 52, 52, 256) 294912 leaky_re_lu_18[0][0]
__________________________________________________________________________________________________
batch_normalization_91 (BatchNo (None, 52, 52, 256) 1024 conv2d_91[0][0]
__________________________________________________________________________________________________
leaky_re_lu_19 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_91[0][0]
__________________________________________________________________________________________________
conv2d_92 (Conv2D) (None, 52, 52, 128) 32768 leaky_re_lu_19[0][0]
__________________________________________________________________________________________________
batch_normalization_92 (BatchNo (None, 52, 52, 128) 512 conv2d_92[0][0]
__________________________________________________________________________________________________
leaky_re_lu_20 (LeakyReLU) (None, 52, 52, 128) 0 batch_normalization_92[0][0]
__________________________________________________________________________________________________
zero_padding2d_6 (ZeroPadding2D (None, 53, 53, 128) 0 leaky_re_lu_20[0][0]
__________________________________________________________________________________________________
conv2d_95 (Conv2D) (None, 26, 26, 256) 294912 zero_padding2d_6[0][0]
__________________________________________________________________________________________________
batch_normalization_94 (BatchNo (None, 26, 26, 256) 1024 conv2d_95[0][0]
__________________________________________________________________________________________________
leaky_re_lu_22 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_94[0][0]
__________________________________________________________________________________________________
concatenate_9 (Concatenate) (None, 26, 26, 512) 0 leaky_re_lu_22[0][0]
leaky_re_lu_13[0][0]
__________________________________________________________________________________________________
conv2d_96 (Conv2D) (None, 26, 26, 256) 131072 concatenate_9[0][0]
__________________________________________________________________________________________________
batch_normalization_95 (BatchNo (None, 26, 26, 256) 1024 conv2d_96[0][0]
__________________________________________________________________________________________________
leaky_re_lu_23 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_95[0][0]
__________________________________________________________________________________________________
conv2d_97 (Conv2D) (None, 26, 26, 512) 1179648 leaky_re_lu_23[0][0]
__________________________________________________________________________________________________
batch_normalization_96 (BatchNo (None, 26, 26, 512) 2048 conv2d_97[0][0]
__________________________________________________________________________________________________
leaky_re_lu_24 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_96[0][0]
__________________________________________________________________________________________________
conv2d_98 (Conv2D) (None, 26, 26, 256) 131072 leaky_re_lu_24[0][0]
__________________________________________________________________________________________________
batch_normalization_97 (BatchNo (None, 26, 26, 256) 1024 conv2d_98[0][0]
__________________________________________________________________________________________________
leaky_re_lu_25 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_97[0][0]
__________________________________________________________________________________________________
conv2d_99 (Conv2D) (None, 26, 26, 512) 1179648 leaky_re_lu_25[0][0]
__________________________________________________________________________________________________
batch_normalization_98 (BatchNo (None, 26, 26, 512) 2048 conv2d_99[0][0]
__________________________________________________________________________________________________
leaky_re_lu_26 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_98[0][0]
__________________________________________________________________________________________________
conv2d_100 (Conv2D) (None, 26, 26, 256) 131072 leaky_re_lu_26[0][0]
__________________________________________________________________________________________________
batch_normalization_99 (BatchNo (None, 26, 26, 256) 1024 conv2d_100[0][0]
__________________________________________________________________________________________________
leaky_re_lu_27 (LeakyReLU) (None, 26, 26, 256) 0 batch_normalization_99[0][0]
__________________________________________________________________________________________________
zero_padding2d_7 (ZeroPadding2D (None, 27, 27, 256) 0 leaky_re_lu_27[0][0]
__________________________________________________________________________________________________
conv2d_103 (Conv2D) (None, 13, 13, 512) 1179648 zero_padding2d_7[0][0]
__________________________________________________________________________________________________
batch_normalization_101 (BatchN (None, 13, 13, 512) 2048 conv2d_103[0][0]
__________________________________________________________________________________________________
leaky_re_lu_29 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_101[0][0]
__________________________________________________________________________________________________
concatenate_10 (Concatenate) (None, 13, 13, 1024) 0 leaky_re_lu_29[0][0]
leaky_re_lu_6[0][0]
__________________________________________________________________________________________________
conv2d_104 (Conv2D) (None, 13, 13, 512) 524288 concatenate_10[0][0]
__________________________________________________________________________________________________
batch_normalization_102 (BatchN (None, 13, 13, 512) 2048 conv2d_104[0][0]
__________________________________________________________________________________________________
leaky_re_lu_30 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_102[0][0]
__________________________________________________________________________________________________
conv2d_105 (Conv2D) (None, 13, 13, 1024) 4718592 leaky_re_lu_30[0][0]
__________________________________________________________________________________________________
batch_normalization_103 (BatchN (None, 13, 13, 1024) 4096 conv2d_105[0][0]
__________________________________________________________________________________________________
leaky_re_lu_31 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_103[0][0]
__________________________________________________________________________________________________
conv2d_106 (Conv2D) (None, 13, 13, 512) 524288 leaky_re_lu_31[0][0]
__________________________________________________________________________________________________
batch_normalization_104 (BatchN (None, 13, 13, 512) 2048 conv2d_106[0][0]
__________________________________________________________________________________________________
leaky_re_lu_32 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_104[0][0]
__________________________________________________________________________________________________
conv2d_107 (Conv2D) (None, 13, 13, 1024) 4718592 leaky_re_lu_32[0][0]
__________________________________________________________________________________________________
batch_normalization_105 (BatchN (None, 13, 13, 1024) 4096 conv2d_107[0][0]
__________________________________________________________________________________________________
leaky_re_lu_33 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_105[0][0]
__________________________________________________________________________________________________
conv2d_108 (Conv2D) (None, 13, 13, 512) 524288 leaky_re_lu_33[0][0]
__________________________________________________________________________________________________
batch_normalization_106 (BatchN (None, 13, 13, 512) 2048 conv2d_108[0][0]
__________________________________________________________________________________________________
leaky_re_lu_34 (LeakyReLU) (None, 13, 13, 512) 0 batch_normalization_106[0][0]
__________________________________________________________________________________________________
conv2d_109 (Conv2D) (None, 13, 13, 1024) 4718592 leaky_re_lu_34[0][0]
__________________________________________________________________________________________________
conv2d_101 (Conv2D) (None, 26, 26, 512) 1179648 leaky_re_lu_27[0][0]
__________________________________________________________________________________________________
conv2d_93 (Conv2D) (None, 52, 52, 256) 294912 leaky_re_lu_20[0][0]
__________________________________________________________________________________________________
batch_normalization_107 (BatchN (None, 13, 13, 1024) 4096 conv2d_109[0][0]
__________________________________________________________________________________________________
batch_normalization_100 (BatchN (None, 26, 26, 512) 2048 conv2d_101[0][0]
__________________________________________________________________________________________________
batch_normalization_93 (BatchNo (None, 52, 52, 256) 1024 conv2d_93[0][0]
__________________________________________________________________________________________________
leaky_re_lu_35 (LeakyReLU) (None, 13, 13, 1024) 0 batch_normalization_107[0][0]
__________________________________________________________________________________________________
leaky_re_lu_28 (LeakyReLU) (None, 26, 26, 512) 0 batch_normalization_100[0][0]
__________________________________________________________________________________________________
leaky_re_lu_21 (LeakyReLU) (None, 52, 52, 256) 0 batch_normalization_93[0][0]
__________________________________________________________________________________________________
conv2d_110 (Conv2D) (None, 13, 13, 255) 261375 leaky_re_lu_35[0][0]
__________________________________________________________________________________________________
conv2d_102 (Conv2D) (None, 26, 26, 255) 130815 leaky_re_lu_28[0][0]
__________________________________________________________________________________________________
conv2d_94 (Conv2D) (None, 52, 52, 255) 65535 leaky_re_lu_21[0][0]
==================================================================================================
Total params: 64,429,405
Trainable params: 64,363,101
Non-trainable params: 66,304
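The parameter counts in the summary above can be checked by hand. The sketch below assumes that the backbone and neck convolutions have no bias (each is followed by batch normalization), that the three output convolutions do have a bias, and that each Keras BatchNormalization layer stores 4 values per channel (gamma, beta, moving mean, moving variance). The helper names are made up for illustration. Reading 255 as 3 anchors x (80 classes + 5) is also an assumption, based on the usual COCO setup.

```python
def conv2d_params(kernel, c_in, c_out, use_bias=False):
    """Parameter count of a kernel x kernel Conv2D layer (hypothetical helper)."""
    return kernel * kernel * c_in * c_out + (c_out if use_bias else 0)


def batchnorm_params(channels):
    """gamma, beta, moving mean and moving variance: 4 values per channel."""
    return 4 * channels


# Values compared against rows of the summary above.
p_conv2d_52 = conv2d_params(1, 256, 256)           # 1x1 conv, 256 -> 256
p_conv2d_53 = conv2d_params(3, 256, 256)           # 3x3 conv, 256 -> 256
p_conv2d_60 = conv2d_params(3, 512, 1024)          # 3x3 stride-2 conv, 512 -> 1024
p_conv2d_76 = conv2d_params(1, 2048, 512)          # 1x1 conv after the SPP concatenation
p_bn_52 = batchnorm_params(256)
p_conv2d_110 = conv2d_params(1, 1024, 255, use_bias=True)  # 13x13 output head

# Assumption: 255 output channels = 3 anchors * (80 COCO classes + 4 box coords + 1 objectness)
head_channels = 3 * (80 + 5)

print(p_conv2d_52, p_conv2d_53, p_conv2d_60, p_conv2d_76, p_bn_52, p_conv2d_110)
print(head_channels, 64363101 + 66304)
```

The trainable and non-trainable counts also add up to the reported total (64,363,101 + 66,304 = 64,429,405). In Keras the non-trainable part normally comes from the moving statistics of the batch-normalization layers.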
3. Network evaluation metrics
3.1 Network performance metrics
3.1.1 FPS
3.1.2 AP
3.1.3 mAP
3.1.4 IoU
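Intersection over Union (IoU) is the ratio of the area of intersection to the area of union between the region predicted by the algorithm and the true region of the target (the ground-truth box). IoU measures how accurately the detector localizes an object in space; a good prediction has a high IoU.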
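The definition above can be sketched in a few lines. This is a minimal illustration, assuming axis-aligned boxes given as (x1, y1, x2, y2) corner coordinates. The function name `iou` and the box format are choices made for this example, not taken from any particular YOLOv4 implementation.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes in (x1, y1, x2, y2) format."""
    # Corners of the intersection rectangle.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Width and height are clamped to zero when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    # Guard against degenerate boxes with zero area.
    return inter / union if union > 0 else 0.0


prediction = (0, 0, 2, 2)
ground_truth = (1, 1, 3, 3)
print(iou(prediction, ground_truth))  # intersection 1, union 4 + 4 - 1 = 7
```

A detection is commonly counted as correct when its IoU with a ground-truth box exceeds a chosen threshold such as 0.5.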
RANK
3.1.5 Precision
3.1.6 Recall
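After enabling Mosaic data augmentation, both loss and val_loss decreased, but the bounding boxes jitter during detection, so the network is not yet stable.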
After concatenating the results into a single numpy array, it turns out that the bboxes are sorted by score.
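One way to reproduce this check is sketched below. The arrays `boxes`, `scores` and `classes` are made-up stand-ins for the detector output, not the actual variables of the implementation used here. The column layout (4 box coordinates, then score, then class id) is likewise an assumption.

```python
import numpy as np

# Hypothetical detector output for three detections.
boxes = np.array([[10, 20, 50, 80],
                  [15, 25, 55, 85],
                  [100, 100, 150, 160]], dtype=np.float32)
scores = np.array([0.91, 0.75, 0.42], dtype=np.float32)
classes = np.array([0, 0, 16], dtype=np.float32)

# Concatenate into one (N, 6) array: x1, y1, x2, y2, score, class.
detections = np.concatenate([boxes, scores[:, None], classes[:, None]], axis=1)

# The rows are sorted by score if the score column never increases.
is_sorted = bool(np.all(np.diff(detections[:, 4]) <= 0))
print(detections.shape, is_sorted)

# If the order is not guaranteed, sort explicitly by descending score.
order = np.argsort(-detections[:, 4])
detections_sorted = detections[order]
```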