【个人开源】论文复现SRN：Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

原创

怡宝2号 2021-09-06 17:33:14 博主文章分类：ocr ©著作权

文章标签 ocr cvpr2020 2d python lua 文章分类 代码人生

©著作权归作者所有：来自51CTO博客作者怡宝2号的原创作品，请联系作者获取转载授权，否则将追究法律责任

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Unofficial PyTorch implementation of the paper, which integrates not only global semantic reasoning module but also parallel visual attention module and visual-semantic fusion decoder.the semanti reasoning network(SRN) can be trained end-to-end.

At present, the accuracy of the paper cannot be achieved. And i borrowed code from deep-text-recognition-benchmark

model
【个人开源】论文复现SRN：Towards Accurate Scene Text Recognition with Semantic Reasoning Networks_lua

result

IIIT5k_3000	SVT	IC03_860	IC03_867	IC13_857	IC13_1015	IC15_1811	IC15_2077	SVTP	CUTE80
84.600	83.617	92.907	92.849	90.315	88.177	71.010	68.064	71.008	68.641

total_accuracy: 80.597

Feature

predict the character at once time
DistributedDataParallel training

Requirements

Pytorch >= 1.1.0

Test

download the evaluation data from deep-text-recognition-benchmark
download the pretrained model from Baidu, Password: d2qn
test on the evaluation data

python test.py --eval_data path-to-data --saved_model path-to-model

Train

download the training data from deep-text-recognition-benchmark
training from scratch

python train.py --train_data path-to-train-data --valid-data path-to-valid-data

Reference

difference with the origin paper

use resnet for 1D feature not resnetFpn 2D feature
use add not gated unit for visual-semanti fusion decoder

other

It is difficult to achieve the accuracy of the paper, hope more people to try and share

上一篇：【CNN】——矩阵乘法优化

下一篇：【Cmake】——常用的cmake变量

提问和评论都可以，用心的回复会被更多人看到评论

发布评论

相关文章

官方博客	全部文章	热门标签	班级博客
了解我们	网站地图	意见反馈

鸿蒙开发者社区	51CTO学堂
51CTO	软考资讯