CVPR 2020 论文大盘点-文本图像篇

时间 2020-12-27

本文盘点CVPR 2020 所有文本图像（text）相关论文，主要分为手写文本和场景文本两大方向，总计16篇，对文献进行了细致的分类，大部分论文是围绕识别问题的研究。

方向包括：

1）场景文本检测（Scene Text Detection），从街景等场景文本中检测文本的位置，2 篇文献均为不规则任意形状文本的检测；

2）场景文本识别（Scene Text Recognition），对场景文本检测得到的结果进行识别，共 4 篇文章；

3）手写文本识别（Handwritten Text Recognition），2 篇文章；

4）场景文本端到端识别（Scene Text Spotting），1 篇文章，即华南理工大学和阿德莱德大学学者提出的实时 ABCNet 算法，很吸引人，已经开源；

5）手写文本生成（Handwritten Text Generation），为了增加手写文本的训练样本（感觉也可以用来“写作业”），1 篇文章；

6）场景文本合成（Scene Text Synthesis），为了增加场景文本的训练样本，1 篇文章，出自旷视科技，UnrealText用渲染引擎生成逼真场景文本；

7）文本图像的数据增广，用于手写和场景文本识别算法的训练，1 篇文章；

8）场景文本编辑（Scene Text Editor），对场景文本图像中的文字进行替换；

9）碎纸文档重建，用于刑侦领域的文档被破坏成碎片后的重建，1篇；

10）文本风格迁移，1篇；

11）场景文本识别的对抗攻击研究，1篇；

12）笔迹鉴定，1篇。

值得一提的，16篇文章中10篇已经开源或者准备开源，感谢这些开发者～

已经开源或者即将开源的论文，把代码地址也附上了。

大家可以在：

http://openaccess.thecvf.com/CVPR2020.py

按照题目下载这些论文。

如果想要下载所有CVPR 2020论文，请点击这里：

CVPR 2020 论文全面开放下载，含主会和workshop

场景文本检测

深度关系推理图网络用于任意形状文本检测

[1].Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection

作者 | Shi-Xue Zhang, Xiaobin Zhu, Jie-Bo Hou, Chang Liu, Chun Yang, Hongfa Wang, Xu-Cheng Yin

单位 | 北京科技大学；中国科学技术大学人工智能联合实验室；腾讯科技（深圳）

代码 | https://github.com/GXYM/DRRG

备注 | CVPR 2020 Oral

解读 | https://blog.csdn.net/SpicyCoder/article/details/105072570

[2].ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection

作者 | Yuxin Wang, Hongtao Xie, Zheng-Jun Zha, Mengting Xing, Zilong Fu, Yongdong Zhang

单位 | 中国科学技术大学

代码 | https://github.com/wangyuxin87/ContourNet

解读 | https://zhuanlan.zhihu.com/p/135399747

场景文本识别

论场景文本识别中的词汇依赖性

[3].On Vocabulary Reliance in Scene Text Recognition

作者 | Zhaoyi Wan, Jielei Zhang, Liang Zhang, Jiebo Luo, Cong Yao

单位 | 旷视；中国矿业大学；罗切斯特大学

[4].SCATTER: Selective Context Attentional Scene Text Recognizer

作者 | Ron Litman, Oron Anschel, Shahar Tsiper, Roee Litman, Shai Mazor, R. Manmatha

单位 | Amazon Web Services

语义推理网络，用于场景文本的精确识别

[5].Towards Accurate Scene Text Recognition With Semantic Reasoning Networks

作者 | Deli Yu, Xuan Li, Chengquan Zhang, Tao Liu, Junyu Han, Jingtuo Liu, Errui Ding

单位 | 国科大；百度；中科院

代码 | https://github.com/chenjun2hao/SRN.pytorch

语义增强的编解码框架，用于识别低质量图像（模糊、光照不均、字符不完整等）场景文本

[6].SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

作者 | Zhi Qiao, Yu Zhou, Dongbao Yang, Yucan Zhou, Weiping Wang

单位 | 中科院；国科大

代码 | https://github.com/Pay20Y/SEED（即将）

手写文本识别

[7].OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold

作者 | Mohamed Yousef, Tom E. Bishop

单位 | Intuition Machines, Inc

代码 | https://github.com/IntuitionMachines/OrigamiNet

Scene Text Spotting

实时端到端场景文本识别

[8].ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network

作者 | Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, Liangwei Wang

单位 | 华南理工大学；阿德莱德大学；

代码 | https://github.com/Yuliang-Liu/bezier_curve_text_spotting

备注 | CVPR 2020 Oral

解读 | https://zhuanlan.zhihu.com/p/146276834

手写文本生成

半监督变长手写文本生成，增加文本数据集，提高识别算法精度

[9].ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation

作者 | Sharon Fogel, Hadar Averbuch-Elor, Sarel Cohen, Shai Mazor, Roee Litman

单位 | 以色列国，Amazon Rekognition；康奈尔大学

代码 | https://github.com/amzn/convolutional-handwriting-gan

场景文本合成

使用渲染引擎合成场景文本，增加训练样本，提升识别算法精度

[10].UnrealText: Synthesizing Realistic Scene Text Images From the Unreal

作者 | WorldShangbang Long, Cong Yao

单位 | 卡内基梅隆大学；旷视

代码 | https://jyouhou.github.io/UnrealText/

解读 | https://zhuanlan.zhihu.com/p/137406773

数据增广+文本识别

图像增广用于手写与场景文本识别

[11].Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition

作者 | Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang

单位 | 华南理工大学；阿里

代码 | https://github.com/Canjie-Luo/Text-Image-Augmentation

场景文本编辑

[12].STEFANN: Scene Text Editor Using Font Adaptive Neural Network

作者 | Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal

单位 | 印度统计研究所；印度理工学院

代码 | https://github.com/prasunroy/stefann

网站 | https://prasunroy.github.io/stefann/

碎纸文档重建

破碎纸片重建文档，用于法医等刑侦调查

[13].Fast(er) Reconstruction of Shredded Text Documents via Self-Supervised Deep Asymmetric Metric Learning

作者 | Thiago M. Paixao, Rodrigo F. Berriel, Maria C. S. Boeres, Alessandro L. Koerich, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos

单位 | IFES，Brazil；UFES，Brazil；ETS，Canada

文本风格迁移

[14].SwapText: Image Based Texts Transfer in Scenes

作者 | Qiangpeng Yang, Jun Huang, Wei Lin

单位 | 阿里

场景文本识别+对抗攻击

[15].What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images

作者 | Xing Xu, Jiefu Chen, Jinhui Xiao, Lianli Gao, Fumin Shen, Heng Tao Shen

单位 | 电子科技大学

笔迹鉴定

[16].Sequential Motif Profiles and Topological Plots for Offline Signature Verification

作者 | Elias N. Zois, Evangelos Zervas, Dimitrios Tsourounis, George Economou

单位 | University of West Attica ；派图拉斯大学

往期"精彩阅读"

END

备注：ocr

OCR交流群

文本检测、识别、编辑等更多最新技术，若已为CV君其他账号好友请直接私信。

我爱计算机视觉

微信号:aicvml

QQ群:805388940

微博知乎:@我爱计算机视觉

投稿:[email protected]

网站:www.52cv.net

在看，让更多人看到