W. Kazmi, I. Nabney, G. Vogiatzis, P. Rose, A. Codd, An efficient industrial system for vehicle tyre (Tire) detection and text recognition using deep learning. IEEE Trans. Intell. Transp. Syst. 22(2), 1264–1275 (2021). https://doi.org/10.1109/TITS.2020.2967316
Article
Google Scholar
Y. Liu, L. Jin, C. Fang, Arbitrarily shaped scene text detection with a mask tightness text detector. IEEE Trans. Image Process. 29, 2918–2930 (2020). https://doi.org/10.1109/TIP.2019.2954218
Article
Google Scholar
P. Cheng, Y. Cai, W. Wang, “A direct regression scene text detector with position-sensitive segmentation. IEEE Trans. Circuits Syst. Video Technol., 30(11): 4171–4181 (2020). https://doi.org/10.1109/TCSVT.2019.2947475
P. N. C. a. w. P. Shivakumara, R. Raghavendra, S. Nag, U. Pal, T. Lu, D. Lopresti, "An episodic learning network for text detection on human bodies in sports images," In IEEE Transactions on Circuits and Systems for Video Technology, 1–1 (2021). https://doi.org/10.1109/TCSVT.2021.3092713
S. Ren, K. He, R. Girshick, J. Sun, Faster RCNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell 39(6), 1137–1149 (2017). https://doi.org/10.1109/tpami.2016.2577031
Article
Google Scholar
W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, A.C. Berg, “SSD: single shot multibox detector,” In European Conference on Computer Vision (ECCV), 21–37 (2016). https://doi.org/10.1007/978-3-319-46448-0_2
J. Redmon, S. Divvala, R. Girshick, A. Farhadi, “You only look once: unified, real-time object detection,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 779–788 (2016). https://doi.org/10.1109/CVPR.2016.91
L. Huang, Y. Yang, Y. Deng, Y. Yu, “DenseBox: unifying landmark localization with end to end object detection,” arXiv preprint arXiv:1509.04874(2015)
M. Liao, B. Shi, X. Bai, X. Wang, and W. Liu, “TextBoxes: A Fast Text Detector with a Single Deep Neural Network,” In The National Conference on Artificial Intelligence (AAAI), 4161–4167 (2017).
M. Liao, B. Shi, X. Bai, “TextBoxes++: a single-shot oriented scene text detector,” IEEE Trans. Image Process., 3676–3690 (2018). http://dx.doi.org/https://doi.org/10.1109/TIP.2018.2825107.
X. Zhou, C. Yao, H. Wen, Y. Wang, S. Zhou, W. He, J. Liang, “EAST: an efficient and accurate scene text detector,” In the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2642–2651 (2017). https://doi.org/10.1109/CVPR.2017.283
J. Ma, W. Shao, H. Ye, L. Wang, H. Wang, Y. Zheng, X. Xue, Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20(11), 3111–3122 (2018). https://doi.org/10.1109/TMM.2018.2818020
Article
Google Scholar
L. Tychsen-Smith, L. Petersson, “DeNet: scalable realtime object detection with directed sparse sampling,” In IEEE International Conference on Computer Vision (ICCV), 428–436 (2017)
L. Pengyuan et al., “Multi-oriented scene text detection via corner localization and region segmentation,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 7553–7563 (2018). https://doi.org/10.1109/CVPR.2018.00788
X. Wang, K. Chen, Z. Huang, C. Yao, W. Liu, “Point linking network for object detection,” arXiv preprint arXiv: 1706.03646 (2017)
D. Deng, H. Liu, X. Li, D. Cai, “PixelLink: detecting scene text via instance segmentation,” In The National Conference on Artificial Intelligence (AAAI), 6773–6780 (2018)
Z. Zhang, C. Zhang, W. Shen, C. Yao, W. Liu, X. Bai, “Multi-oriented text detection with fully convolutional networks,” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4159–4167 (2016). https://doi.org/10.1109/CVPR.2016.451
X. Enze et al., “Scene text detection with supervised pyramid context network,” In The National Conference on Artificial Intelligence (AAAI), 9038–9045 (2019)
L. Shangbang et al., “TextSnake: a flexible representation for detecting text of arbitrary shapes,” In European Conference on Computer Vision (ECCV), 20–36 (2018). https://doi.org/10.1007/978-3-030-01216-8_2
W. Wenhai et al., “Shape robust text detection with progressive scale expansion network,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 9336–9345 (2019). https://doi.org/10.1109/CVPR.2019.00956
H. Qibin et al., “Strip pooling: rethinking spatial pooling for scene parsing,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4003–4012 (2020). https://doi.org/10.1109/CVPR42600.2020.00406
S. Xie, R. Girshick, P. Dollár, Z. Tu, K. He, “Aggregated residual transformations for deep neural networks,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5987–5995 (2017). https://doi.org/10.1109/CVPR.2017.634
S. Gao, M. Cheng, K. Zhao et al., Res2Net: a new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 652–662 (2021). https://doi.org/10.1109/TPAMI.2019.2938758
Article
Google Scholar
K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90
C. Szegedy, W. Liu, Y. Jia et al., “going deeper with convolutions,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1–9 (2015). https://doi.org/10.1109/CVPR.2015.7298594
T. Zhi, W. Huang, H. Tong, et al., “Detecting text in natural image with connectionist text proposal network,” In European Conference on Computer Vision (ECCV), 56–72 (2016). https://doi.org/10.1007/978-3-319-46484-8_4
B. Shi, X. Bai, S. Belongie, “Detecting oriented text in natural images by linking segments,” In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3482–3490 (2017). https://doi.org/10.1109/CVPR.2017.371
H. Hu, C. Zhang, Y. Luo, Y. Wang, J. Han, E. Ding, “WordSup: exploiting word annotations for character based text detection,” In IEEE International Conference on Computer Vision (ICCV), 4950–4959 (2017). https://doi.org/10.1109/ICCV.2017.529
H. Zhao, J. Shi, X. Qi, X. Wang, J. Jia, “Pyramid scene parsing network,” In CVPR, 6230–6239 (2017)
J. He, Z. Deng, L. Zhou, Y. Wang, Y. Qiao, “Adaptive pyramid context network for semantic segmentation,” In CVPR, 7519–7528 (2019)
Y. Liu, J. Yan, Y. Xiang, “Research on license plate recognition algorithm based on ABCNet,” In IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE), 465–469 (2020). https://doi.org/10.1109/ICISCAE51034.2020.9236855
M. Liao, P. Lyu, M. He, C. Yao, W. Wu, X. Bai, Mask TextSpotter: an end-to-end trainable neural network for spotting text with arbitrary shapes. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 532–548 (2021). https://doi.org/10.1109/TPAMI.2019.2937086
Article
Google Scholar
W. Feng, W. He, F. Yin, X. Zhang, C. Liu, “TextDragon: an end-to-end framework for arbitrary shaped text spotting,” In IEEE International Conference on Computer Vision (ICCV), 9075–9084 (2019), https://doi.org/10.1109/ICCV.2019.00917