|国家预印本平台
首页|PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

来源:Arxiv_logoArxiv
英文摘要

Optical Character Recognition (OCR) systems have been widely used in various of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to improve the accuracy of PP-OCR and keep high efficiency, in this paper, we propose a more robust OCR system, i.e. PP-OCRv2. We introduce bag of tricks to train a better text detector and a better text recognizer, which include Collaborative Mutual Learning (CML), CopyPaste, Lightweight CPUNetwork (LCNet), Unified-Deep Mutual Learning (U-DML) and Enhanced CTCLoss. Experiments on real data show that the precision of PP-OCRv2 is 7% higher than PP-OCR under the same inference cost. It is also comparable to the server models of the PP-OCR which uses ResNet series as backbones. All of the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle.

Yehua Yang、Qiwen Liu、Cheng Cui、Jun Zhou、Dianhai Yu、Bin Lu、Yuning Du、Ruoyu Guo、Xiaoguang Hu、Chenxia Li、Weiwei Liu、Yanjun Ma

计算技术、计算机技术

Yehua Yang,Qiwen Liu,Cheng Cui,Jun Zhou,Dianhai Yu,Bin Lu,Yuning Du,Ruoyu Guo,Xiaoguang Hu,Chenxia Li,Weiwei Liu,Yanjun Ma.PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System[EB/OL].(2021-09-07)[2025-08-16].https://arxiv.org/abs/2109.03144.点此复制

评论