
Yuchen Li
Phone: +61 423101178
less
Related Authors
Ashish Gupta
Case Western Reserve University
Steven Pinker
Harvard University
James Elkins
School of the Art Institute of Chicago
Simon O'Sullivan
Goldsmiths, University of London
David Seamon
Kansas State University
Armando Marques-Guedes
UNL - New University of Lisbon
Samit Chakrabarty
University of Leeds
Fabio Cuzzolin
Oxford Brookes University
Lev Manovich
Graduate Center of the City University of New York
Katalin Feher
University of Public Service
Uploads
Papers by Yuchen Li
The OCR engines presented in the study include Tesseract OCR, Keras OCR, Paddle OCR, and Microsoft Azure Computer Vision. An innovative design and implementation of an integrated OCR Scoring System which, intricately assesses each engine’s output, integrating confidence scores and Character Error Rates (CER) to determine the most accurate text extraction, is developed and evaluated.
The selection of evaluation metrics—accuracy, precision, recall, F1 score, Character Error Rate (CER), Word Error Rate (WER), and running time per image (RT)—was strategic to comprehensively dis- tinguish the performance of individual OCR engines. Precision and recall offer insights into an OCR engine’s inclusivity versus its fo- cus on detail, which bears significance in handling complex data sets. Running time was tracked to evaluate processing speed, providing a pragmatic lens for understanding the trade-offs between speed and accuracy in real-world applications. demonstrate that while individ- ual OCR engines have their strengths, the integrated
Our findings reveal that the Integrated Method, which amalgamates the strengths of individual OCR engines, consistently outperforms its constituent systems across all tested conditions. It demonstrates remarkable adaptability and resilience, achieving superior accuracy, precision, and recall rates, particularly in the face of the image qual- ity fluctuations inherent in real-world tasks. Azure OCR emerged as a significant contributor to this integrated success, showing exceptional processing speed and precision, with notable proficiency in decipher- ing complex handwriting—a capability that suggests a high degree of immunity to image quality variations.
The OCR engines presented in the study include Tesseract OCR, Keras OCR, Paddle OCR, and Microsoft Azure Computer Vision. An innovative design and implementation of an integrated OCR Scoring System which, intricately assesses each engine’s output, integrating confidence scores and Character Error Rates (CER) to determine the most accurate text extraction, is developed and evaluated.
The selection of evaluation metrics—accuracy, precision, recall, F1 score, Character Error Rate (CER), Word Error Rate (WER), and running time per image (RT)—was strategic to comprehensively dis- tinguish the performance of individual OCR engines. Precision and recall offer insights into an OCR engine’s inclusivity versus its fo- cus on detail, which bears significance in handling complex data sets. Running time was tracked to evaluate processing speed, providing a pragmatic lens for understanding the trade-offs between speed and accuracy in real-world applications. demonstrate that while individ- ual OCR engines have their strengths, the integrated
Our findings reveal that the Integrated Method, which amalgamates the strengths of individual OCR engines, consistently outperforms its constituent systems across all tested conditions. It demonstrates remarkable adaptability and resilience, achieving superior accuracy, precision, and recall rates, particularly in the face of the image qual- ity fluctuations inherent in real-world tasks. Azure OCR emerged as a significant contributor to this integrated success, showing exceptional processing speed and precision, with notable proficiency in decipher- ing complex handwriting—a capability that suggests a high degree of immunity to image quality variations.