دورية أكاديمية

YOLOT: Multi-scale and diverse tire sidewall text region detection based on You-Only-Look-Once(YOLOv5)

التفاصيل البيبلوغرافية
العنوان: YOLOT: Multi-scale and diverse tire sidewall text region detection based on You-Only-Look-Once(YOLOv5)
المؤلفون: Dehua Liu, Yongqin Tian, Yibo Xu, Wenyi Zhao, Xipeng Pan, Xu Ji, Mu Yang, Huihua Yang
المصدر: Cognitive Robotics, Vol 4, Iss , Pp 74-87 (2024)
بيانات النشر: KeAi Communications Co. Ltd., 2024.
سنة النشر: 2024
المجموعة: LCC:Electronic computers. Computer science
مصطلحات موضوعية: Tire sidewall, Multi-scale, Diverse characters, YOLO, Character detection, Electronic computers. Computer science, QA75.5-76.95
الوصف: Driving safety is significant to building a people-oriented and harmonious society, Tires are one of the key components of a vehicle and the character information on the tire sidewall is critical to their storage and usage. However, due to the diverse and differentiated features of typographic fonts, simultaneously extracting comprehensive characteristics is an extremely challenging task. To effectively break through these performance degradation issues, a multi-scale tire sidewall text region detection algorithm based on YOLOv5 is introduced, called YOLOT, which fuses comprehensive feature information in both width and depth directions. In this study, we firstly propose the Width and Depth Awareness (WDA) module in the text region detection field and successfully integrated it with the FPN structure to form the WDA-FPN. The purpose of WDA-FPN is to empower the network to capture multi-scale and multi-shape features in images, thereby augmenting the algorithm’s abstraction and representation of image features and concurrently boosting its robustness and generalization performance. Experimental findings indicate that, compared to the primary algorithm, YOLOT achieves significant improvement in accuracy, providing a higher detection reliability. The dataset and code for the paper are available at: https://github.com/Cloude-dehua/YOLOTTest.
نوع الوثيقة: article
وصف الملف: electronic resource
اللغة: English
تدمد: 2667-2413
العلاقة: http://www.sciencedirect.com/science/article/pii/S266724132400003XTest; https://doaj.org/toc/2667-2413Test
DOI: 10.1016/j.cogr.2024.03.001
الوصول الحر: https://doaj.org/article/ac11d1bcc1e043b98a1a6fc3ee64e550Test
رقم الانضمام: edsdoj.11d1bcc1e043b98a1a6fc3ee64e550
قاعدة البيانات: Directory of Open Access Journals
الوصف
تدمد:26672413
DOI:10.1016/j.cogr.2024.03.001