A system for identifying different apple varieties based on the YOLOV8X neural network

Authors: Mikheev D.A., Kitaev D.N.
Published in issue: #3(92)/2024
DOI:
Category: Informatics, Computer Engineering and Control \| Chapter: Information Technology. Computer techologies. Theory of computers and systems
Keywords: computer vision, object identification, neural networks, convolutional neural network, single-stage detector, YOLOv8x, weighed products, apple varieties, retail
Published: 28.07.2024

The paper presents results of selecting a neural network model to solve the problem of identifying the weighed products. It considers a family of the single-stage models of the YOLOv8 convolutional neural networks and assesses at the initial stage performance of the largest YOLOv8x model on the frames with images of fruits and vegetables in a grocery store. Data were collected and prepared for the assessed network learning to recognize five apple varieties: Golden Delicious, Granny Smith, Gala, Honey Crisp and Red Chief. The obtained data was introduced to learn the YOLOv8x model using the transfer learning; results of the learned model operation were analyzed.

References

[1] Andrianov N.A., Dementyev V.E., Tashlinsky A.G. Object detection in images: from Bayes and Neyman-Pearson criteria to detectors based on neural networks EfficientDet. Computer Optics, 2022, Vol. 46, No. 1, pp. 139–159. https://doi.org/10.18287/2412-6179-CO-922

[2] Albumentations, “Computer vision tool that boosts the performance of deep convolutional neural networks”. Available at: https://albumentations.ai/ (accessed April 15, 2024).

[3] Computer Hope, Dictionary, Definitions, “FLOPS”. Available at: https://www.computerhope.com/jargon/f/flops.htm (accessed April 15, 2024).

[4] GitHub, ultralytics repository, YOLOv8. URL: https://github.com/ultralytics/ultralytics (accessed April 15, 2024).

[5] Hui J. mAP (mean Average Precision) for object detection. Available at: https://jonathan-hui.medium.com/map-meanaverage-precision-for-object-detection-45c121a31173 (accessed April 15, 2024).

[6] Alzubaidi L., Zhang J., Humaidi A.J. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. Journal of Big Data, 2021, art. 53. https://doi.org/10.1186/s40537-021-00444-8

[7] Open Neural Network Exchange. Available at: https://onnx.ai/ (accessed April 15, 2024).

[8] Powers D. Evaluation: From precision, recall and f-measure to ROC, Informedness, markedness & correlation. Journal of Machine Learning Technologies, 2011, vol. 2 (1), pp. 37–63.

[9] Redmon J., Divvala S., Girshick R., Farhadi A. You only look once: Unified, real-time object detection. Proc IEEE Conf on Computer Vision and Pattern Recognition (CVPR), 2016, vol. 1, pp. 779–788. https://doi.org/10.48550/arXiv.1506.02640

[10] Ockay M., Harakal M., Liska M. Compute Unified Device Architecture (CUDA) GPU programming model and possible integration to the parallel environment. Science & Military, no. 2, vol. 3, 2008, pp. 64–68.

[11] Roboflow, Jacob Solawetz, Francesco “What is YOLOv8? The Ultimate Guide.” Available at: https://blog.roboflow.com/whats-new-in-yolov8/ (accessed April 15, 2024).

[12] Roboflow, “Quickly Label Training Data and Export to Any Format.” Available at: https://roboflow.com/annotate (accessed April 15, 2024).