Improving object recognition accuracy on smart edge devices


  • Le Chi Luan
  • To Hai Thien




Keywords: DL model, edge device, real-time detection, object detection

Abstract

Object recognition is one of the main topics in AI. Many AI models achieve high accuracy when run on high-end hardware. However, smart edge devices (SEDs) are now widely used in many fields because they are compact and flexible and help comply with personal data policies. Their limitation is hardware that only runs or supports 8-bit, 16-bit, or 32-bit models. Running a model on an SED therefore requires a conversion step ("quantization"), which significantly reduces the accuracy of recognition models. In this paper, we propose a solution called "GreedyPlus": it captures high-resolution frames (skipping blurred frames) and searches for small objects by cutting each frame into small windows. It then zooms in on each window and identifies the objects in it. A final step determines the exact number of objects in the frame. The method is simple but highly effective: it significantly improves the model's recognition results without retraining the model on a new dataset. The results are tested and demonstrated on the KITTI, CrownAI, and Autti datasets.
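The steps named in the abstract (skip blurred frames, cut the frame into windows, detect per window, then count the objects) can be illustrated with a minimal sketch. This is not the authors' GreedyPlus implementation, which the abstract does not specify. Everything here is an assumption made for illustration: the variance-of-Laplacian blur score, the window size, stride, and thresholds, the use of greedy non-maximum suppression to merge duplicate boxes, and the hypothetical `detector` callable, which stands in for a quantized model that would resize (zoom) each crop to its input size and return boxes in crop coordinates.

```python
import numpy as np

def laplacian_variance(gray):
    """Sharpness score: variance of a 4-neighbour Laplacian (low = blurry)."""
    g = gray.astype(np.float64)
    lap = (g[:-2, 1:-1] + g[2:, 1:-1] + g[1:-1, :-2] + g[1:-1, 2:]
           - 4.0 * g[1:-1, 1:-1])
    return float(lap.var())

def tile_windows(height, width, win, stride):
    """Return (x0, y0, x1, y1) windows that together cover the frame."""
    def starts(size):
        if size <= win:
            return [0]
        s = list(range(0, size - win + 1, stride))
        if s[-1] != size - win:
            s.append(size - win)  # last window sits flush with the border
        return s
    return [(x, y, min(x + win, width), min(y + win, height))
            for y in starts(height) for x in starts(width)]

def iou(a, b):
    """Intersection over union of two (x0, y0, x1, y1, ...) boxes."""
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix1 - ix0) * max(0.0, iy1 - iy0)
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0

def nms(dets, iou_thr=0.5):
    """Greedy non-maximum suppression over (x0, y0, x1, y1, score) tuples."""
    kept = []
    for d in sorted(dets, key=lambda d: d[4], reverse=True):
        if all(iou(d, k) < iou_thr for k in kept):
            kept.append(d)
    return kept

def detect_tiled(frame, detector, win=320, stride=240, blur_thr=50.0):
    """Skip blurry frames, detect per window, merge duplicates.

    `detector` is a hypothetical callable: crop -> list of
    (x0, y0, x1, y1, score) in crop coordinates.
    Returns None for a skipped frame, else the merged detections.
    """
    gray = frame if frame.ndim == 2 else frame.mean(axis=2)
    if laplacian_variance(gray) < blur_thr:
        return None  # assumed blur rule: too little high-frequency detail
    h, w = gray.shape
    dets = []
    for x0, y0, x1, y1 in tile_windows(h, w, win, stride):
        for bx0, by0, bx1, by1, score in detector(frame[y0:y1, x0:x1]):
            # Shift the box from crop coordinates back to frame coordinates.
            dets.append((bx0 + x0, by0 + y0, bx1 + x0, by1 + y0, score))
    return nms(dets)
```

With overlapping windows, the same object can be reported several times; the object count for a frame is then `len(detect_tiled(frame, detector))` after merging. Overlap (a stride smaller than the window) is a design choice in this sketch so that an object cut by one window border is still fully visible in a neighbouring window.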









How to Cite

Luận, L. C., & Thiên, T. H. (2023). Tăng cường độ chính xác trong việc nhận diện đối tượng trên các thiết bị cạnh thông minh [Improving object recognition accuracy on smart edge devices]. Journal of Science and Technology on Information Security, 2(19), 29-38. https://doi.org/10.54654/isj.v2i19.948