Object detection models review

1 minute read

This week was dedicated to read papers and complete the state of the art part of the paper. Papers were related to a general review of object detection models and the most important model’s papers

Object detection model review

Object detection generic goal is finding objects in images and locating them. A general idea is that having an image, the algorithm retrieves located bounding boxes with the object class for each object found in the image. A detailed explanation can be found in [1]. The broad variety of models based on deep learning can be divided in two main groups. The models which are based on a two-step approach and the one-step models.

The two step approach generates region proposals in the first step and then in the second one classifies them.In this group, we can highlight Faster R-CNN [2]. It is divided in three stages: anchor generation, region proposal network and classifier and bounding box regressor.

he one step approach is based on regression and directly outputs the categories and locations for the objects in one step. Two important approaches in this group are Single Shot MultiBox Detector [3] and You Only Look Once(YOLO) [4]. YOLO gained popularity because it obtained a great time performance with good performance metrics, something that was previously difficult to achieve. This architecture has evolved to this days with YOLOv3 [5] and YOLOv4 [6] including several improvements over the first release.

References

[1] Zhong-Qiu Zhao, Peng Zheng, Shou-tao Xu, Xindong Wu, Object Detection with Deep Learning: A Review, 2019

[2] Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, 2016

[3] Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg, SSD: Single Shot MultiBox Detector, 2016

[4] Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, You Only Look Once: Unified, Real-Time Object Detection, 2016

[5] Joseph Redmon, Ali Farhadi, YOLOv3: An Incremental Improvement, 2018

[6] Alexey Bochkovskiy, Chien-Yao Wang, Hong-Yuan Mark Liao, YOLOv4: Optimal Speed and Accuracy of Object Detection, 2020