Improving Object Detection Using Enhanced Efficientnet Architecture

Michael Kamel Ibrahim, Purdue University

Abstract

EfficientNet is designed to achieve top accuracy while utilitzing fewer parameters, in addition to less computational resources compared to previous models.In this paper, we are presenting compound scaling method that re-weight the network’s width (w), depth(d), and resolution (r), which leads to better performance than traditional methods that scale only one or two of these dimensions by adjusting the hyperparameters of the model. Additionally, we are presenting an enhanced EfficientNet Backbone architecture.We show that EfficientNet achieves top accuracy on the ImageNet dataset, while being up to 8.4x smaller and up to 6.1x faster than previous top performing models. The effectiveness demonstrated in EfficientNet on transfer learning and object detection tasks, where it achieves higher accuracy with fewer parameters and less computation. Henceforward, the proposed enhanced architecture will be discussed in detail and compared to the original architecture.Our approach provides a scalable and efficient solution for both academic research and practical applications, where resource constraints are often a limiting factor.

Degree

M.S.

Advisors

El-Sharkawy, Purdue University.

Subject Area

Medical imaging|Artificial intelligence|Computer science|Information Technology|Internet and social media studies|Logic|Transportation

Off-Campus Purdue Users:
To access this dissertation, please log in to our
proxy server
.

Share

COinS