What is: EfficientNetV2?
Source | EfficientNetV2: Smaller Models and Faster Training |
Year | 2021 |
Data Source | CC BY-SA - https://paperswithcode.com |
EfficientNetV2 is a type of convolutional neural network that trains faster and is more parameter-efficient than previous models. To develop these models, the authors combine training-aware neural architecture search with scaling to jointly optimize training speed and parameter efficiency. The models were searched from a search space enriched with new ops such as Fused-MBConv.
Architecturally, the main differences from the original EfficientNet are:
- EfficientNetV2 extensively uses both MBConv and the newly added Fused-MBConv in the early layers.
- EfficientNetV2 prefers smaller expansion ratios for MBConv, since smaller expansion ratios tend to have less memory access overhead.
- EfficientNetV2 prefers smaller 3x3 kernel sizes, but it adds more layers to compensate for the reduced receptive field resulting from the smaller kernel size.
- EfficientNetV2 completely removes the last stride-1 stage in the original EfficientNet, perhaps due to its large parameter size and memory access overhead.
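
To make the block-level differences above more concrete, here is a rough sketch in plain Python. It assumes the usual block layouts (MBConv: 1x1 expansion, depthwise kxk convolution, 1x1 projection; Fused-MBConv: a single regular kxk convolution in place of the expansion and depthwise steps, then a 1x1 projection) and counts convolution weights only, ignoring biases, normalization, and squeeze-and-excitation. The function names and the channel sizes in the usage lines are illustrative, not taken from the paper or any library.

```python
def mbconv_params(c_in, c_out, expand, k=3):
    """Conv weights in an MBConv block: 1x1 expand -> kxk depthwise -> 1x1 project."""
    c_mid = c_in * expand
    expand_1x1 = c_in * c_mid      # pointwise expansion
    depthwise = k * k * c_mid      # one kxk filter per expanded channel
    project_1x1 = c_mid * c_out    # pointwise projection
    return expand_1x1 + depthwise + project_1x1


def fused_mbconv_params(c_in, c_out, expand, k=3):
    """Conv weights in a Fused-MBConv block: regular kxk conv -> 1x1 project."""
    c_mid = c_in * expand
    fused_kxk = k * k * c_in * c_mid   # replaces the expand + depthwise pair
    project_1x1 = c_mid * c_out
    return fused_kxk + project_1x1


def receptive_field(k, n_layers):
    """Receptive field of n stacked stride-1 kxk convolutions."""
    return 1 + n_layers * (k - 1)


# Illustrative sizes only (assumed): 24 channels in and out.
print(mbconv_params(24, 24, expand=4))               # 5472
print(fused_mbconv_params(24, 24, expand=4))         # 23040
print(mbconv_params(24, 24, expand=6))               # 8208
print(receptive_field(3, 2), receptive_field(5, 1))  # 5 5
```

Under these assumptions, the fused block carries more weights than MBConv at the same width, which fits with its being used only in the early, narrow stages. A smaller expansion ratio shrinks the intermediate tensor, and two stacked 3x3 layers cover the same 5x5 window as a single 5x5 layer, which is the sense in which extra layers compensate for the smaller kernel.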