Add MobileNetV3 architecture for Segmentation by datumbox · Pull Request #3276 · pytorch/vision

datumbox · 2021-01-22T15:24:44Z

Adding MobileNetV3 models for Semantic Segmentation (resolution 520):

Lite R-ASPP with Dilated MobileNetV3 Large Backbone

Heavily optimized for speed. Good for actual mobile usage.

Weight checkpoint:

PR3276/3rd_training/35354083/model_28.pth

Validate:

python -m torch.distributed.launch --nproc_per_node=2 --use_env train.py --dataset coco\
   --model lraspp_mobilenet_v3_large --test-only --pretrained

Accuracy metrics:

0: global correct: 91.2
0: average row correct: ['94.5', '84.3', '69.5', '72.8', '57.7', '42.0', '77.0', '57.0', '90.4', '36.1', '76.0', '60.8', '81.4', '78.9', '81.0', '87.6', '51.3', '83.9', '62.2', '84.2', '56.1']
0: IoU: ['90.2', '69.2', '57.7', '58.5', '47.8', '35.7', '69.5', '47.1', '79.1', '29.6', '62.6', '34.2', '65.5', '63.4', '70.0', '76.8', '30.1', '61.9', '46.8', '70.6', '49.1']
0: mean IoU: 57.9

Speed Benchmark: 0.3278 sec per image on CPU

DeepLabV3 with Dilated MobileNetV3 Large Backbone

Offers good balance between speed and accuracy, significantly faster than the FCN model with a resnet50 backbone without sacrificing too much accuracy.

Weight checkpoint:

PR3276/3rd_training/35354080/model_28.pth

Validate:

python -m torch.distributed.launch --nproc_per_node=2 --use_env train.py --dataset coco\
   --model deeplabv3_mobilenet_v3_large --test-only --pretrained

Accuracy metrics:

0: global correct: 91.2
0: average row correct: ['93.7', '84.9', '73.6', '74.6', '63.6', '50.6', '80.7', '65.1', '91.3', '42.2', '80.4', '70.6', '82.4', '81.8', '83.7', '88.5', '52.6', '87.8', '65.9', '88.3', '63.3']
0: IoU: ['90.1', '69.7', '58.2', '61.3', '49.7', '37.6', '72.7', '52.7', '79.1', '32.2', '64.6', '36.2', '66.7', '67.4', '70.1', '77.3', '33.1', '67.8', '51.1', '73.3', '54.4']
0: mean IoU: 60.3

Speed Benchmark: 0.5869 sec per image on CPU

fmassa

Looks great, thanks!

I only have a couple of minor (non-blocking) comments. The only thing I would really like to see fixed before merge is to have the correct python -m torch.distributed.launch ... commands for reproducibility.

references/segmentation/README.md

torchvision/models/segmentation/segmentation.py

fmassa · 2021-01-27T13:00:01Z

torchvision/models/mobilenetv3.py

        self.block = nn.Sequential(*layers)
        self.out_channels = cnf.out_channels
-        self.is_strided = cnf.stride > 1
+        self._is_cn = cnf.stride > 1


out of curiosity, what does cn mean in here

It's from the C0,C1...C5,Cn names used in Object Detection. I use this feature internally to find out where the downsampling was supposed to happen but it's not always done with strides so I had to rename it. If you have any better name for it, happy to change it. I could not think of any...

thanks for the explanation. Given that this is private I'm fine with this name

fmassa · 2021-01-27T13:05:30Z

torchvision/models/mobilenetv3.py

-            backbone for Detection and Segmentation.
    """
+    # non-public config parameters
+    reduce_divider = 2 if kwargs.pop('_reduced_tail', False) else 1


Is this feature used in any of the models? Otherwise we can just remove it

This is a unique implementation detail from the paper on MobileNetV3 models and it's supposed to produce a further speed optimization on object detection and segmentation. In our training scripts we don't use it because we do transfer learning from ImageNet but if someone really wants to train it from scratch and go smaller I provide a way to do it.

On current master this is public (see reduced_tail param) but here I decide to hide before the release and make it an internal implementation detail for future models. Not quite convinced we will use it but want to provide an implementation very close to the paper.

Personally I would prefer to keep it hidden for now and decide later whether we want this gone. Let me know.

Sounds good, I'm ok keeping this private for now and maybe removing it from the future.

Summary: * Making _segm_resnet() generic and reusable. * Adding fcn and deeplabv3 directly on mobilenetv3 backbone. * Adding tests for segmentation models. * Rename is_strided with _is_cn. * Add dilation support on MobileNetV3 for Segmentation. * Add Lite R-ASPP with MobileNetV3 backbone. * Add pretrained model weights. * Removing model fcn_mobilenet_v3_large. * Adding docs and imports. * Fixing typo and readme. Reviewed By: datumbox Differential Revision: D26156380 fbshipit-source-id: e62528b52728804a40da79c1311562a7f1c2afbd

Making _segm_resnet() generic and reusable.

77da44c

facebook-github-bot added the cla signed label Jan 22, 2021

Adding fcn and deeplabv3 directly on mobilenetv3 backbone.

462d59a

datumbox force-pushed the mobilenetv3_segmentation branch from 8061535 to 462d59a Compare January 22, 2021 17:25

datumbox mentioned this pull request Jan 22, 2021

TorchVision Roadmap - 2021 H1 #3221

Closed

13 tasks

datumbox and others added 5 commits January 25, 2021 11:01

Merge branch 'master' into mobilenetv3_segmentation

56a28b0

Adding tests for segmentation models.

482d3bd

Rename is_strided with _is_cn.

f64bfed

Add dilation support on MobileNetV3 for Segmentation.

10a51cf

Add Lite R-ASPP with MobileNetV3 backbone.

359d941

datumbox force-pushed the mobilenetv3_segmentation branch from 231a525 to 359d941 Compare January 25, 2021 21:19

Add pretrained model weights.

406fa47

datumbox force-pushed the mobilenetv3_segmentation branch from 610f13f to 406fa47 Compare January 27, 2021 12:08

datumbox added 2 commits January 27, 2021 12:19

Removing model fcn_mobilenet_v3_large.

1641d5f

Adding docs and imports.

75170d5

datumbox changed the title ~~[WIP] Add MobileNetV3 architecture for Segmentation~~ Add MobileNetV3 architecture for Segmentation Jan 27, 2021

datumbox requested a review from fmassa January 27, 2021 12:46

fmassa approved these changes Jan 27, 2021

View reviewed changes

datumbox and others added 2 commits January 27, 2021 13:36

Fixing typo and readme.

76d537b

Merge branch 'master' into mobilenetv3_segmentation

ad71b4e

datumbox merged commit e2db2ed into pytorch:master Jan 27, 2021

datumbox deleted the mobilenetv3_segmentation branch January 27, 2021 14:09

fmassa mentioned this pull request Feb 8, 2021

Pretrained MobileNet-V2 Backbones for Segmentation Tasks #2700

Closed

oke-aditya mentioned this pull request Apr 24, 2021

Are new models planned to be added? #2707

Closed

37 tasks

datumbox mentioned this pull request May 13, 2021

Making protected params of MobileNetV3 public #3828

Merged

sudotensor mentioned this pull request Sep 11, 2023

Misconfiguration of MobileNetV3-Large backbone for DeepLabV3 #7956

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MobileNetV3 architecture for Segmentation#3276

Add MobileNetV3 architecture for Segmentation#3276
datumbox merged 12 commits intopytorch:masterfrom
datumbox:mobilenetv3_segmentation

datumbox commented Jan 22, 2021 •

edited

Loading

Uh oh!

fmassa left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fmassa Jan 27, 2021

Uh oh!

datumbox Jan 27, 2021

Uh oh!

fmassa Jan 27, 2021

Uh oh!

fmassa Jan 27, 2021

Uh oh!

datumbox Jan 27, 2021

Uh oh!

fmassa Jan 27, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

datumbox commented Jan 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Lite R-ASPP with Dilated MobileNetV3 Large Backbone

DeepLabV3 with Dilated MobileNetV3 Large Backbone

Uh oh!

fmassa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fmassa Jan 27, 2021

Choose a reason for hiding this comment

Uh oh!

datumbox Jan 27, 2021

Choose a reason for hiding this comment

Uh oh!

fmassa Jan 27, 2021

Choose a reason for hiding this comment

Uh oh!

fmassa Jan 27, 2021

Choose a reason for hiding this comment

Uh oh!

datumbox Jan 27, 2021

Choose a reason for hiding this comment

Uh oh!

fmassa Jan 27, 2021

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

datumbox commented Jan 22, 2021 •

edited

Loading