Table 2. Hyper-parameters used to train the Mask R-CNN model and the resulting mean average precision (mAP)

Image size       Backbone   Mask loss   Epoch   mAP
512 × 512        resnet50   1.0         8       0.616
512 × 512        resnet50   1.0         12      0.673
1,024 × 1,024    resnet50   1.0         8       0.837
1,024 × 1,024    resnet50   1.0         12      0.609
1,024 × 1,024    resnet50   2.0         7       0.777
1,024 × 1,024    resnet50   2.0         12      0.836
Mask R-CNN: mask region-based convolutional neural network.