November 2019
tl;dr: Propose a new metric and loss function based on IoU for object detection.
Overall impression
The optimal objective for a metric is the metric itself.
How to make a metric differentiable and use it as a loss seems to be the trend. This is quite popular in monocular 3D object detection to use 3D IoU as loss.
Dice loss has been used in medical imaging applications for some time now, but it has the issue of zero gradient when overlap is zero.
This seems quite similar to the signed IoU in monoDIS.
Key ideas
- Problem with commonly used l1 or l2 loss for object detection
- the minimization of loss does not directly correlates with IoU gain.
- (x, y) and (w, h) does not live in the same space, and thus log transformation is needed
- IoU loss is also scale-invariant (like Dice loss)
Technical details
- Compare with sIoU (signed IoU) in monoDIS
- gIoU
- The sIoU
Notes
- VNet in 2016 was the first to propose Dice Loss in image segmentation. Lovasz Softmax (CVPR 2018) is a high-performing surrogate for IoU loss, but is also used for segmentation. Unit Box is the first attempt to use IoU loss in object detection.