Learning-AI

gIoU: Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

November 2019

tl;dr: Propose a new metric and loss function based on IoU for object detection.

Overall impression

The optimal objective for a metric is the metric itself.

How to make a metric differentiable and use it as a loss seems to be the trend. This is quite popular in monocular 3D object detection to use 3D IoU as loss.

Dice loss has been used in medical imaging applications for some time now, but it has the issue of zero gradient when overlap is zero.

This seems quite similar to the signed IoU in monoDIS.

Key ideas

Problem with commonly used l1 or l2 loss for object detection
- the minimization of loss does not directly correlates with IoU gain.
- (x, y) and (w, h) does not live in the same space, and thus log transformation is needed
IoU loss is also scale-invariant (like Dice loss)

Technical details

Compare with sIoU (signed IoU) in monoDIS
gIoU
The sIoU

Notes

VNet in 2016 was the first to propose Dice Loss in image segmentation. Lovasz Softmax (CVPR 2018) is a high-performing surrogate for IoU loss, but is also used for segmentation. Unit Box is the first attempt to use IoU loss in object detection.