Learning-AI

DA-3Det: Monocular 3D Object Detection via Feature Domain Adaptation

August 2020

tl;dr: Use Domain Adaptation to bridge the gap between pseudo-lidar and real lidar.

Overall impression

DA-3Det uses a Siamese network and takes in real lidar and pseudo-lidar data. The difference between the features are penalized. This way DA-3Det learns a general feature based on pseudo-lidar.

Similar ideas to bridge the gap between real and pseudo-lidar has been witnessed in RefinedMPL, which proposes a way to downsample the dense lidar points to mimic the sparsity of point cloud.

Key ideas

The paper also uses the Frustum PointNet version of pseudo-lidar due to its simplicity in dealing with point cloud.
Siamese network with domain adaptation loss (L2 between features).
- During training process, real-lidar data is also utilized for feature domain adaptation. Only a single image is required during the inference stage.
Context aware segmentation module: this is simply a pretrained segmentation module that is finetuned online.
- Pretraining improves performance as compared to unsupervised training with random initialization.
Domain adaptation is a useful technique that can be applied to mono –> stereo and stereo –> lidar.

Technical details

Random sampling of lidar point for each object. For object containing smaller numbers of lidar points, sample with replacement (duplication).

Notes

Questions and notes on how to improve/revise the current work