January 2020
tl;dr: Paint radar as a vertical line and fuse it with radar.
This is one of the first few papers that investigate radar/camera fusion on nuscenes dataset. It is inspired by Distant object detection with camera and radar.
Both CRF-Net and Distant radar object transforms the unstructured radar pins to pseudo-image and then process it with camera. Alternative approach to unstructured radar pins (point cloud) is to use PointNet, but PointNet is usually the best in classification or semantic segmentation when the RoI is extracted. Pseudo-image method is used in many other works, PointPillars for lidar data, and many work that incorporate intrinsics.
Embedding meta data info into conv:
In heavy rain or fog, the visibility is reduced, and safe driving might not be guaranteed. In addition, camera sensors get increasingly affected by noise in sparsely lit conditions. The camera can also be rendered unusable if water droplets stick to the camera lens.
Filtering out stationary radar object is common, but this may filter out cars under traffic light or bridges.