- Dataset: KITTI 3D Dataset
- Weights available at :Link to Weights
- password:4dnetpytorch
- This repo is an attempt at implementing 4D Net https://arxiv.org/abs/2109.01066
- NO TEMPORAL ELEMENT (RGB+LiDAR only, no Time)
- This repo serves only as a tutorial for myself
- I may have missed out some stuff from the paper
- Feel free to download this repo and implement the temporal elements yourself
- Evaluation Code from https://github.com/jacoblambert/3d_lidar_detection_evaluation

- The Model consist of a PointNet Processing model, an RGB Processing Model, PseudoImage Scattering Layer and a Efficient-Det style Single Shot Detector as object detection head
- During Training, the Pseudo Images will look like this in Tensorboard and important objects should get more pronounced
- For matching the targets to predicted outputs, i used a hungarian matcher used in DETR/Deformable-DETR
- Half of the effort here is to let the dataset grab the relavant RGB feature coordinates - These coordinates are used to grab the CNN features from the RGB Image to create a sepearte Pseudo Image - This is then concatenated with the LiDAR Point Pillars Pseudo Image later

- K-Means analysis of ground truth boxes are used
- Look at Stats.ipynb
- Build the docker container
docker buildx build -f ./docker/dockerfile -t 4dnet:0.1 .- Once in the container environment, train using the command:
python -m train /mnt/datasets/kitti_dataset/training --epochs 100 --tensorboard_logs ./tensorboard_logs/training

