The current framework uses ByteTrack to track individual salmon for counting.
The following steps are for Ubuntu 20.04:
Clone our version of the ByteTrack repo:
git clone https://github.com/Salmon-Computer-Vision/ByteTrack.git
cd ByteTrack
Follow either the Docker install or the host machine install in the ByteTrack documentation to install all the requirements to run ByteTrack.
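For the Docker route, a minimal sketch might look like the following (the image tag and mount path are illustrative; follow the ByteTrack documentation for the authoritative build and run steps):
docker build -t bytetrack:latest .
docker run --gpus all -it -v $PWD:/workspace/ByteTrack bytetrack:latest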
Download the bytetrack_salmon.tar.gz dataset from the Dataset
section or convert the dataset to the MOT sequences format and use the script
in the ByteTrack repo to convert them to the COCO format.
Extract it and put the salmon folder in the datasets folder in ByteTrack if it is not there already:
tar xzvf bytetrack_salmon.tar.gz
Download the pretrained YOLOX nano model from their model zoo.
Place the pretrained model in the pretrained folder. The path should be
pretrained/yolox_nano.pth.
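For example, assuming yolox_nano.pth was downloaded to the current directory (adjust the source path to wherever you saved it):
mkdir -p pretrained
mv yolox_nano.pth pretrained/yolox_nano.pth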
Run the training either inside the docker container or on the host machine:
python3 tools/train.py -f exps/example/mot/yolox_nano_salmon.py -d 4 -b 256 --fp16 -o -c pretrained/yolox_nano.pth
If you canceled the training in the middle, you can resume from a checkpoint with the following command:
python3 tools/train.py -f exps/example/mot/yolox_nano_salmon.py -d 4 -b 256 --fp16 -o --resume
Lower -b (batch size) if running on a GPU with less memory.
Once finished, the final outputs will be in YOLOX_outputs/yolox_nano_salmon/, where best_ckpt.pth.tar is the checkpoint with the highest validation mAP score.
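For example, to track a video with the checkpoint you just trained (a sketch reusing the demo command shown below, with the salmon checkpoint swapped in for the pretrained weights):
python3 tools/demo_track.py video -f exps/example/mot/yolox_nano_salmon.py -c YOLOX_outputs/yolox_nano_salmon/best_ckpt.pth.tar --path path/to/video.mp4 --fp16 --fuse --save_result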
To run inference with the model on a video:
python3 tools/demo_track.py video -f exps/example/mot/yolox_nano_salmon.py -c pretrained/bytetrack_x_mot17.pth.tar --path path/to/video.mp4 --fp16 --fuse --save_result
demo_track.py also supports other input options such as camera and images. Run the following to check them all:
python3 tools/demo_track.py -h
The following describes YOLOv6; however, the steps and format are similar for the other versions.
Clone the YOLOv6 repo:
git clone https://github.com/meituan/YOLOv6.git
Install Python3 requirements:
cd YOLOv6
pip3 install -r requirements.txt
Download the yolov6_salmon.tar.gz dataset from the Dataset
section or convert the dataset to the YOLO format following the instructions in
the YOLOv6 repo.
Extract the dataset:
tar xzvf yolov6_salmon.tar.gz
Download the combined_bear-kitwanga.yaml file from the Dataset section and place it in the data folder; it describes the location of the dataset and the class labels. Please edit the yaml to point to where you extracted the dataset.
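For illustration, a YOLOv6 data yaml generally has the shape sketched below; the paths and class list here are placeholders, so keep the class names from the downloaded file and only change the paths to match where you extracted the dataset:
train: ../yolov6_salmon/images/train
val: ../yolov6_salmon/images/val
test: ../yolov6_salmon/images/test
is_coco: False
nc: 1
names: ["salmon"]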
Run the training using multi-GPUs:
python -m torch.distributed.launch --nproc_per_node 4 tools/train.py --epoch 100 --batch 512 --conf configs/yolov6n_finetune.py --eval-interval 2 --data data/combined_bear-kitwanga.yaml --device 0,1,2,3
Lower --batch size appropriately if running on GPUs with less memory.
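If you only have a single GPU, a non-distributed run is a reasonable sketch (reusing the same flags, with an illustrative smaller batch size):
python tools/train.py --epoch 100 --batch 64 --conf configs/yolov6n_finetune.py --eval-interval 2 --data data/combined_bear-kitwanga.yaml --device 0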
The final outputs will be in runs/train/exp<X>, where <X> is the number of
the run.
To run inference with YOLOv6:
python3 tools/infer.py \
--yaml data/combined_bear-kitwanga.yaml \
--weights runs/train/exp${X}/weights/best_ckpt.pt \
--source "$vid" \
--save-txt \
--device $device
$device describes the GPU device number. If you only have one, $device = 0.
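For example, assuming a single GPU and a video at path/to/video.mp4 (both illustrative), you would set:
vid=path/to/video.mp4
device=0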
The resulting output will be in the runs/inference folder.
Check the YOLOv6 README for further inference commands or check python3 tools/infer.py -h.
We followed this TF2 Detection API tutorial extensively.
We can run the MOT inference model to detect fish in frames.
Please read this document for further details.
To train the model, we need to run several scripts to prepare the training data. Please follow the sections below to run each of the scripts.
We need to run the make_cvat_tasks.sh script in the utils folder to automatically create CVAT tasks on our CVAT server.
Please read this document for further details about running the make_cvat_tasks.sh script.
After running make_cvat_tasks.sh, we will have tasks that are already labelled on the CVAT server.
Now, we want to dump those tasks to our local computer.
Please follow this document to run the dump_cvat.sh script.
The download_frames.sh script uses the CVAT plugin to download the frames specified by xpath_filt (or simply all annotated frames) and renames them to differentiate the task IDs for easy merging.
Please follow this document to run the download_frames.sh script.
The merge_filt.sh script merges the annotations dumped by the previous script and splits them into a training set, a validation set, and a test set.
Please follow this document to run the merge_filt.sh script.
After we have the merged result, we want to export the merged folder produced by the merge_filt.sh script to the MOT Seq GT format:
datum export -p merged -o merged_mot_gt -f mot_seq_gt -- --save-images
After we have the MOT Seq GT format annotations, we want to convert them to the JDE format and split the data into training, validation, and test sets.
Please follow this document to run the convert_gt_jde.py script.
After running the above scripts, we have created CVAT tasks and converted the annotations to JDE format.
When we have the JDE annotations ready, we can start training our model. We need to clone the Towards-Realtime-MOT repository onto our computer.
Note that we want our salmon version of the MOT repository. Therefore, please clone from the following GitHub repository and then switch to the Salmon branch:
Salmon-Computer-Vision/Towards-Realtime-MOT
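For example (assuming the standard GitHub URL for the repository named above):
git clone https://github.com/Salmon-Computer-Vision/Towards-Realtime-MOT.git
cd Towards-Realtime-MOT
git checkout Salmon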
Once we have the JDE annotations generated from the previous command, please move the following three files to the parent folder of the merged_mot_gt folder:
- salmon.train
- salmon.val
- salmon.test
For example:
mv ~/salmon-computer-vision/utils/merged_mot_gt/salmon.train ~/salmon-computer-vision/utils
This ensures the image paths in the three salmon files match the actual image paths.
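To move all three files in one command (same paths as the example above):
mv ~/salmon-computer-vision/utils/merged_mot_gt/salmon.{train,val,test} ~/salmon-computer-vision/utils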
Then, we need to modify ~/Towards-Realtime-MOT/cfg/ccmcpe.json to point to the training data. An example of ccmcpe.json is:
{
    "root": "/home/ycchou/salmon-computer-vision/utils/",
    "train":
    {
        "fish": "/home/ycchou/salmon-computer-vision/utils/salmon.train"
    },
    "test_emb":
    {
        "fish": "/home/ycchou/salmon-computer-vision/utils/salmon.val"
    },
    "test":
    {
        "fish": "/home/ycchou/salmon-computer-vision/utils/salmon.val"
    }
}
Make sure you have installed CUDA on your system. If you did not install CUDA, you will likely encounter a "Found no NVIDIA driver" error. Please check the troubleshooting section to install CUDA for your system.
Please run the train.py script to start training the model. Reduce the batch size if your GPU memory is not enough; you will likely encounter RuntimeError: CUDA out of memory if your batch size is too high.
python train.py --batch-size 6 --img-size 576 320
After the training process is finished, we can find the trained weights at:
Towards-Realtime-MOT/weights/{date_folder}/latest.pt
RuntimeError: Found no NVIDIA driver on your system. Please check that you have an NVIDIA GPU and installed a driver from http://www.nvidia.com/Download/index.aspx
Make sure you have installed CUDA on your system. Please check this page to install CUDA on your system.
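You can quickly check whether a driver is visible with:
nvidia-smi
If this command fails, the driver or CUDA installation is the problem.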
E: Unable to locate package cuda-toolkit-11-0
It is likely that the apt-key / cuda.list entry does not match your system version. For example:
apt-key adv --fetch-keys http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/7fa2af80.pub
If you are using Ubuntu 20.04, please replace ubuntu1804 with ubuntu2004.
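For example, on Ubuntu 20.04 the command becomes:
apt-key adv --fetch-keys http://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/7fa2af80.pub
Note that NVIDIA occasionally rotates its repository signing keys, so check their repository listing if this particular key file is rejected.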
Also, make sure you install the right version of the toolkit for your system. Check this page to find the correct toolkit version:
Installation Guide Linux :: CUDA Toolkit Documentation
It is likely that the batch size you specified in the train command is too high. Please reduce the batch size and try again.
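For example, halving the batch size from the training command above:
python train.py --batch-size 3 --img-size 576 320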