Label GUI

Label-GUI is a lightweight graphical user interface based on PyQt5 and pyqtgraph for visualizing image or video data and creating semantic segmentation labels using segment anything (SAM) from Meta AI Research.

Instructions

1. Installation

Create a new conda environment called label-gui and activate it

conda create -n label-gui python=3.10
conda activate label-gui

Install Pytorch with cuda (strongly recommended). Here is an example (as of April 2023; it might be different for your set-up):
```
conda install pytorch torchvision pytorch-cuda=11.8 -c pytorch -c nvidia
```
Download the latest release of Label-Gui from GitHub (preferred) or clone the repository. Downloading the latest release is recommended because I might make breaking changes to the main branch.
Navigate into the repository and install the required packages
```
cd label-gui
pip install -r requirements.txt
```
Install Segment Anything (SAM) by following the instructions in their GitHub repo. As of April 2023, this entails:
```
pip install git+https://github.com/facebookresearch/segment-anything.git
```
Download the model weights. The link posted here might be broken so make sure to check their GitHub repo. When you want to annotate segmentation masks, navigate to Dnn -> Load SAM to load the model weights. Currently, supported weights are: sam_vit_b_01ec64, sam_vit_h_4b8939, sam_vit_l_0b3195

Create a models folder under label gui and copy the downloaded model weight under this folder. So the file structure should look like this

+-- imgs
+-- models
|   +-- sam_vit_h_4b8939.pth
+-- src
|   +-- annotation.py
|   +-- app.py
|   +-- ...
+-- .gitignore
+-- readme.md
+-- requirements.txt

Finally, run app.py; it might take a few seconds to load the model weights
```
python src\app.py
```
Note that the model weights directory is relative to the current directory so you need to run the app.py file exactly as above.

2. Using Label-Gui

2.1 Loading and Saving Data

Image data can be loaded two different ways

File -> Load Video File: This will present a folder dialog to select a video file (in mp4 or avi formats).
File -> Load Image Folder: If you have a collection of images under one folder, use this option to load all the images under that folder. Currently, only tiff files are tested but other formats will likely work as well.
File -> Load Annotations: Load annotation data in json format. Make sure to set the Show Mask option to All to view all the mask annotations.
File -> Load Embeddings: Computing the image embeddings for SAM takes long (on my laptop with a GeForce RTX 2070 gpu it takes about 5-6 s per frame). So it is very convenient to Pre-Compute Embeddings (takes about 30 mins for a 300 frame video) and leave it running while I grab some coffee to distract my lab mates with conversations about how AI will take over the world, take our jobs, and make us obselete. After the embeddings are computed they can be saved and subsequently loaded for fast operation.
File -> Save Annotations: Save annotations in json format.
File -> Save Embeddings: Save image embeddings for later use to speed up labelling process.

2.2 Image Viewer

Zoom in/out using the scroll wheel of your mouse.
Click and drag to translate the image
If Annotate setting is On, then
- Left click will include regions (foreground points) and
- Right click will exlude regions (background points) from the segmentation mask
Press Space-bar to accept a mask or Esc to reject it.
If you successfully added a segmentation mask, it will appear on the mask list.

2.3 Histogram and Contrast Controls

Region 3 shown in the screenshot above highlights the image histogram. Use controls there to enhance image contrast to make faint objects easier to see. This is a useful feature for nearly transparent objects in phase contrast images. Increasing contrast operates on a copy of the image data and does not affect mask detection.

2.4 Controls

This is region 4 highlighted in the screenshot above. I will go from start to bottom:

Auto-Detect: This is an experimental feature that aims to make labelling easier but currently does not work great on my data. This will be improved in the future. The idea is to use simple models to generate object proposals and to fine tune them using SAM. Here is a screenshot for the current version:

Implemented models are:
- Sam Automatic Mask Generator: This uses the automatic mask generator shipped with the current version of SAM which queries a regularly spaced grid and the uses multiple methods (thresholding based on confidence, nms, etc) to filter the output.
- Canny Edge Detector: Bounding box proposals are generated using a Canny edge detector, followed by a morphological closing operation and contour detection.
- MOG2 Background Subtractor: Bounding box proposals are generated using MOG2 background subtractor from OpenCV followed by a morphological closing operation and contour detection. This option makes sense only for video data (ie data with temporal correlations).
- Optical Flow (Farneback): Bounding box proposals are generated using OpenCV implementation of optical flow followed. Velocity field magnitude is sent through a morphological closing operation, adaptive thresholding and contour detection to generate bounding box proposals. This option makes sense only for video data as well.
Pre-Compute Embeddings: Image embeddings are computed for SAM. These embeddings can be saved and re-loaded for fast operation. Embeddings are cached in RAM and sent to the GPU when a particular frame is used.
Query Prev Frame Detections: This feature is very useful for labelling videos where objects move slowly. It will use previous frame masks as proposals for the current frame. The proposals are finetuned using SAM. I am planning to incorporate a simple tracker to model fast moving objects as well.
Annotate: If you set this option to On, a cursor will appear you can start annotating the image.
Mask Scale: Turning Annotate setting On will enable the mask scale option. SAM outputs 3 different masks each corresponding to a different scale (0: small scale, 1: medium scale, 2: large scale).
Show Mask: All will show all the annotated masks whereas Last will show the last one. Showing the last mask during annotation is highly recommended since SAM might occasionally pick neighboring objects as well. If all labels are on, they might be difficult to spot. If that happens, use right click to exclude regions.
Mask list region: As you add masks, they will appear on this list. You can click on them to view their corresponding masks on the image viewer (make sure Last option is selected). You can use delete key to remove a mask from the list.
Add class label region: Add a class label by typing its name in the textbox and clicking on the Add button. If you want to assign a particular label to a mask: i) select the class and the mask by clicking on them and ii) press the Shift key.
Prev/Next: Use these buttons to navigate to different buttons. You can alternatively enter the frame ID.

2.5 Summary

Left click to add a foreground point
Right click to add a background point
Space bar to add a segmentation mask to the list
Esc to reject a segmentation mask
Delete to delete a segmentation label
Shift to assign a class label to a mask
Left/Right Arrow keys to navigate to different frames
Scroll Wheel to zoom in/out in the image viewer.

3. TODO

Load arbitrary SAM models as opposed to a hard coded directory
Improve auto-detect
Incorporate a multi-object tracker for video data
Enable changes to the class label list

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Label GUI

Instructions

Table of Contents

1. Installation

2. Using Label-Gui

2.1 Loading and Saving Data

2.2 Image Viewer

2.3 Histogram and Contrast Controls

2.4 Controls

2.5 Summary

3. TODO

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
imgs		imgs
src		src
.gitignore		.gitignore
LICENSE		LICENSE
readme.md		readme.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Label GUI

Instructions

Table of Contents

1. Installation

2. Using Label-Gui

2.1 Loading and Saving Data

2.2 Image Viewer

2.3 Histogram and Contrast Controls

2.4 Controls

2.5 Summary

3. TODO

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages