This dataset, collected by Roman Kucev from TrainingData.ru, contains 1244 images of hot and cold water meters as well as their readings and coordinates of the displays showing those readings. Each image contains exactly one water meter. The archive also includes the pictures of the results of segmentation with the masks and collages. Toloka was used for photo capturing, segmentation, and recognizing the readings.
The dataset is from Toloka
The goal is to build a water meter reader
Label Studio was used for annotating the readings.
Yolov5 was used for object detection
I annotated around 400 images, but you could do with less provided you increase the epochs. Initially I only used 50 epochs which was inadequate as the image quality is poor. In the end, I used 300 epochs which took about 2 hours to train. The mAP@0.5 was around 55%, and the overall accuracy on the validation set was around 65%. Colab was used for training and inference.
This could be further improved by doing the following:
- Add more annotated images
- Include the rotation angle of the bounding boxes
- Train for > 300 epochs
