Fix LFW format by sizov-kirill · Pull Request #19 · cvat-ai/datumaro

sizov-kirill · 2022-12-01T19:43:02Z

Summary

How to test

Checklist

I submit my changes into the develop branch
I have added description of my changes into CHANGELOG
I have updated the documentation accordingly
I have added tests to cover my changes
I have linked related issues

License

I submit my code changes under the same MIT License that covers the project.
Feel free to contact the maintainers if that's a concern.
I have updated the license header for each file (see an example below)

# Copyright (C) 2021 Intel Corporation
#
# SPDX-License-Identifier: MIT

yasakova-anastasia · 2022-12-02T16:37:24Z

We don't get any annotations when we import if we only export points. This is true? It's correct in terms of format, but I'm not sure that it's good for CVAT. Maybe generalize this format? We can save both points and labels if we only have points. That is, when importing such a dataset, we will add extra labels, but the points will not disappear.

sizov-kirill · 2022-12-05T10:51:29Z

We don't get any annotations when we import if we only export points. This is true? It's correct in terms of format, but I'm not sure that it's good for CVAT. Maybe generalize this format? We can save both points and labels if we only have points. That is, when importing such a dataset, we will add extra labels, but the points will not disappear.

Yes, it's true. If we have only points annotations exported dataset in LFW format will be empty.

I didn't choose the solution you suggested for the following reasons:

If user annotated one image with few Point objects that have different labels it's not clear how to represent such information in LFW format, because in LFW landmarks are described with the following format <img> <points> so we don't have any opportunity to set corresponding between specific point and specific label
If during export we try to add Tags that aren't represented in original annotations it's not clear which labels we should to add. Probably if we have points on image that all have one label it's not problem. But if we have points with different label on the same image it's not clear which label we should chose for Label object. Of course we can add multiply Labels for one image , but format doesn't assume such usage.

But I understand that this is not an obvious question and it is difficult to find only one "correct" solution.

@zhiltsov-max How do you think?

yasakova-anastasia · 2022-12-06T09:31:47Z

Could you please update the Changelog?

I also think the documentation needs to be updated.

zhiltsov-max · 2023-01-03T09:15:24Z

tests/test_lfw_format.py

+        source_dataset = Dataset.from_iterable(
+            [
+                DatasetItem(
+                    id="name0_0001",
+                    subset="test",
+                    media=Image(data=np.ones((2, 5, 3))),
+                    annotations=[
+                        Points([0, 4, 3, 3, 2, 2, 1, 0, 3, 0], label=0),
+                    ],
+                ),
+            ],
+            categories=["name0"],
+        )
+
+        target_dataset = Dataset.from_iterable(
+            [
+                DatasetItem(
+                    id="name0_0001",
+                    subset="test",
+                    media=Image(data=np.ones((2, 5, 3))),
+                ),
+            ],
+            categories=["name0"],
+        )


I don't think this behavior in the test is right. From the format viewpoint, the input annotations are incomplete, and it doesn't seem we can adequately support such cases in the format, so the best approach would be either to complete the missing annotations, if possible, or raise a clear error message otherwise.

My considerations on this specific case with CVAT export are here. Let's try to make the format clearly usable from CVAT, Datumaro seem to be correct at this point.

sonarqubecloud · 2024-06-23T11:01:24Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

sizov-kirill added 3 commits December 1, 2022 21:42

add test

f70e631

refactor and fix lfw format

1eb4b2e

fix typo

6e18272

nmanovic requested a review from zhiltsov-max December 5, 2022 07:56

zhiltsov-max reviewed Jan 3, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix LFW format#19

Fix LFW format#19
sizov-kirill wants to merge 3 commits intodevelopfrom
sk/fix-lfw

sizov-kirill commented Dec 1, 2022

Uh oh!

yasakova-anastasia commented Dec 2, 2022

Uh oh!

sizov-kirill commented Dec 5, 2022 •

edited

Loading

Uh oh!

yasakova-anastasia commented Dec 6, 2022

Uh oh!

zhiltsov-max Jan 3, 2023

Uh oh!

sonarqubecloud bot commented Jun 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sizov-kirill commented Dec 1, 2022

Summary

How to test

Checklist

License

Uh oh!

yasakova-anastasia commented Dec 2, 2022

Uh oh!

sizov-kirill commented Dec 5, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yasakova-anastasia commented Dec 6, 2022

Uh oh!

zhiltsov-max Jan 3, 2023

Choose a reason for hiding this comment

Uh oh!

sonarqubecloud bot commented Jun 23, 2024

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sizov-kirill commented Dec 5, 2022 •

edited

Loading