Add class masking (draft) #915

rhine3 · 2025-08-07T08:56:47Z

Summary

Brief description of what this PR does. (tl;dr).

List of Changes

Modified class X
Added model Y
Fixed problem Z
etc.

Related Issues

If the PR closes or is related to an issue, reference it here.
For example, "Closes #<ISSUE_NUMBER>", "Fixes #<ISSUE_NUMBER>" or "Relates to #<ISSUE_NUMBER>" .

See Github Keywords
for more information on this

Detailed Description

A clear and detailed description of the changes, how they solve/fix the related issues.

Mention potential side effects or risks associated with the changes, if applicable.

How to Test the Changes

Instructions on how to test the changes Include references to automated and/or manual tests that were created/used to
test the changes.

Screenshots

If applicable, add screenshots to help explain this PR (ex. Before and after for UI changes).

Deployment Notes

Include instructions if this PR requires specific steps for its deployment (database migrations, config changes, etc.)

Checklist

I have tested these changes appropriately.
I have added and/or modified relevant tests.
I updated relevant documentation or comments.
I have verified that this PR follows the project's coding standards.
Any dependent changes have already been merged to main.

instead of updating the existing one.

so that determination is properly updated.

netlify · 2025-08-07T08:57:33Z

✅ Deploy Preview for antenna-preview ready!

Name	Link
🔨 Latest commit	`19b0cec`
🔍 Latest deploy log	https://app.netlify.com/projects/antenna-preview/deploys/68a8d203dca7f2000851ee17
😎 Deploy Preview	https://deploy-preview-915--antenna-preview.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.
Lighthouse	1 paths audited Performance: 61 (🟢 up 1 from production) Accessibility: 80 (no change from production) Best Practices: 100 (no change from production) SEO: 92 (no change from production) PWA: 80 (no change from production) View the detailed breakdown and full score reports

To edit notification comments on pull requests, go to your Netlify project configuration.

Copilot

Pull Request Overview

This pull request adds logistic binning functionality to the AMI system, introducing support for out-of-distribution (OOD) scoring, clustering algorithms, and enhanced user preferences management. The changes include new filtering capabilities, improved data models, and UI enhancements for species management.

Key changes:

Added OOD score tracking and filtering for classifications
Implemented clustering algorithms with feature vector support
Enhanced species management with tags, unknown species support, and improved UI
Added user preference persistence for score thresholds

Reviewed Changes

Copilot reviewed 119 out of 121 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
ui/src/utils/userPreferences/*	Added OOD score threshold to user preferences with fallback handling
ui/src/utils/useFilters.ts	Added new filter options for OOD scores, tags, and unknown species
ui/src/utils/language.ts	Added translations for OOD scores and new UI elements
ui/src/pages/species/*	Enhanced species pages with column settings, tags, and improved layout
ui/src/components/*	Added new components for OOD scores, determination scores, and tag management
ami/main/models/*	Extended models with OOD scores, feature vectors, and improved determination logic
ami/ml/*	Added clustering algorithms and post-processing capabilities
requirements/base.txt	Added ML dependencies (pgvector, scikit-learn, scipy)

Copilot · 2025-08-07T08:58:53Z

ui/src/utils/userPreferences/userPreferencesContext.tsx

+        return {
+          ...DEFAULT_PREFERENCES,
+          ...JSON.parse(storedPreferences),
+        }


Consider adding type safety by defining the expected structure of stored preferences. The current implementation spreads unknown data from localStorage which could introduce unexpected properties.

Suggested change

return {

...DEFAULT_PREFERENCES,

...JSON.parse(storedPreferences),

}

return sanitizeUserPreferences(JSON.parse(storedPreferences))

Copilot · 2025-08-07T08:58:54Z

ui/src/pages/species-details/new-unknown-species-button.tsx

+        createSpecies({
+          projectId: projectId as string,
+          name: `Cluster (${new Date().toISOString()})`,
+          parentId: '2361', // Cluster


Hard-coded parent ID '2361' should be moved to a configuration constant or made configurable. This creates a tight coupling to a specific database record.

Suggested change

parentId: '2361', // Cluster

parentId: CLUSTER_PARENT_ID,

Copilot · 2025-08-07T08:58:54Z

ami/ml/clustering_algorithms/cluster_detections.py

+        classifications__features_2048__isnull=False,
+        classifications__algorithm=feature_extraction_algorithm,
+        source_image__collections=collection,
+        occurrence__determination_ood_score__gt=ood_threshold,


The filter occurrence__determination_ood_score__gt=ood_threshold will exclude detections where the OOD score is null. This might unintentionally filter out valid detections that don't have OOD scores yet.

Suggested change

occurrence__determination_ood_score__gt=ood_threshold,

Q(occurrence__determination_ood_score__gt=ood_threshold) | Q(occurrence__determination_ood_score__isnull=True),

Copilot · 2025-08-07T08:58:55Z

ami/ml/post_processing/class_masking.py

+        print("Top taxon: ", category_map_with_taxa[top_index])  # @TODO: REMOVE
+        print("Top index: ", top_index)  # @TODO: REMOVE


Debug print statements should be removed from production code. Use logging instead for debugging purposes.

Suggested change

print("Top taxon: ", category_map_with_taxa[top_index]) # @TODO: REMOVE

print("Top index: ", top_index) # @TODO: REMOVE

logger.debug(f"Top taxon: {category_map_with_taxa[top_index]}")

logger.debug(f"Top index: {top_index}")

Copilot · 2025-08-07T08:58:55Z

ami/ml/post_processing/class_masking.py

+        print("Top taxon: ", category_map_with_taxa[top_index])  # @TODO: REMOVE
+        print("Top index: ", top_index)  # @TODO: REMOVE


Debug print statements should be removed from production code. Use logging instead for debugging purposes.

Suggested change

print("Top taxon: ", category_map_with_taxa[top_index]) # @TODO: REMOVE

print("Top index: ", top_index) # @TODO: REMOVE

logger.debug(f"Top taxon: {category_map_with_taxa[top_index]}")

logger.debug(f"Top index: {top_index}")

Copilot · 2025-08-07T08:58:55Z

ami/ml/clustering_algorithms/preprocessing_features.py

+
+
+def standardize(features):
+    scaler = preprocessing.StandardScaler().fit(features)
+    features = scaler.transform(features)
+    print("standardized features")


Debug print statement should be replaced with proper logging for production code.

Suggested change

def standardize(features):

scaler = preprocessing.StandardScaler().fit(features)

features = scaler.transform(features)

print("standardized features")

import logging

logging.basicConfig(level=logging.INFO)

def standardize(features):

scaler = preprocessing.StandardScaler().fit(features)

features = scaler.transform(features)

logging.info("standardized features")

Copilot · 2025-08-07T08:58:56Z

ami/ml/clustering_algorithms/preprocessing_features.py

+def dimension_reduction(features, n_components):
+    pca = PCA(n_components=n_components)
+    features = pca.fit_transform(features)
+    print("PCA performed")


Debug print statement should be replaced with proper logging for production code.

Suggested change

print("PCA performed")

logging.info("PCA performed")

Copilot · 2025-08-07T08:58:56Z

ami/main/tests/test_occurrence_determination.py

+        self.user = User.objects.create_user(  # type: ignore
+            email="testuser@insectai.org",
+            is_staff=True,
+        )


The type: ignore comment suggests a type checking issue. Consider fixing the underlying type issue rather than suppressing it.

Suggested change

self.user = User.objects.create_user( # type: ignore

email="testuser@insectai.org",

is_staff=True,

)

self.user = User.objects.create_user(

email="testuser@insectai.org",

)

self.user.is_staff = True

self.user.save()

…stsed

* feat: clean up when events are regrouped for a deployment * feat: add created & updated at columns to sessions/events list * fix: ensure event regrouping happens immediately after sync if needed * chore: save deployment before auditing event lengths * feat: in tests, always group images after creating them, and DRY it up * fix: correct name of aggregated key * fix: correct check for events starting before noon * feat: check for invalid event times and reduce queries * fix: possible None in query * Update ami/main/models.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

…nickLab/antenna into feat/logistic-binning

netlify · 2025-08-22T20:24:39Z

✅ Deploy Preview for antenna-ood canceled.

Name	Link
🔨 Latest commit	`19b0cec`
🔍 Latest deploy log	https://app.netlify.com/projects/antenna-ood/deploys/68a8d203e45ad80008525dbe

mihow and others added 16 commits June 13, 2025 13:31

feat: template of a management command for binning classifications

685db89

feat: consider scores from a given species prediction, not top score

ef8d849

fix: pin minio containers to known working versions

b939c02

scratch: testing an approach to class masking / geo-fencing

a708ea5

fix: type annotations

5541e56

fix: require filtering by algorithm

01ecf06

feat: update updated_at timestamp for modified classifications

a3d7bd4

draft: placeholder for creating a classification instead of modifying

62f574e

feat: create new classification when apply class masking

c5e94fa

instead of updating the existing one.

feat: Django admin command for applying masking to a single occurrence

f092378

feat: allow searching by ID, hide detections in detail view

141c152

chore: update logging

3894718

feat: set previous classification as intermediate

1fa4f6a

so that determination is properly updated.

Fix reference to update_occurrences_in_collection

7ce6f84

Try to import unregistered taxa

d2bed8b

Add debugging statements

d97211c

Copilot AI review requested due to automatic review settings August 7, 2025 08:56

rhine3 changed the title ~~Add logistic binning (draft)~~ Add class masking (draft) Aug 7, 2025

Copilot AI reviewed Aug 7, 2025

View reviewed changes

mihow and others added 7 commits August 7, 2025 19:11

feat: update legacy importer for data from AMI Data Companion

6f1cc5b

feat: method to import pipeline results created externally

cbdc72a

fix: associate occurrences with events correctly

c56bfc6

fix[imports]: update stations and sessions after import

c68ba1b

feat[imports]: only use existing algorithms and category maps by default

2db2c8b

fix[imports]: create crops from imports if they are not externally ho…

b607cb7

…stsed

mihow changed the base branch from main to deployments/ood.antenna.insectai.org August 13, 2025 02:08

mihow mentioned this pull request Aug 13, 2025

Auto-process manually uploaded images (if enabled) #909

Merged

mihow force-pushed the deployments/ood.antenna.insectai.org branch from a011a5f to 3528f27 Compare August 21, 2025 02:22

Merge branch 'deployments/ood.antenna.insectai.org' of github.com:Rol…

19b0cec

…nickLab/antenna into feat/logistic-binning

mihow mentioned this pull request Oct 13, 2025

Introduce generic post-processing framework #954

Merged

5 tasks

mihow mentioned this pull request Oct 21, 2025

Feature for masking predictions to a species list #757

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add class masking (draft) #915

Add class masking (draft) #915

Uh oh!

rhine3 commented Aug 7, 2025

Uh oh!

netlify bot commented Aug 7, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

Copilot AI Aug 7, 2025

Uh oh!

netlify bot commented Aug 22, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	occurrence__determination_ood_score__gt=ood_threshold,
	Q(occurrence__determination_ood_score__gt=ood_threshold) \| Q(occurrence__determination_ood_score__isnull=True),

		print("Top taxon: ", category_map_with_taxa[top_index]) # @TODO: REMOVE
		print("Top index: ", top_index) # @TODO: REMOVE

Add class masking (draft) #915

Are you sure you want to change the base?

Add class masking (draft) #915

Uh oh!

Conversation

rhine3 commented Aug 7, 2025

Summary

List of Changes

Related Issues

Detailed Description

How to Test the Changes

Screenshots

Deployment Notes

Checklist

Uh oh!

netlify bot commented Aug 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for antenna-preview ready!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 7, 2025

Choose a reason for hiding this comment

Uh oh!

netlify bot commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for antenna-ood canceled.

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

netlify bot commented Aug 7, 2025 •

edited

Loading

netlify bot commented Aug 22, 2025 •

edited

Loading