From ScanDDM to ART: DDM-DINO

Visual attention models are increasingly capable of predicting scanpaths, i.e. sequences of fixations and the eye movements between them. ScanDDM introduced a drift-diffusion model (DDM) approach for predicting goal-directed scanpaths in a zero-shot setting, while ART focused on the incremental prediction of attention during language-guided object referral tasks. This work combines the two approaches: it modifies ScanDDM by integrating GroundingDINO so that it can address the incremental object referral task. The resulting model is named DDM-DINO.

More detailed information and usage examples can be found in the attached PDF report.
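To give an intuition for the drift-diffusion idea mentioned above, here is a minimal sketch of a race between noisy evidence accumulators, where the first accumulator to reach a threshold determines the next attended region. It is not the code in this repository: the function name race_to_threshold, its parameters, and the example relevance scores are all assumptions made for illustration, and the scores merely stand in for whatever relevance signal an object grounding model might provide.

```python
import random


def race_to_threshold(drift_rates, threshold=1.0, noise_std=0.1,
                      dt=0.01, max_steps=10_000, seed=None):
    """Simulate independent noisy accumulators racing to a common threshold.

    Hypothetical helper for illustration only.
    Returns (winner_index, decision_time); winner_index is None if no
    accumulator reaches the threshold within max_steps.
    """
    if not drift_rates:
        raise ValueError("drift_rates must contain at least one value")
    rng = random.Random(seed)
    evidence = [0.0] * len(drift_rates)
    sqrt_dt = dt ** 0.5
    for step in range(1, max_steps + 1):
        for i, drift in enumerate(drift_rates):
            # Euler-Maruyama update: deterministic drift plus Gaussian noise.
            evidence[i] += drift * dt + noise_std * sqrt_dt * rng.gauss(0.0, 1.0)
        # The accumulator with the most evidence wins once it crosses the threshold.
        best = max(range(len(evidence)), key=evidence.__getitem__)
        if evidence[best] >= threshold:
            return best, step * dt
    return None, max_steps * dt


# Assumed relevance scores for three candidate regions (made-up values).
scores = {"mug": 0.9, "laptop": 0.3, "plant": 0.1}
labels = list(scores)
winner, decision_time = race_to_threshold([scores[k] for k in labels], seed=0)
if winner is not None:
    print(labels[winner], round(decision_time, 2))
```

In this sketch, a higher drift rate makes a region more likely to win the race and to win it sooner, which is one simple way to turn relevance scores into both a fixation target and a fixation duration.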

Setup

Install all the requirements with pip install -r requirements.txt

Usage

In main.py, define the prompt and the image path
Run python main.py

Metrics

Uncomment the commented-out libraries in requirements.txt
Install the new requirements with pip install -r requirements.txt
In calculate_all_metrics.py, define the parameters
Run python calculate_all_metrics.py

Project for Natural Interaction and Affective Computing courses, UNIMI @ PHuSe Lab, AY 2024/2025, by Hari Calzi and Salvatore Ferrara.
