Multimodal-alignment study using text-derived views (semantic embeddings, lexical cues, affective/psycholinguistic proxies) to predict affect on CMU-MOSEI and qualitatively project to counseling dialogues.
- Fusion (semantic + proxies + TF-IDF) outperforms the best unimodal baseline on Macro-F1 (+1.3 points, p=0.0076) and improves calibration (Brier 0.176 vs 0.179) on the test split; a fusion sketch follows this list.
- Variance across folds drops (Std Macro-F1 0.0045 fused vs 0.0063 semantic), indicating more stable alignment.
- Counseling projection shows intuitive polarity spread, aiding interpretability despite unlabeled data.
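
For orientation, here is a minimal sketch of the early-fusion setup, assuming simple concatenation of the three views and a logistic-regression classifier; these are hypothetical choices, the actual pipeline lives in `notebooks/run_multimodal_alignment.py` and is documented in `REPORT.md`.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score

def fuse_and_score(train_views, y_train, test_views, y_test):
    """Concatenate per-view features (semantic, proxies, TF-IDF) and report Macro-F1.

    Dense arrays are assumed; convert sparse TF-IDF with .toarray() first.
    """
    X_train = np.hstack(train_views)  # shape: (n_train, d_semantic + d_proxy + d_tfidf)
    X_test = np.hstack(test_views)
    clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    return f1_score(y_test, clf.predict(X_test), average="macro")
```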
- Ensure a virtual env: `uv venv && source .venv/bin/activate`.
- Dependencies are tracked in `pyproject.toml` (installed via `uv add ...`).
- Run experiments: `source .venv/bin/activate && python notebooks/run_multimodal_alignment.py`.
- Outputs land in `results/` (metrics JSON, plots, counseling projections).
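
A minimal sketch for inspecting the saved metrics afterwards; the file name comes from the repository layout below, but the exact JSON schema is defined by the experiment script, so treat the keys as illustrative.

```python
import json
from pathlib import Path

# Load and pretty-print the test metrics (key names depend on the script).
with Path("results/test_metrics.json").open() as f:
    metrics = json.load(f)
print(json.dumps(metrics, indent=2))
```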
- `planning.md` – research plan and methodology.
- `notebooks/run_multimodal_alignment.py` – end-to-end experiment script.
- `results/` – metrics (`cv_metrics_raw.json`, `test_metrics.json`, etc.) and plots.
- `datasets/` – local MOSEI text and counseling data (excluded from git).
- `REPORT.md` – full report with analysis and conclusions.
- Seed fixed at 42; CPU execution ~2 minutes.
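
A typical seeding pattern consistent with the fixed seed above; the script's own setup may differ, and `torch` seeding only applies if it is installed.

```python
import random
import numpy as np

SEED = 42
random.seed(SEED)
np.random.seed(SEED)
try:
    import torch  # only relevant if the embedding model runs on torch
    torch.manual_seed(SEED)
except ImportError:
    pass
```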
- Uses `sentence-transformers/all-MiniLM-L6-v2` for semantic embeddings; see `REPORT.md` for full details and limitations.
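
A minimal sketch of computing the semantic view with `sentence-transformers`; the example utterance and shape check are illustrative, not taken from the experiment script.

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
texts = ["I felt a lot better after the session."]  # illustrative utterance
embeddings = model.encode(texts, convert_to_numpy=True)
print(embeddings.shape)  # (1, 384) for this model
```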