Skip to content

Machine Learning research: Comprehensive Review of Multimodal Data Alignment Techniques for Adolescent Mental Health AI Modeling | Generated by Idea Explorer on 2025-12-07

Notifications You must be signed in to change notification settings

ChicagoHAI/multi-data-align-amh-codex

Repository files navigation

Project Overview

Multimodal-alignment study using text-derived views (semantic embeddings, lexical cues, affective/psycholinguistic proxies) to predict affect on CMU-MOSEI and qualitatively project to counseling dialogues.

Key Findings

  • Fusion (semantic + proxies + TF-IDF) outperforms best unimodal baseline on Macro-F1 (+1.3 points, p=0.0076) and improves calibration (Brier 0.176 vs 0.179) on test split.
  • Variance across folds drops (Std Macro-F1 0.0045 fused vs 0.0063 semantic), indicating stabler alignment.
  • Counseling projection shows intuitive polarity spread, aiding interpretability despite unlabeled data.

Reproduction

  1. Ensure virtual env: uv venv && source .venv/bin/activate.
  2. Dependencies tracked in pyproject.toml (installed via uv add ...).
  3. Run experiments:
    source .venv/bin/activate && python notebooks/run_multimodal_alignment.py
  4. Outputs land in results/ (metrics JSON, plots, counseling projections).

File Structure

  • planning.md — research plan and methodology.
  • notebooks/run_multimodal_alignment.py — end-to-end experiment script.
  • results/ — metrics (cv_metrics_raw.json, test_metrics.json, etc.) and plots.
  • datasets/ — local MOSEI text and counseling data (excluded from git).
  • REPORT.md — full report with analysis and conclusions.

Notes

  • Seed fixed at 42; CPU execution ~2 minutes.
  • Uses sentence-transformers/all-MiniLM-L6-v2 for semantic embeddings; see REPORT.md for full details and limitations.

About

Machine Learning research: Comprehensive Review of Multimodal Data Alignment Techniques for Adolescent Mental Health AI Modeling | Generated by Idea Explorer on 2025-12-07

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages