GradeCast — AI-powered Measurement Insight Platform

(Artificial Neural Network Prediction + IRT Rasch Diagnostics)

GradeCast lets users upload construct-based person–item response datasets, train, validate, and predict with artificial neural network models end to end in the browser, and explore Rasch-based IRT analyses in real time.

Features:
CSV upload & column mapping · multi-config training/validation · precise progress bars · new-item prediction · IRT calibration (Reliability / Variance / Infit / Outfit / t / p) · Wright Map

Statistical Principles:
You can access the statistical principles behind GradeCast in GradeCast_Statistical_Principles.pdf in this repository.

By Linwei Yu
The University of Hong Kong · linweiyu@connect.hku.hk

Overview

GradeCast runs entirely in your browser — no backend or installation required.

It enables you to:

Upload Item Descriptive CSV (e.g., ItemID, Construct, Format, …) and Student Responses CSV.
Map columns interactively (supports Long or Wide response format).
Configure multiple Artificial Neural Network (ANN) architectures and validation schemes (Holdout / LOOCV).
Train with real-time progress bars.
Use the best model (lowest RMSE) to predict performance for new items.
Perform Rasch / IRT calibration and visualize results via a Wright Map.

All computation happens locally using TensorFlow.js, PapaParse, Plotly, and TailwindCSS.

Live Stack

Layer	Library	Description
UI	TailwindCSS	Minimal dark tech-style interface
CSV Parsing	PapaParse	Client-side CSV handling
Machine Learning	TensorFlow.js	ANN regression and evaluation
Charts	Plotly.js	Loss curves, scatter plots, Wright Maps

Repository Structure

/
├─ index.html     # Main entry (UI + containers + CDN deps)
├─ app.js         # Upload/mapping, ANN training, validation, prediction, IRT logic
├─ charts.js      # Plotly charts: loss curve, actual vs. predicted, Wright Map
├─ styles.css     # Minimal CSS; Tailwind handles layout and theme
└─ (CDN-loaded)   # PapaParse / TensorFlow.js / Plotly / Tailwind

Prefer offline vendor scripts? Copy them into /libs and update <script src> paths in index.html.

Quick Start

Download or clone this repository.
Open index.html directly in a modern browser (Chrome / Edge / Firefox).
Upload your CSVs and follow the interface prompts.

No build or installation steps required.

Data Formats

Item Descriptive CSV

Used for both training and prediction; structure must be identical in both.

Required column

ItemID

Optional features (user-selectable as categorical or numeric)

e.g., Construct, Format, Domain, etc.

Example

ItemID,Construct,Format
Item1,A,I
Item2,B,II
Item3,A,III

Student Responses CSV

Supports Long or Wide format.

1) Long Format

Each row = one (StudentID, ItemID, Score) pair.

StudentID,ItemID,Score
S001,Item1,1
S001,Item2,0
S002,Item1,2

2) Wide Format

Each row = one Student; columns represent item scores.

StudentID,Item1,Item2,Item3
S001,1,0,1
S002,1,1,0

During training, responses are joined with item descriptives via ItemID.
StudentID is also one-hot encoded as a personalization feature.
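
As a rough illustration of the reshaping and join described above, the following dependency-free sketch converts Wide rows to Long records and attaches item descriptives by ItemID. The helper names (wideToLong, joinWithItems) are hypothetical and are not taken from app.js, and skipping blank cells is an assumption about how missing responses might be handled.

```javascript
// Hypothetical helper: reshape Wide rows into Long (StudentID, ItemID, Score) records.
function wideToLong(rows, studentCol, itemCols) {
  const long = [];
  for (const row of rows) {
    for (const itemId of itemCols) {
      const raw = row[itemId];
      // Assumption: blank cells are skipped so missing responses do not become zeros.
      if (raw === undefined || raw === null || raw === '') continue;
      long.push({ StudentID: row[studentCol], ItemID: itemId, Score: Number(raw) });
    }
  }
  return long;
}

// Hypothetical helper: attach item descriptives to each response via ItemID.
function joinWithItems(longRows, items) {
  const byId = new Map(items.map((item) => [item.ItemID, item]));
  return longRows
    .filter((r) => byId.has(r.ItemID)) // drop responses to items without descriptives
    .map((r) => ({ ...byId.get(r.ItemID), ...r }));
}

const wide = [
  { StudentID: 'S001', Item1: '1', Item2: '0', Item3: '1' },
  { StudentID: 'S002', Item1: '1', Item2: '1', Item3: '' },
];
const items = [
  { ItemID: 'Item1', Construct: 'A', Format: 'I' },
  { ItemID: 'Item2', Construct: 'B', Format: 'II' },
  { ItemID: 'Item3', Construct: 'A', Format: 'III' },
];

const long = wideToLong(wide, 'StudentID', ['Item1', 'Item2', 'Item3']);
const joined = joinWithItems(long, items);
console.log(joined.length, joined[0]);
```

Each joined record carries both the response (StudentID, Score) and the item features, which is the shape a feature encoder would need.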

How to Use

1) Upload & Map Columns

Left: Upload Item Descriptive CSV → choose ItemID and feature columns.
Right: Upload Student Responses CSV → choose format (Long/Wide) and map columns.
- Long: StudentID, ItemID, Score
- Wide: StudentID, item columns (multi-select)

Once both files parse successfully (✅), later sections unlock automatically.

2) Feature Selection

Pick which Item CSV columns to use.
Each can be marked categorical (one-hot) or numeric.
Default: all enabled as categorical.
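
The sketch below shows one way categorical and numeric columns could be turned into a feature vector. It is an illustration under assumptions, not the encoder in app.js: the function names are hypothetical, and the numeric column Words is invented for the example. A category that was not present when the encoder was fitted comes out as an all-zero block.

```javascript
// Hypothetical encoder: learn the sorted category list of one column.
function fitOneHot(rows, column) {
  const categories = [...new Set(rows.map((r) => String(r[column])))].sort();
  return { column, categories };
}

// Encode one row; a category not seen during fitting yields an all-zero vector.
function oneHot(encoder, row) {
  const value = String(row[encoder.column]);
  return encoder.categories.map((c) => (c === value ? 1 : 0));
}

// Concatenate the one-hot blocks with the numeric columns.
function encodeRow(row, encoders, numericCols) {
  const categorical = encoders.flatMap((enc) => oneHot(enc, row));
  const numeric = numericCols.map((col) => Number(row[col]));
  return [...categorical, ...numeric];
}

// "Words" is an invented numeric feature used only for this example.
const trainItems = [
  { ItemID: 'Item1', Construct: 'A', Format: 'I', Words: '12' },
  { ItemID: 'Item2', Construct: 'B', Format: 'II', Words: '20' },
  { ItemID: 'Item3', Construct: 'A', Format: 'III', Words: '15' },
];
const encoders = ['Construct', 'Format'].map((col) => fitOneHot(trainItems, col));

const seen = encodeRow(trainItems[1], encoders, ['Words']);
const unseen = encodeRow({ Construct: 'C', Format: 'I', Words: '8' }, encoders, ['Words']);
console.log(seen, unseen);
```

Marking a column as numeric keeps it as a single input, while marking it as categorical widens the input by one column per category, so the choice changes the network's input size.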

3) Model & Validation

Configure:

Hidden layers (neurons + activation: ReLU / Tanh / Sigmoid)
Learning Rate, Epochs, Batch Size
Validation:
- Holdout (e.g., 80/20 split; model persisted for prediction)
- LOOCV (Leave-One-Out; rigorous evaluation; model not persisted)
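
To make the two validation schemes concrete, here is a minimal sketch of how sample indices could be partitioned. The function names and the default 20% validation fraction are assumptions for illustration; the splitting logic in app.js may differ.

```javascript
// Holdout: shuffle the indices (Fisher-Yates), then reserve a fraction for validation.
function holdoutSplit(n, valFraction = 0.2, random = Math.random) {
  const idx = Array.from({ length: n }, (_, i) => i);
  for (let i = n - 1; i > 0; i--) {
    const j = Math.floor(random() * (i + 1));
    [idx[i], idx[j]] = [idx[j], idx[i]];
  }
  const nVal = Math.max(1, Math.round(n * valFraction));
  return { train: idx.slice(nVal), val: idx.slice(0, nVal) };
}

// LOOCV: every sample is the validation set exactly once.
function* loocvFolds(n) {
  for (let i = 0; i < n; i++) {
    const train = [];
    for (let k = 0; k < n; k++) {
      if (k !== i) train.push(k);
    }
    yield { train, val: [i] };
  }
}

const split = holdoutSplit(10, 0.2);
const folds = [...loocvFolds(4)];
console.log(split.train.length, split.val.length, folds.length);
```

Holdout trains one model per configuration, whereas LOOCV trains as many models as there are samples, which is why it is slower and why no single model is left over to persist.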

4) Training

Click Start Training.

Holdout: epoch-level progress, loss curves (train/val), Predicted vs Actual scatter, metrics (MAE / MSE / RMSE / R²).
LOOCV: sample-level progress, final metrics (MAE / MSE / RMSE / R²).

The best Holdout model (lowest RMSE) is retained for prediction.
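
For reference, the four reported metrics can be computed from actual and predicted scores as in the sketch below. This is a generic illustration rather than the code in app.js; the function name, the sample scores, and the run summaries (cfg1, cfg2) are made up for the example.

```javascript
// Compute MAE, MSE, RMSE, and R² for paired actual and predicted values.
function regressionMetrics(actual, predicted) {
  const n = actual.length;
  const mean = actual.reduce((s, y) => s + y, 0) / n;
  let absSum = 0;
  let sqSum = 0;
  let totSum = 0;
  for (let i = 0; i < n; i++) {
    const err = actual[i] - predicted[i];
    absSum += Math.abs(err);
    sqSum += err * err;
    totSum += (actual[i] - mean) ** 2;
  }
  const mse = sqSum / n;
  return {
    mae: absSum / n,
    mse,
    rmse: Math.sqrt(mse),
    // R² is undefined when the actual scores have no variance.
    r2: totSum === 0 ? NaN : 1 - sqSum / totSum,
  };
}

const metrics = regressionMetrics([1, 0, 2, 1], [0.5, 0.5, 1.5, 1.0]);
console.log(metrics);

// Selecting the best run by lowest RMSE (illustrative values only).
const runs = [
  { name: 'cfg1', rmse: 0.52 },
  { name: 'cfg2', rmse: 0.43 },
];
const best = runs.reduce((a, b) => (b.rmse < a.rmse ? b : a));
console.log(best.name);
```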

5) Prediction Workflow

Make sure at least one Holdout run has been trained, since only Holdout keeps a model.
Upload a new Item Descriptive CSV (same columns as training).
The model reuses the encoders fitted during training; categories that did not appear in training are encoded as all-zero vectors.
The app generates (Student × Item) predictions (preview + CSV export).
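
The (Student × Item) output can be pictured as a cross product of known students and new items. The sketch below assumes a hypothetical predictFn that wraps the retained model and its encoders; a dummy predictor stands in for it here, and the CSV writer is deliberately minimal.

```javascript
// Hypothetical helper: build one row per (student, new item) pair.
// predictFn stands in for the retained Holdout model plus its encoders.
function predictionGrid(studentIds, newItems, predictFn) {
  const rows = [];
  for (const studentId of studentIds) {
    for (const item of newItems) {
      rows.push({
        StudentID: studentId,
        ItemID: item.ItemID,
        Predicted: predictFn(studentId, item),
      });
    }
  }
  return rows;
}

// Minimal CSV writer; assumes values contain no commas, quotes, or newlines.
function toCsv(rows) {
  const header = Object.keys(rows[0]);
  const lines = rows.map((r) => header.map((h) => String(r[h])).join(','));
  return [header.join(','), ...lines].join('\n');
}

// Dummy predictor used only to make the sketch runnable.
const dummyPredict = (studentId, item) => (item.Construct === 'A' ? 1 : 0);

const grid = predictionGrid(
  ['S001', 'S002'],
  [
    { ItemID: 'Item4', Construct: 'A', Format: 'I' },
    { ItemID: 'Item5', Construct: 'B', Format: 'II' },
  ],
  dummyPredict
);
const csv = toCsv(grid);
console.log(csv);
```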

IRT Calibration

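Run Compute IRT from Observed Scores to perform a frontend-friendly approximation of Rasch-family models (Partial Credit Model, PCM / Rating Scale Model, RSM). Estimation alternates between person and item updates in the style of Joint Maximum Likelihood (JML).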

Outputs

Reliability and Variance (Persons)
Fit statistics per item: Infit / Outfit / t / p
Wright Map: overlaid histograms of θ (persons) and b (items)
Exportable CSV with item fit indices

This is designed for real-time diagnostics. For high-stakes calibration, consider full JML implementations.
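
To give a feel for JML-style alternating updates, the sketch below handles only the simplest case: the dichotomous Rasch model on a complete 0/1 matrix. It is a simplified illustration, not GradeCast's PCM / RSM routine. The function names, the step clamp, the iteration count, and the toy data are all assumptions, and the t and p statistics are omitted.

```javascript
// Rasch probability of a correct response for ability theta and difficulty b.
function raschP(theta, b) {
  return 1 / (1 + Math.exp(-(theta - b)));
}

// Alternating Newton updates for a complete 0/1 matrix X[person][item].
// Assumes no person or item has a zero or perfect raw score,
// because those estimates would diverge.
function raschJml(X, iterations = 50) {
  const nP = X.length;
  const nI = X[0].length;
  const theta = new Array(nP).fill(0);
  const b = new Array(nI).fill(0);
  const clamp = (v) => Math.max(-1, Math.min(1, v)); // damp each Newton step
  for (let it = 0; it < iterations; it++) {
    for (let p = 0; p < nP; p++) {
      let resid = 0;
      let info = 0;
      for (let i = 0; i < nI; i++) {
        const pr = raschP(theta[p], b[i]);
        resid += X[p][i] - pr;
        info += pr * (1 - pr);
      }
      theta[p] += clamp(resid / info);
    }
    for (let i = 0; i < nI; i++) {
      let resid = 0;
      let info = 0;
      for (let p = 0; p < nP; p++) {
        const pr = raschP(theta[p], b[i]);
        resid += X[p][i] - pr;
        info += pr * (1 - pr);
      }
      b[i] -= clamp(resid / info);
    }
    // Fix the scale origin: shift both sets so item difficulties average zero.
    const meanB = b.reduce((s, v) => s + v, 0) / nI;
    for (let i = 0; i < nI; i++) b[i] -= meanB;
    for (let p = 0; p < nP; p++) theta[p] -= meanB;
  }
  return { theta, b };
}

// Item fit mean squares: outfit is the mean squared standardized residual,
// infit is the sum of squared residuals over the sum of model variances.
function itemFit(X, theta, b) {
  return b.map((bi, i) => {
    let z2 = 0;
    let sqResid = 0;
    let varSum = 0;
    for (let p = 0; p < X.length; p++) {
      const pr = raschP(theta[p], bi);
      const v = pr * (1 - pr);
      const r = X[p][i] - pr;
      z2 += (r * r) / v;
      sqResid += r * r;
      varSum += v;
    }
    return { outfit: z2 / X.length, infit: sqResid / varSum };
  });
}

// Toy data: 6 persons x 4 items, item raw scores 5, 4, 3, 1.
const X = [
  [1, 0, 0, 0],
  [1, 1, 0, 0],
  [1, 1, 0, 0],
  [1, 0, 1, 0],
  [1, 1, 1, 0],
  [0, 1, 1, 1],
];
const est = raschJml(X);
const fit = itemFit(X, est.theta, est.b);
console.log(est.b, fit);
```

Items answered correctly less often receive higher difficulty estimates, and infit or outfit values far from 1 flag items whose responses depart from the model's expectations.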

Model & Validation

ANN Architecture

Input (one-hot categorical + numeric)
   ↓
Dense(hidden₁, activation)
   ↓
Dense(hidden₂, activation)
   ↓
Dense(1, linear)

Loss: Mean Squared Error
Optimizer: Adam (configurable LR)
Validation: Holdout (keeps model) / LOOCV (no persisted model)
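
The layer stack above can be read as a chain of affine maps followed by activations. The dependency-free sketch below runs a forward pass through such a stack with hand-picked toy weights. It is for illustration only: GradeCast builds and trains its layers with TensorFlow.js, and the helper names and weights here are invented.

```javascript
const activations = {
  relu: (x) => Math.max(0, x),
  tanh: (x) => Math.tanh(x),
  sigmoid: (x) => 1 / (1 + Math.exp(-x)),
  linear: (x) => x,
};

// One dense layer: out[j] = activation(bias[j] + sum_i input[i] * weights[i][j]).
function dense(input, layer) {
  return layer.bias.map((bj, j) => {
    let z = bj;
    for (let i = 0; i < input.length; i++) {
      z += input[i] * layer.weights[i][j];
    }
    return activations[layer.activation](z);
  });
}

// Feed the input through each layer in order.
function forward(input, layers) {
  return layers.reduce((x, layer) => dense(x, layer), input);
}

// Hand-picked toy weights (not trained) for a 3 -> 2 -> 2 -> 1 network.
const layers = [
  { weights: [[1, 0], [0, 1], [1, 1]], bias: [0, 0], activation: 'relu' },
  { weights: [[1, -1], [1, 1]], bias: [0, 0], activation: 'relu' },
  { weights: [[0.5], [0.5]], bias: [0.1], activation: 'linear' },
];

const yHat = forward([1, 0, 2], layers)[0];
console.log(yHat);
```

The final layer is linear with a single unit because the network predicts a score as a regression target, which is what the Mean Squared Error loss expects.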

Troubleshooting

Issue	Fix
Nothing happens after upload	Upload both CSVs and complete mappings for `ItemID`, `StudentID`, and `Score`/item columns.
LOOCV doesn’t keep a model	Intended behavior. Use Holdout if you need prediction.
Predictions look identical	Ensure selected features vary across items; mark types correctly (categorical vs numeric).
Wright Map looks degenerate	Verify training produced non-trivial variance in predictions/scores.

Security & Privacy

All data stays in the browser; no upload to any server.
State is in-memory; close the tab to clear it.
No tracking or external APIs beyond the included CDNs.

Limitations & Future Work

ANN uses dense layers; deeper/alternative architectures can be added.
IRT module is an approximation; replace with JML/MML for formal analyses.
Potential enhancements: Kidmap heatmaps, richer reliability diagnostics, PNG export buttons, presets.

License

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software.

See the LICENSE file for full details.
