MAP - Charting Student Math Misunderstandings

Project Overview

This project uses Natural Language Processing (NLP) and Machine Learning to identify student math misconceptions from open-ended responses. It is based on the Kaggle competition dataset.

The project implements a 3-stage modeling approach:

Binary Classification: Predict correct vs. incorrect answers.
3-Class Classification: Categorize explanations (Correct, Misconception, Neither).
Multiclass Classification: Identify specific misconception types (35+ categories).

Structure

Term_Project_geissinger_final.ipynb: Main analysis and modeling notebook.
project_math/: Folder containing the dataset (train.csv, test.csv).
requirements.txt: List of dependencies.

Setup

Clone the repository.
Install dependencies:
```
pip install -r requirements.txt
```
Run the notebook: Open Term_Project_geissinger_final.ipynb and run all cells. The notebook is configured to look for data in the project_math/ directory by default.

Models

Text Representation: TF-IDF and Sentence Transformers (embeddings).
Classifiers: Random Forest and Logistic Regression.
Handling Imbalance: BorderlineSMOTE.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
project_math		project_math
.gitignore		.gitignore
README.md		README.md
Term_Project_geissinger_final.ipynb		Term_Project_geissinger_final.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MAP - Charting Student Math Misunderstandings

Project Overview

Structure

Setup

Models

About

Uh oh!

Releases

Packages

Languages

LanaGeis/MAP-Student-Math-Misunderstandings_Kaggle

Folders and files

Latest commit

History

Repository files navigation

MAP - Charting Student Math Misunderstandings

Project Overview

Structure

Setup

Models

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages