🎓 Student Performance Prediction with Linear Regression

📝 Short Description

This project analyzes and predicts student academic performance using a linear regression model on the Student Performance Dataset (student-mat.csv).
It includes data preprocessing, exploratory data analysis (EDA), model training, evaluation, and prediction for new student data.

🚀 Features

Data Preprocessing:
- Encodes categorical variables into numeric form.
- Checks and handles missing data.
Exploratory Data Analysis:
- Histograms, boxplots, and lineplots for grade distribution and relationships.
- Correlation heatmap of numerical features.
Modeling:
- Splits data into training/testing sets.
- Trains a LinearRegression model using scikit-learn.
- Evaluates performance with MSE and R² score.
Prediction:
- Predicts grades for hypothetical students.
Model Saving:
- Exports trained model with joblib for future use.

📦 Requirements

Install dependencies:

pip install pandas matplotlib seaborn scikit-learn joblib

📂 Dataset

The dataset contains demographic, social, and academic attributes for students.
Target variable: G3 — the final grade.

⚙️ Usage

Place student-mat.csv in the project directory.
Run the notebook or script to:
- Explore data
- Train the model
- Make predictions
Example prediction:

prediction = model.predict(new_data)
print(f'Predicted final grade: {prediction[0]:.1f}')

The trained model is saved as new_model.joblib.

📊 Outputs

Visualizations:
- Grade distribution histogram
- Boxplots (by school, gender, alcohol consumption, study time, etc.)
- Correlation heatmap
Metrics:
- Mean Squared Error (MSE)
- R² score
Saved Model:
- File: new_model.joblib

🔍 Workflow

Data Loading — Load student-mat.csv into a Pandas DataFrame and inspect structure.
Preprocessing — Encode categorical variables into integers and prepare X (features) & y (target).
EDA — Use Seaborn/Matplotlib for distribution plots, boxplots, and correlation analysis.
Model Training — Split data (80% train, 20% test), fit a LinearRegression model.
Evaluation — Predict on the test set and compute MSE & R² score.
Prediction — Create new student profile → predict final grade.
Model Saving — Save trained model with joblib.dump() for reuse.

Author: Nimona Engida
Dataset Source: UCI Machine Learning Repository — Student Performance

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
student-mat.csv		student-mat.csv
student_performece_prediction.ipynb		student_performece_prediction.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎓 Student Performance Prediction with Linear Regression

📝 Short Description

🚀 Features

📦 Requirements

📂 Dataset

⚙️ Usage

📊 Outputs

🔍 Workflow

About

Uh oh!

Releases

Packages

Languages

GreatTitanDev/Student_performence_prediction

Folders and files

Latest commit

History

Repository files navigation

🎓 Student Performance Prediction with Linear Regression

📝 Short Description

🚀 Features

📦 Requirements

📂 Dataset

⚙️ Usage

📊 Outputs

🔍 Workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages