✨ LexiBoost-Automated Essay Scoring (AES)

An AI-powered web application that automatically evaluates and scores student essays using natural language processing (NLP) and machine learning techniques.

🚀 Project Overview

Automated Essay Scoring (AES) leverages machine learning models to predict essay scores based on linguistic and structural features. The system includes a user-friendly web interface built with HTML, Bootstrap, and JavaScript, which communicates with a Flask-based backend API for real-time scoring.

🎯 Key Features

📝 Accepts user essays through a clean web UI
⚙️ Preprocesses and vectorizes input essays
🤖 Predicts scores using trained machine learning models
📊 Normalizes and interprets scores across different essay prompts
🌐 Hosted on Render for public access

🧠 Technologies Used

💻 Frontend

HTML5, CSS3
Bootstrap 4.4
JavaScript (vanilla)

🔙 Backend

Flask 3.0.3
Flask-CORS
Gunicorn

🧠 Machine Learning

scikit-learn (Random Forest, SVR, Linear Regression)
NLTK for text preprocessing
Gensim
TensorFlow/Keras (for potential deep learning extensions)

📦 Other Libraries

Pandas, NumPy
Matplotlib, Seaborn (for EDA)
Regular Expressions (re)

🗂 Dataset

The model is trained on the ASAP Automated Student Assessment Prize dataset provided by Kaggle.

📄 Format: .tsv file with essays and human-assigned scores
📊 8 different essay prompts, each with varying score ranges
🧹 Extensive preprocessing: punctuation removal, stopword filtering, tokenization, POS tagging, and spell-checking.

🧪 Model Training Pipeline

Data Cleaning & Preprocessing
- Removing usernames, punctuation, and stopwords
- Tokenizing and POS tagging for feature extraction
Feature Engineering
- Word count, character count, average word length
- POS-based features (nouns, verbs, adjectives, adverbs)
- Misspelled words count
- CountVectorizer-based n-grams
Modeling
- Normalization of scores for uniform scaling
- Training ML regressors (Random Forest, SVR, Linear Regression)
- Evaluation using Mean Squared Error (MSE)
Scoring API
- Flask API receives essays and returns predicted scores scaled to a 10-point system

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
.idea		.idea
Dataset		Dataset
webapp		webapp
Automatic_Essay_Scoring with_NN.ipynb		Automatic_Essay_Scoring with_NN.ipynb
CNAME		CNAME
Essay_Scoring_1.ipynb		Essay_Scoring_1.ipynb
LICENSE		LICENSE
Processed_data.csv		Processed_data.csv
README.md		README.md
app.py		app.py
big.txt		big.txt
feature.pkl		feature.pkl
final_lstm.h5		final_lstm.h5
requirements.txt		requirements.txt
sample_essays.txt		sample_essays.txt
word2vecmodel.bin		word2vecmodel.bin
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

✨ LexiBoost-Automated Essay Scoring (AES)

🚀 Project Overview

🎯 Key Features

🧠 Technologies Used

💻 Frontend

🔙 Backend

🧠 Machine Learning

📦 Other Libraries

🗂 Dataset

🧪 Model Training Pipeline

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

shivangiS04/LexiBoost

Folders and files

Latest commit

History

Repository files navigation

✨ LexiBoost-Automated Essay Scoring (AES)

🚀 Project Overview

🎯 Key Features

🧠 Technologies Used

💻 Frontend

🔙 Backend

🧠 Machine Learning

📦 Other Libraries

🗂 Dataset

🧪 Model Training Pipeline

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages