SpellCorrectionApp - Persian Spell Checker

A Django-based web application for advanced Persian spelling error correction using BERT (Bidirectional Encoder Representations from Transformers) and Levenshtein distance algorithms.

📖 About

This application uses the ParsBERT masked language model to identify and correct a range of spelling errors in Persian text. It handles both real-word errors (a valid word used in the wrong place) and non-real-word errors (a string that is not a valid word) by combining BERT with Levenshtein distance, an approach intended to improve spell-checking accuracy for Persian.
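To illustrate the Levenshtein half of this approach, here is a minimal sketch of ranking replacement candidates by edit distance. It is an illustration only, not the project's implementation: the function names and the `max_distance` threshold are hypothetical, the candidate list is hard-coded (in the app it would presumably come from the masked language model and the dictionary), and the distance is written in pure Python rather than with Polyleven.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where insertion, deletion, and substitution each cost 1."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


def rank_candidates(word, candidates, max_distance=2):
    """Keep candidates within max_distance of word, closest first.

    max_distance is a hypothetical threshold chosen for this sketch.
    Ties keep the original candidate order because the sort is stable.
    """
    scored = [(levenshtein(word, c), c) for c in candidates]
    scored.sort(key=lambda pair: pair[0])
    return [c for distance, c in scored if distance <= max_distance]
```

Calling `rank_candidates(word, candidates)` returns only the candidates that are close enough to the misspelled word, so a context-based suggestion that looks nothing like the original token is filtered out.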

Key Features

- Advanced ML Model: Uses HappyTransformer with ParsBERT for accurate spell correction
- Multiple Error Types: Handles homophone, keyboard, and substitution errors
- User Authentication: Secure login and registration system
- File Processing: Upload text files for batch spell correction
- Async Task Processing: Background task processing using Dramatiq
- User Dashboard: Track your correction history and download results
- Real-time Correction: Process text directly through the web interface

Keywords

Spelling mistakes, Neural Networks, BERT masked language model, Error correction system, Real and non-real word errors, ParsBERT model, Levenshtein distance

🚀 Getting Started

Prerequisites

- Python 3.8 or higher
- pip (Python package manager)
- Virtual environment (recommended)
- PostgreSQL (optional, SQLite is used by default)

Installation

Clone the repository

```
git clone <repository-url>
cd SpellCorrectionApp-main
```

Create and activate a virtual environment

```
# On macOS/Linux
python3 -m venv venv
source venv/bin/activate

# On Windows
python -m venv venv
venv\Scripts\activate
```

Install dependencies
```
pip install -r requirements.txt
```

Set up environment variables

Create a .env file in the root directory:

```
SECRET_KEY=your-secret-key-here
DEBUG=True
```

To generate a secure SECRET_KEY:

```
python -c "from django.core.management.utils import get_random_secret_key; print(get_random_secret_key())"
```
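How the project reads the .env file is not shown in this README (it may rely on a helper package). As a rough, standard-library-only sketch of what reading the two variables could look like, with the hypothetical helper names `parse_env` and `to_bool`:

```python
def parse_env(text):
    """Parse KEY=VALUE lines; blank lines and lines starting with # are skipped."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, since a secret key may itself contain "=".
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("'\"")
    return values


def to_bool(value):
    """Interpret common truthy spellings; anything else counts as False."""
    return str(value).strip().lower() in ("1", "true", "yes", "on")
```

With these helpers, `to_bool(parse_env(text).get("DEBUG", "False"))` yields a real boolean instead of the string `"True"`, which matters because any non-empty string is truthy in Python.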

Run database migrations

```
python manage.py makemigrations
python manage.py migrate
```

Create a superuser (admin account)
```
python manage.py createsuperuser
```
Prepare ML Model and Dictionary Files

Ensure you have the following in your project:
- Trained BERT model (ParsBERT)
- Dictionary files:
  - dictionary.txt
  - keyboard_realword_errors.txt
  - substitution_realword_errors.txt
  - homophone_realword_errors.txt
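
Before starting the server, it can help to confirm that the dictionary files listed above are actually in place. The following is a small sketch under stated assumptions: the README does not say which directory holds these files, so the directory is passed in as a parameter, and `missing_dictionary_files` is a hypothetical helper name.

```python
from pathlib import Path

# File names taken from the list above.
REQUIRED_FILES = [
    "dictionary.txt",
    "keyboard_realword_errors.txt",
    "substitution_realword_errors.txt",
    "homophone_realword_errors.txt",
]


def missing_dictionary_files(directory):
    """Return the required file names that are not present in directory."""
    base = Path(directory)
    return [name for name in REQUIRED_FILES if not (base / name).is_file()]
```

An empty list means every file was found; otherwise the returned names show which files still need to be added.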

Start the development server
```
python manage.py runserver
```

Start the Dramatiq worker (in a separate terminal)

```
# Activate your virtual environment first
python manage.py rundramatiq
```

Access the application

Open your browser and navigate to: http://127.0.0.1:8000/

📋 Usage

For Regular Users

Register an Account
- Navigate to the registration page
- Provide username, email, and password
- Submit the form to create your account

Login
- Use your email and password to log in
- You'll be redirected to the home page

Correct Text

Option A: Direct Text Input
- Enter or paste Persian text directly into the text area
- Click the correction button
- View corrected text and download results

Option B: File Upload
- Upload a text file containing Persian text
- Submit for processing
- The task will be processed in the background
- Check your profile/dashboard for results
- Download the corrected file and correction report

View Your History
- Access your profile page
- View all previous correction tasks
- Download corrected files and reports
- Track task status (processing/completed)

For Administrators

Access Admin Panel
```
http://127.0.0.1:8000/admin/
```
Manage Users
- View, edit, or delete user accounts
- Monitor user activity

Manage Tasks
- View input and output tasks
- Monitor task processing status
- Access user-uploaded and corrected files

🏗️ Project Structure

```
SpellCorrectionApp-main/
├── manage.py                  # Django management script
├── requirements.txt           # Project dependencies
├── README.md                  # This file
├── .env                       # Environment variables (create this)
├── base/                      # Main application
│   ├── models.py             # Database models (User, InputTask, OutputTask)
│   ├── views.py              # Request handlers
│   ├── forms.py              # Form definitions
│   ├── urls.py               # URL routing
│   ├── tasks.py              # Background task definitions
│   ├── ml_model.py           # ML model implementation
│   ├── templates/            # HTML templates
│   └── migrations/           # Database migrations
├── SpellCorrectionApp/        # Project settings
│   ├── settings.py           # Django configuration
│   ├── urls.py               # Root URL configuration
│   └── wsgi.py               # WSGI configuration
├── static/                    # Static files (CSS, JS, images)
└── templates/                 # Base templates
```

🔧 Configuration

Database Configuration

By default, the app uses SQLite. To use PostgreSQL:

Install psycopg2 (already in requirements.txt)

Update settings.py:

```
DATABASES = {
    'default': {
        'ENGINE': 'django.db.backends.postgresql',
        'NAME': 'your_db_name',
        'USER': 'your_db_user',
        'PASSWORD': 'your_db_password',
        'HOST': 'localhost',
        'PORT': '5432',
    }
}
```
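
Instead of hard-coding credentials in settings.py, the same dictionary can be built from environment variables. This is only a sketch: the variable names (`DB_NAME`, `DB_USER`, `DB_PASSWORD`, `DB_HOST`, `DB_PORT`) and the `database_config` helper are assumptions made for illustration and are not defined by the project.

```python
import os


def database_config(env=None):
    """Build a Django DATABASES dict from environment-style key/value pairs.

    The DB_* variable names are hypothetical; adjust them to your deployment.
    """
    env = os.environ if env is None else env
    return {
        "default": {
            "ENGINE": "django.db.backends.postgresql",
            "NAME": env.get("DB_NAME", "your_db_name"),
            "USER": env.get("DB_USER", "your_db_user"),
            "PASSWORD": env.get("DB_PASSWORD", ""),
            "HOST": env.get("DB_HOST", "localhost"),
            "PORT": env.get("DB_PORT", "5432"),
        }
    }
```

In settings.py this would be used as `DATABASES = database_config()`, which keeps the password out of version control.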

Static Files

For production, collect static files:

```
python manage.py collectstatic
```

Media Files

Uploaded files are stored in:

- media/uploads/ - User input files
- media/downloads/ - Corrected output files
- media/reports/ - Correction reports

🧪 Testing

Run the test suite:

```
python manage.py test
```

Run tests for a specific app:

```
python manage.py test base
```

🛠️ Technologies Used

- Backend Framework: Django 4.1.7
- ML Framework: HappyTransformer (BERT)
- Task Queue: Dramatiq with django-dramatiq
- String Similarity: Polyleven (Levenshtein distance)
- Database: SQLite (default) / PostgreSQL
- Frontend: HTML, CSS (SASS), JavaScript
- Data Processing: Pandas, NumPy

📊 API Endpoints

Public Routes

- / - Home page
- /login/ - User login
- /register/ - User registration
- /about/ - About page

Protected Routes (Login Required)

- /profile/ - User profile and task history
- /logout/ - User logout
- /update-password/ - Change password
- /update-user/ - Update user information

🐛 Troubleshooting

Common Issues

Import Errors
- Ensure all dependencies are installed: pip install -r requirements.txt
- Activate your virtual environment

Database Errors
- Run migrations: python manage.py migrate
- Check database configuration in settings.py

Static Files Not Loading
- Run: python manage.py collectstatic
- Check STATIC_URL and STATIC_ROOT in settings.py

Background Tasks Not Processing
- Ensure Dramatiq worker is running: python manage.py rundramatiq
- Check task queue configuration

ML Model Errors
- Verify model path in settings
- Ensure dictionary files are present and accessible
- Check model compatibility with HappyTransformer version

🤝 Contributing

Contributions are welcome! Please follow these steps:

- Fork the repository
- Create a feature branch: git checkout -b feature/YourFeature
- Commit your changes: git commit -m 'Add YourFeature'
- Push to the branch: git push origin feature/YourFeature
- Open a Pull Request

📝 License

This project is part of academic research on Persian spelling error correction using BERT.

📧 Contact

For questions or support, please open an issue in the repository.

🙏 Acknowledgments

- ParsBERT model contributors
- HappyTransformer library developers
- Django community

Repository: Amir79Naziri/SpellCorrectionApp