A desktop automation tool that cleans, filters, and analyzes Excel files — with one click.
| Feature | Description |
|---|---|
| 🗑️ Remove empty rows | Automatically drops fully empty rows |
| 🔁 Remove duplicates | Detects and removes duplicate entries |
| 🏷️ Remove unnamed columns | Cleans up Excel export artifacts |
| 🔍 Filter by column value | Narrow down data by any column |
| 🔃 Sort by column | Ascending or descending order |
| 📊 Numeric summary | Auto-generates sum & mean for all number columns |
| 📄 PDF Report | Exports a clean PDF report after processing |
| ⚡ Threaded processing | UI stays responsive during large file operations |
pip install pandas customtkinter openpyxl fpdf2python main.py- Select your
.xlsxfile via the file dialog - Configure cleaning options in the Cleaner tab
- Filter rows by column value in the Filter tab
- Sort data in the Statistics tab
- Click "Clean and Save File" — done!
The cleaned file is saved as yourfile_cleaned.xlsx in the same directory. A PDF report is generated automatically.
├── main.py # Entry point
├── gui.py # CustomTkinter GUI application
├── excel_processing.py # Core cleaning logic (pandas)
└── pdf_report.py # PDF report generation (fpdf2)
- Python 3.8+
- Pandas – data processing
- CustomTkinter – modern dark-mode GUI
- OpenPyXL – Excel read/write
- fpdf2 – PDF generation
- Threading – non-blocking UI
- Cleaning raw Excel exports from CRM or ERP systems
- Preparing data before importing into databases
- Generating quick summary reports for clients
- Automating repetitive data-cleaning workflows
Have messy Excel data you want automated? Feel free to reach out!
Built with ❤️ by [Goci13] — Open to freelance projects



