A robust, AI-driven tool that extracts text from PDF documents and converts them into clean, unformatted DOCX files using Google's Gemini AI models.
- AI-Powered Extraction: Uses Google's advanced Gemini Flash models (1.5/2.0) for high-accuracy OCR.
- Smart Cleaning: Automatically removes repetitive headers, footers, and page numbers.
- Formatting Removal: Flattens complex PDF layouts into a linear Word document.
- Rate Limit Handling: Keeps requests within the Gemini API Free Tier rate limits.
- Privacy Focused: API keys are input securely and never stored.
- Interactive UI: Uses ipywidgets for configuration.
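
The README does not say how the Smart Cleaning step works. As a rough sketch of one common approach, the code below drops lines that recur at the top or bottom of most pages and lines that look like page numbers. The function name `strip_repeated_lines`, the `edge` and `min_share` parameters, and the page-number pattern are all assumptions made for illustration, not the notebook's actual code.

```python
import re
from collections import Counter

# Assumed pattern for page-number lines such as "7", "Page 7", "7 / 20", "Page 7 of 20".
PAGE_NUMBER = re.compile(r"^\s*(page\s+)?\d+(\s*(/|of)\s*\d+)?\s*$", re.IGNORECASE)


def strip_repeated_lines(pages, edge=2, min_share=0.6):
    """Remove likely headers, footers, and page numbers from page texts.

    pages: list of strings, one per extracted page.
    edge: how many lines at the top and bottom of a page are inspected.
    min_share: fraction of pages on which an edge line must recur
               before it is treated as a header or footer.
    """
    split_pages = [[ln for ln in p.splitlines() if ln.strip()] for p in pages]

    # Count each edge line at most once per page.
    counts = Counter()
    for lines in split_pages:
        counts.update({ln.strip() for ln in lines[:edge] + lines[-edge:]})

    # Require at least two occurrences so a single page is never stripped.
    threshold = max(2, min_share * len(pages))
    repeated = {ln for ln, n in counts.items() if n >= threshold}

    cleaned = []
    for lines in split_pages:
        kept = []
        for i, ln in enumerate(lines):
            at_edge = i < edge or i >= len(lines) - edge
            text = ln.strip()
            if at_edge and (text in repeated or PAGE_NUMBER.match(text)):
                continue  # drop header, footer, or page-number line
            kept.append(ln)
        cleaned.append("\n".join(kept))
    return cleaned
```

Only the first and last `edge` lines of each page are candidates, so body text that happens to repeat in the middle of pages is left alone. A body line at the page edge that consists only of a number would also be removed, which is a trade-off of this heuristic.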
- A Google account, needed to obtain a Gemini API key.
- Python 3.
git clone https://github.com/YOUR_USERNAME/PDF-to-DOCX-Gemini.git
cd PDF-to-DOCX-Gemini
pip install -r requirements.txt
- Open the notebook:
jupyter notebook PDF2DocX_ConvertAI.ipynb
- Run the initialization cells.
- Open the configuration form.
- Paste your Gemini API key.
- Load the models.
- Upload the PDF.
- Run the conversion.
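
Running the conversion sends requests to the Gemini API, which is where the Free Tier rate limits apply. The README does not describe the notebook's strategy, so the following is only a minimal sketch of one option: a sliding-window throttle that blocks until another request is allowed. The class name `RequestThrottle` and its parameters are hypothetical, and the real quota for your key must be checked in the Gemini API documentation.

```python
import time
from collections import deque


class RequestThrottle:
    """Allow at most `max_calls` calls in any `period`-second window."""

    def __init__(self, max_calls, period=60.0, clock=time.monotonic, sleep=time.sleep):
        if max_calls < 1:
            raise ValueError("max_calls must be at least 1")
        self.max_calls = max_calls
        self.period = period
        self.clock = clock    # injectable so the class can be tested without waiting
        self.sleep = sleep
        self.calls = deque()  # timestamps of recent calls, oldest first

    def wait(self):
        """Block until a call is allowed, then record it."""
        while True:
            now = self.clock()
            # Forget calls that have left the window.
            while self.calls and now - self.calls[0] >= self.period:
                self.calls.popleft()
            if len(self.calls) < self.max_calls:
                self.calls.append(now)
                return
            # Window is full: sleep until the oldest call expires.
            self.sleep(self.period - (now - self.calls[0]))
```

A caller would create one throttle, for example `RequestThrottle(max_calls=10)` with a placeholder limit, and call `throttle.wait()` immediately before each API request.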
- Core Logic: Python 3
- AI Model: Google Gemini
- Document Handling: python-docx
- Interface: ipywidgets
Pull requests and issues are welcome.
MIT License.
