Extracting Text From Images and PDFs

Description

This project is a small web application built with Django that lets users upload an image or PDF file and extract the text from it. The extracted text is displayed in the user interface. The project relies on additional dependencies, including Tesseract for optical character recognition (OCR), to process images and PDF files and extract the content of the uploaded documents.
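The extraction step described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it assumes the `pytesseract`, `Pillow`, and `pdf2image` packages are used (the latter needs Poppler installed), and the names `detect_kind`, `extract_text`, and `IMAGE_SUFFIXES` are hypothetical.

```python
from pathlib import Path

# Hypothetical list of accepted image extensions; the real app may differ.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def detect_kind(filename: str) -> str:
    """Classify an upload as 'pdf' or 'image' from its file extension."""
    suffix = Path(filename).suffix.lower()
    if suffix == ".pdf":
        return "pdf"
    if suffix in IMAGE_SUFFIXES:
        return "image"
    raise ValueError(f"Unsupported file type: {suffix or '(none)'}")


def extract_text(filename: str, data: bytes) -> str:
    """Run OCR on the uploaded bytes and return the recognized text."""
    kind = detect_kind(filename)

    # Imported lazily so detect_kind() works without the OCR stack installed.
    import io

    import pytesseract
    from PIL import Image

    if kind == "image":
        pages = [Image.open(io.BytesIO(data))]
    else:
        # Assumption: PDFs are rendered page by page to images before OCR.
        from pdf2image import convert_from_bytes

        pages = convert_from_bytes(data)

    # OCR each page and join the results with newlines.
    return "\n".join(pytesseract.image_to_string(page) for page in pages)
```

In a Django view, a function like this could be called with the name and bytes of the uploaded file, and the returned string passed to the template for display.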

Installation

  1. Clone this repository to your local machine:
git clone https://github.com/JassielMG/Extrac-Text-Using-Tesseract-App.git
  2. Navigate to the project directory:
cd Extrac-Text-Using-Tesseract-App
  3. Create a virtual environment and activate it:
python -m venv venv
source venv/bin/activate (or venv\Scripts\activate.bat on Windows)
  4. Install the project dependencies:
pip install -r requirements.txt
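
Note that pip installs Python packages only; the Tesseract OCR engine itself is a separate program that must be installed on the system. As a quick sanity check, and assuming the app invokes the `tesseract` executable found on `PATH`, a small script along these lines can confirm it is reachable (the function name `tesseract_available` is hypothetical):

```python
import shutil


def tesseract_available(binary: str = "tesseract") -> bool:
    """Return True if the named executable can be found on PATH."""
    return shutil.which(binary) is not None


if __name__ == "__main__":
    if tesseract_available():
        print("Tesseract found at", shutil.which("tesseract"))
    else:
        print("Tesseract not found on PATH; install it before running the app.")
```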

Usage

  1. Run the Django server:
python manage.py runserver 0.0.0.0:8080
  2. Open your browser and go to the following URL:
http://localhost:8080/
  3. On the home page, select an image or PDF file by clicking the "Choose File" button, then click the "Upload File" button.
  4. The text extracted from the file is displayed on the same page.

[Image: Home Page]

Contact

If you have any questions, suggestions, or feedback, please feel free to reach out to me.