Extrating Text From Images and PDFs

Description

This project is a small web application developed with Django that allows users to upload text or PDF files to extract the text from them. The extracted text is displayed on the user interface for visualization. The project utilizes additional dependencies such as Tesseract for image manipulation and PDF file handling, enabling precise and efficient extraction of content from the uploaded documents.

Installation

Clone this repository on your local machine using the following command:

git clone https://github.com/JassielMG/Extrac-Text-Using-Tesseract-App.git

Navigate to the project directory:

cd Extrac-Text-Using-Tesseract-App

Create a virtual environment, activate it, and install the dependencies:

python -m venv venv

source venv/bin/activate (or venv\Scripts\activate.bat for Windows)

Install the project dependencies:

pip install -r requirements.txt

Usage

Run the Django server:

python manage.py runserver 0.0.0.0:8080

Open your browser and go to the following URL:

http://localhost:8080/

On the home page, you can upload a text or PDF file by clicking the "Choose File" button. Once the file is uploaded, click the "Upload File" button.
The extracted text from the file will be displayed on the same page.

Contact

If you have any questions, suggestions, or feedback, please feel free to reach out to me.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extrating Text From Images and PDFs

Description

Installation

Usage

Contact

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Extrating Text From Images and PDFs

Description

Installation

Usage

Contact