- The description now clearly outlines the features of the project, specifying the key functions it offers, such as:
- Image-to-text recognition
- Audio-to-text conversion
- Text generation
- An automation page to convert media and generate content.
- ReactPy: For creating the home page UI.
- Gradio: For interacting with models.
- Transformers: For encoding and decoding text, images, and audio.
- Streamlit: For handling the automation page.
- PyAutoGui: For performing the automatic operation on the pc
- Selenium: For operating the webbrowser.
- Tesseract OCR: For image-to-text conversion.
- Speech Recognition: For converting audio to text.
- GPT-2: For text generation.
- Sentiment Analyzer: For analyzing sentiment in text.
- Create a virtual environment using the following command:
python -m venv env
- Activate the virtual environment:
- For Windows:
cd env\Scripts .\activate
- For macOS/Linux:
source env/bin/activate
- For Windows:
Once inside the activated virtual environment, install the required libraries:
pip install reactpy transformers gradio streamlit pyautogui seleniumOpen three terminal windows to run the following commands in parallel:
-
Run the main UI:
python ui.py
This will run the main home page of the project on a local server.
-
Activate the Gradio bots:
python app.py
This will start the Gradio bots on a local server.
-
Activate the Automation page:
streamlit automation.py
This will run the Streamlit automation page on a local server.
- Project Directory:
env/: Virtual environment folderui.py: ReactPy-based UI for the home pageapp.py: Gradio bot integrationautomation.py: Automation page logic using Streamlit




