A tool that allows users to upload and store various file types in their own Supabase vector database.
Extract text from DOCX, PDFs, and spreadsheets, generating embeddings while storing them efficiently.
βοΈ Supports multiple file types: DOCX, PDFs, CSVs, and more
βοΈ Automatic text extraction from documents
βοΈ Embeddings generation for vector storage
βοΈ Seamless integration with Supabase
βοΈ User-friendly interface for easy file uploads
Follow these steps to install and run File2Vector on your local machine.
Before running the app, install the required Python packages.
pip install -r requirements.txtIf there is no requirements.txt, you can install the dependencies manually:
pip install streamlit(Add any additional dependencies if necessary.)
Move into the app/ directory:
cd appStart the application by running:
streamlit run main.pyThis will launch the File2Vector web app in your default browser.
-
Set up Supabase
- Go to Supabase
- Navigate to Project Settings > Data API
- Copy your Project URL and service_role key
- Paste them into the Upload tab of File2Vector
-
Upload Files
- Select the documents you want to convert into embeddings
- The tool will automatically process and store them in your vector database
-
Provide Feedback
- Use the contact page to share your experience or report issues
- API support for any Embedding provider
- Upload to any Vector Database
- Instant RAG functionality using your own LLM API
- More to be announced...
πΌ LinkedIn: Jack van der Vall
π GitHub: jackvandervall