AI-Powered Data Analyst Agent An intelligent Data Analyst Agent that processes documents (.doc, .txt, .xlsx, .csv, .pdf, images), answers queries, generates visualizations, and leverages LLMs (OpenAI/Gemini) for insights. Built with Streamlit, Pandas, PyMuPDF, and OpenAI/Gemini API, this tool is ideal for automating data analysis tasks.
Features ✅ Document Processing – Supports .doc, .txt, .xlsx, .csv, .pdf, images ✅ Query-Based Analysis – Ask questions, get insights ✅ Data Visualization – Generates graphs with Matplotlib & Seaborn ✅ LLM-Powered – Uses Google Gemini/OpenAI API for intelligent responses ✅ Streamlit UI – Interactive & easy-to-use
Installation Clone the Repository
git clone https://github.com/yourusername/data-analyst-agent.git cd data-analyst-agent
Install Dependencies
pip install -r requirements.txt And in the code you have to add you gemini api key to use gemini.
Run the Application
streamlit run app.py
Tech Stack 🔹 Python (Pandas, NumPy, Matplotlib, Seaborn) 🔹 Machine Learning (LLMs via OpenAI/Gemini API) 🔹 OCR & NLP (PyMuPDF, Pytesseract) 🔹 Web Framework (Streamlit)
Usage Upload documents (PDF, CSV, etc.) Ask questions about the data View graphs & summaries Get AI-driven insights To-Do ☑️ Enhance LLM integration ☑️ Add database support (SQL) ☑️ Improve UI for better interactivity