Data Talks

An intelligent data analysis platform that lets you connect data sources, configure AI agents, and get insights through natural language questions.

Overview

Data Talks is a web app that changes how you work with your data. Through a simple interface you can:

Connect data sources (CSV, XLSX, BigQuery, SQL databases)
Configure custom AI agents
Ask questions in natural language about your data
Get visual answers with charts and tables
Set up alerts for ongoing monitoring

Main features

Data source management

File upload: CSV and XLSX support
BigQuery: Direct Google BigQuery integration
SQL databases: PostgreSQL, MySQL, and other SQL-compatible databases
Google Sheets: Direct spreadsheet connection
Automatic metadata: Column and type detection
Data preview: First rows preview
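
As a rough illustration of what column and type detection with a first-rows preview can look like, the sketch below uses only Python's standard library. The names `infer_type` and `describe_csv` and the type labels are hypothetical; they are not taken from the Data Talks backend, whose detection logic may differ.

```python
import csv
import io


def infer_type(values):
    """Guess a column type from its non-empty string values (hypothetical labels)."""
    values = [v for v in values if v]
    if not values:
        return "empty"
    for label, cast in (("integer", int), ("float", float)):
        try:
            for v in values:
                cast(v)
            return label
        except ValueError:
            continue
    return "text"


def describe_csv(text, preview_rows=5):
    """Return detected column types and the first rows of a CSV document."""
    rows = list(csv.DictReader(io.StringIO(text)))
    columns = list(rows[0].keys()) if rows else []
    types = {col: infer_type([row[col] for row in rows]) for col in columns}
    return {"types": types, "preview": rows[:preview_rows]}


sample = "order_id,amount,city\n1,19.90,Lisbon\n2,5,Madrid\n"
info = describe_csv(sample)
print(info["types"])
print(info["preview"])
```

A column is only labeled `integer` or `float` when every non-empty value parses as one, so a single stray string downgrades the column to `text`.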

AI agents

Custom setup: Name, description, and data sources
Suggested questions: Auto-suggestions from your data
Conversation history: Full interaction history
Sharing: Public or private agents

Q&A

Conversational UI: Ask in Portuguese, English, or Spanish
Visual answers: Auto-generated charts and tables
Follow-up questions: Suggestions to dig deeper
User feedback: Answer rating

SQL features

Multi-table queries: Ask questions across multiple SQL sources; the agent infers JOINs from column names (e.g. customer_id, order_id) or uses configured relationships
ER diagram: Entity-relationship view showing how tables connect through configured SQL links
SQL mode: When enabled in Agent Settings, the agent returns the raw SQL query instead of a written answer, which is useful for debugging or learning
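
To make the idea of inferring JOINs from column names concrete, here is a deliberately naive sketch: it pairs tables that share a column ending in `_id`. The function `infer_joins`, the `_id` suffix rule, and the sample schemas are assumptions for illustration only. How the agent actually infers joins is not described in this README.

```python
from itertools import combinations


def infer_joins(schemas, suffix="_id"):
    """Propose join conditions for table pairs that share a key-like column."""
    joins = []
    # Sort table names so the output order is deterministic.
    for (left, left_cols), (right, right_cols) in combinations(sorted(schemas.items()), 2):
        shared = sorted(c for c in set(left_cols) & set(right_cols) if c.endswith(suffix))
        for col in shared:
            joins.append(f"{left}.{col} = {right}.{col}")
    return joins


schemas = {
    "customers": ["customer_id", "name"],
    "orders": ["order_id", "customer_id", "total"],
    "order_items": ["order_id", "sku", "quantity"],
}
for condition in infer_joins(schemas):
    print(condition)
```

A name-based heuristic like this breaks down when key columns are named differently across tables, which is the case that configured relationships (SQL Links) cover.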

Alerts

Ongoing monitoring: Recurring alerts
Flexible schedule: Daily, weekly, or monthly
Notifications: Alerts when data changes
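
The daily, weekly, and monthly schedules can be pictured as a function that computes the next run from the previous one. The sketch below is an assumption about how such a schedule might be computed, not the platform's scheduler; `next_run` and its frequency strings are hypothetical names.

```python
import calendar
from datetime import datetime, timedelta


def next_run(last_run, frequency):
    """Return the next run time for a 'daily', 'weekly', or 'monthly' alert."""
    if frequency == "daily":
        return last_run + timedelta(days=1)
    if frequency == "weekly":
        return last_run + timedelta(weeks=1)
    if frequency == "monthly":
        year = last_run.year + last_run.month // 12
        month = last_run.month % 12 + 1
        # Clamp the day so that Jan 31 lands on the last day of February.
        day = min(last_run.day, calendar.monthrange(year, month)[1])
        return last_run.replace(year=year, month=month, day=day)
    raise ValueError(f"unsupported frequency: {frequency}")


print(next_run(datetime(2025, 1, 31, 9, 0), "monthly"))
```

Monthly schedules need the day clamp because not every month has a 29th, 30th, or 31st.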

Studio

Table summaries: Auto-generated executive reports for any data source (CSV, SQL, BigQuery, Google Sheets)
Audio overviews: Text-to-speech narration of source highlights using the configured audio model

Dashboards

Saved charts: Pin Q&A charts to a dashboard for quick reference
Custom layout: Position and resize charts freely
Multiple dashboards: Organize charts by topic or team

Telegram integration

Bot configuration: Register one or more Telegram bots
Agent linking: Connect an agent to a Telegram group via link token
Q&A over Telegram: Ask questions and receive answers directly in chat

Platform logs

LLM activity tracking: Every question and summary is logged with provider, model, and token usage
Channel attribution: See whether activity came from the workspace, Telegram, or Studio

Internationalization

Multilingual: Portuguese, English, and Spanish
Language persistence: Preference saved across sessions
Adaptive UI: All text translated dynamically

Tech stack

Frontend

React 18, TypeScript, Vite, Tailwind CSS, shadcn/ui

Backend

Python: FastAPI, SQLite (default) or PostgreSQL, Alembic migrations
LLM: OpenAI API, local Ollama, or LiteLLM proxy
Per-source scripts: CSV, Google Sheets, SQL (single and multi-source), BigQuery

LLM environment defaults

When you configure OPENAI_API_KEY in backend/.env, the backend uses these environment defaults unless you explicitly override them:

Text model: gpt-4o-mini
Audio model: gpt-4o-mini-tts

If you want different defaults, set OPENAI_MODEL and/or OPENAI_AUDIO_MODEL explicitly in backend/.env.
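
The fallback behavior described above can be sketched in a few lines of Python. The environment variable names and default model names come from this README; the function `resolve_openai_models`, and the choice to treat an empty value as unset, are assumptions rather than the backend's actual code.

```python
import os

DEFAULT_TEXT_MODEL = "gpt-4o-mini"
DEFAULT_AUDIO_MODEL = "gpt-4o-mini-tts"


def resolve_openai_models(env=None):
    """Pick text and audio models, falling back to the documented defaults."""
    env = os.environ if env is None else env
    return {
        # Assumption: an empty string counts as "not set".
        "text": env.get("OPENAI_MODEL") or DEFAULT_TEXT_MODEL,
        "audio": env.get("OPENAI_AUDIO_MODEL") or DEFAULT_AUDIO_MODEL,
    }


print(resolve_openai_models({"OPENAI_API_KEY": "sk-example"}))
print(resolve_openai_models({"OPENAI_API_KEY": "sk-example", "OPENAI_MODEL": "gpt-4o"}))
```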

State

React Context API, React Query, Local Storage

Project structure

```
data-talks/
├── src/                    # Frontend
│   ├── components/         # Reusable components
│   ├── contexts/           # React contexts (e.g. LanguageContext)
│   ├── hooks/              # Custom hooks
│   ├── lib/                # Utilities
│   ├── pages/              # Pages
│   └── services/           # API clients
├── backend/                # Python API (FastAPI, JWT auth, CRUD, scripts)
│   ├── app/                # FastAPI app, routers, per-source-type scripts
│   ├── alembic/            # Database migrations (SQLite + PostgreSQL)
│   └── pyproject.toml
└── public/                 # Static assets
```

How to run

Quick start with Make (recommended)

```
make install        # Install frontend and backend dependencies
make run            # Build frontend and start the server at http://localhost:8000
```

Other useful commands:

```
make install-cli    # Install only the data-talks CLI
make build          # Build frontend for production
make dev            # Start backend + frontend dev server with hot reload
make migrate        # Run database migrations
make setup-env      # Create backend/.env from .env.example
make lint           # Run frontend linter
make test           # Run frontend tests
make help           # List all available commands
```

Requires: Node.js, uv, and Python 3.11+.

Run the app at a single URL (manual)

To serve the UI from the backend at http://localhost:8000, with no separate frontend server:

Project root — install the frontend and build with the API URL:
```
npm install
npm run build
```
Backend — configure and start the API (it will serve the frontend at /):
```
cd backend
uv pip install -e .
uv run data-talks run
```
Open http://localhost:8000 in the browser. By default the app runs without login; set ENABLE_LOGIN=true in backend/.env to require authentication.

If the dist/ folder does not exist, visiting http://localhost:8000 will show a JSON message with instructions; run npm run build from the project root and restart the backend.

Backend only (API)

From the backend directory:

With uv:

```
cd backend
uv pip install -e .
cp .env.example .env
uv run data-talks run
```

With pip + venv:

```
cd backend
python -m venv .venv
source .venv/bin/activate   # Linux/macOS
# .venv\Scripts\activate    # Windows
pip install -e .
cp .env.example .env
data-talks run
```

If backend/.env contains only OPENAI_API_KEY, the backend falls back to gpt-4o-mini for text and gpt-4o-mini-tts for audio.

data-talks run — starts the API on 0.0.0.0:8000. Use --host and --port to override.
data-talks migrate — runs database migrations.
API: http://localhost:8000 · Docs: http://localhost:8000/docs

Frontend in dev mode (hot reload)

To develop the frontend with hot reload:

Backend running (as above).
From the project root: create .env.local with VITE_API_URL=http://localhost:8000, then run npm install and npm run dev.
Open http://localhost:8080.

How to use

By default, the app runs in guest mode: no login required. Choose your language (PT/EN/ES) and start.

Workspace: From the home page, select or create a workspace (agent).
Data sources: In the Sources panel, upload CSV/XLSX, connect BigQuery/Google Sheets, or add SQL databases.
Agent setup: Configure name, description, and data sources; add suggested questions. For multiple SQL sources, configure relationships (SQL Links) and optionally enable SQL mode in Agent Settings.
Ask questions: In the Chat panel, ask in natural language and get answers with charts.
Optional: Set up alerts, dashboards, Telegram connections, or Studio summaries.

When ENABLE_LOGIN=true in the backend, authentication is required before using the app.

Scripts

```
npm run dev      # Development server
npm run build    # Production build
npm run preview  # Preview production build
npm run lint     # Lint
```

Internationalization

The app supports Portuguese, English, and Spanish via a central translation layer:

Context: src/contexts/LanguageContext.tsx
Hook: useLanguage() for translations
Usage: t('key.subkey') in components
Storage: Language preference in localStorage
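
The `t('key.subkey')` pattern resolves a dotted key against a nested dictionary of translations. The real hook lives in the TypeScript frontend; the Python sketch below only illustrates the lookup idea. The `TRANSLATIONS` table, the `translate` function, and the fallback order (requested language, then English, then the key itself) are all assumptions, not the behavior of `useLanguage()`.

```python
TRANSLATIONS = {
    "en": {"chat": {"placeholder": "Ask a question"}},
    "pt": {"chat": {"placeholder": "Faça uma pergunta"}},
}


def translate(key, language, translations=TRANSLATIONS, fallback="en"):
    """Resolve a dotted key such as 'chat.placeholder' for the given language."""
    for lang in (language, fallback):
        node = translations.get(lang, {})
        for part in key.split("."):
            node = node.get(part) if isinstance(node, dict) else None
            if node is None:
                break
        if isinstance(node, str):
            return node
    # Assumption: an unknown key is returned unchanged so it stays visible in the UI.
    return key


print(translate("chat.placeholder", "pt"))
print(translate("chat.placeholder", "es"))
```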

Authentication

Guest mode (default): When ENABLE_LOGIN=false, the app opens directly with no login screen
Login mode: Set ENABLE_LOGIN=true to require email/password authentication
Admin role: Admin users can manage other users and platform settings
JWT tokens: Stateless authentication via Bearer tokens
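
"Stateless" here means the server can validate a token from its signature alone, without a session lookup. The sketch below shows the HS256 signing and verification idea with the standard library only. It is an illustration, not the backend's implementation: the helper names, the payload fields, and the secret are hypothetical, and it skips expiry checks that a maintained JWT library would handle.

```python
import base64
import hashlib
import hmac
import json


def _b64url(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def _signature(signing_input: str, secret: str) -> str:
    digest = hmac.new(secret.encode("utf-8"), signing_input.encode("utf-8"), hashlib.sha256).digest()
    return _b64url(digest)


def sign_token(payload: dict, secret: str) -> str:
    """Build header.payload.signature using HMAC-SHA256."""
    header = {"alg": "HS256", "typ": "JWT"}
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, payload)
    )
    return f"{signing_input}.{_signature(signing_input, secret)}"


def verify_token(token: str, secret: str) -> dict:
    """Return the payload if the signature matches, else raise ValueError."""
    signing_input, _, signature = token.rpartition(".")
    expected = _signature(signing_input, secret)
    if not hmac.compare_digest(expected.encode("utf-8"), signature.encode("utf-8")):
        raise ValueError("invalid signature")
    return json.loads(_b64url_decode(signing_input.split(".")[1]))


token = sign_token({"sub": "user@example.com", "role": "admin"}, "dev-secret")
headers = {"Authorization": f"Bearer {token}"}  # how a client would send it
print(verify_token(token, "dev-secret"))
```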

LLM providers

The backend supports three LLM providers. Configure one in backend/.env:

| Provider | Key env vars | Use case |
| --- | --- | --- |
| OpenAI | `OPENAI_API_KEY`, `OPENAI_MODEL` | Cloud API (default) |
| Ollama | `OLLAMA_BASE_URL`, `OLLAMA_MODEL` | Local/self-hosted models |
| LiteLLM | `LITELLM_BASE_URL`, `LITELLM_MODEL`, `LITELLM_API_KEY` | Proxy to 100+ providers |

Users can also create multiple LLM configurations per account and switch between them from the settings page.
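
As a quick way to see which provider a given backend/.env would enable, the sketch below parses KEY=VALUE lines and reports the providers whose main variable is set. The variable names come from the table above; `parse_env`, `configured_providers`, the choice of one "main" variable per provider, and the sample values are assumptions for illustration. The backend's own loading logic may differ.

```python
PROVIDER_KEYS = {
    "openai": "OPENAI_API_KEY",
    "ollama": "OLLAMA_BASE_URL",
    "litellm": "LITELLM_BASE_URL",
}


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip().strip('"').strip("'")
    return env


def configured_providers(env):
    """List providers whose main variable has a non-empty value."""
    return [name for name, key in PROVIDER_KEYS.items() if env.get(key)]


sample_env = """
# Example values only
OLLAMA_BASE_URL=http://localhost:11434
OLLAMA_MODEL=llama3
"""
print(configured_providers(parse_env(sample_env)))
```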

Deploy

Frontend: Connect the repo to Vercel, Netlify, or similar; set env vars; deploy on push.

Backend: Run the FastAPI server behind a reverse proxy (nginx, Caddy) or deploy as a Docker container. Set DATABASE_URL for PostgreSQL in production.

Contributing

Fork the project.
Create a feature branch (git checkout -b feature/AmazingFeature).
Commit changes (git commit -m 'Add AmazingFeature').
Push the branch (git push origin feature/AmazingFeature).
Open a Pull Request.

See CONTRIBUTING.md for the project standard (e.g. documentation and comments in English).

License

This project is under the Apache 2.0 license. See LICENSE for details.

Support

Repository: github.com/Empreiteiro/data-talks
Issues: GitHub Issues
Docs: In-code comments and this README

Data Talks — Turn data into insights with conversational AI.
