Pythinker

🚀 The Open-Source AI Agent That Does It All

Browse the web. Write & run code. Research anything. Generate beautiful reports.

Your self-hosted alternative to Manus AI — with 43+ tools and full autonomy.

Browse the web. Write & run code. Research anything. Generate beautiful reports.

Your self-hosted alternative to Manus AI — with 43+ tools and full autonomy.

🌐 Website · 📖 Docs · 🐛 Report Bug · 💡 Request Feature · 💬 Discussions

⭐ Give us a star!

If you find Pythinker useful, please consider giving us a star ⭐ on GitHub — it helps others discover this project and motivates continued development!

🎯 What is Pythinker?

Pythinker is an open-source, self-hosted AI agent platform that can autonomously browse the web, write & execute code, search the internet, manage files, and deliver polished research reports — all from a beautiful real-time interface.

Think of it as your personal AI researcher + developer + assistant — running entirely on your own infrastructure.

Built with ideas from Manus AI, OpenManus, and Nanobot.

Author: Mohamed Elkholy

🔥 Why Pythinker?

	Pythinker	Manus AI	ChatGPT	Other OSS Agents
Open Source	✅	❌	❌	✅
Self-Hosted	✅	❌	❌	✅
43+ Built-in Tools	✅	✅	❌	❌
Live Browser Streaming	✅	✅	❌	❌
Multi-LLM Support	✅	❌	❌	⚠️
Report Generation	✅	✅	❌	❌
Telegram Bot	✅	❌	❌	❌
Sandboxed Execution	✅	✅	✅	⚠️
Hybrid Memory (RAG)	✅	❌	✅	❌
MCP Tool Integration	✅	❌	❌	⚠️
Free	✅	❌	❌	✅

✨ Key Features

Feature	Description
🛠️ 43+ Built-in Tools	File, browser, shell, search, code, messaging, automation — the agent picks the right tool for every step
🌐 Live Browser Streaming	Watch the agent browse in real-time via CDP screencast — take over control at any moment
📊 Beautiful Report Generation	Automatically produces structured, citation-rich research reports with charts and references
🔒 Sandboxed Execution	Every task runs in an isolated Docker container with Chrome, Python, Node.js, and shell access
🤖 Multi-Model Support	Works with any OpenAI-compatible API — GPT-4o, Claude, DeepSeek, Kimi, GLM, local models, and more
🕷️ Intelligent Web Scraping	Three-tier scraping with automatic escalation — HTTP, dynamic rendering, stealth browser (powered by Scrapling)
📱 Telegram Integration	Full-featured Telegram bot gateway with inline buttons, file sharing, and streaming responses
🔌 MCP Tool Integration	Extend capabilities with external Model Context Protocol servers
🧠 PlanAct Agent Architecture	Intelligent planning, execution, reflection, and verification pipeline with 108 agent modules
💾 Hybrid Memory System	Semantic search over past sessions via Qdrant (BM25 + dense retrieval with RRF fusion)

🏗️ Architecture

+-----------------------------------------------------------------------+
|                         PYTHINKER PLATFORM                            |
|                                                                       |
|  +----------+  +---------------------------------------------------+  |
|  | Frontend  |  |              Backend (FastAPI)                    |  |
|  | Vue 3     |<>|  PlanAct Agent . 43+ Tools . SSE Streaming       |  |
|  | TypeScript|  |  Model Router  . DDD Services . Report Gen       |  |
|  +----------+  +--------------+-------------------+----------------+  |
|                               |                   |                   |
|  +----------+  +--------------v--+  +------------v--------------+    |
|  | Telegram  |  |    Sandbox(es)  |  |       Data Layer          |    |
|  | Gateway   |  |  Ubuntu Docker  |  | MongoDB . Redis . Qdrant  |    |
|  | (Nanobot) |  |  Chrome . Python|  | MinIO (Object Storage)    |    |
|  +----------+  +-----------------+  +---------------------------+    |
+-----------------------------------------------------------------------+

🛠️ Tech Stack

Layer	Technology
Frontend	Vue 3, TypeScript, Vite, TipTap, Monaco Editor, xterm.js, Plotly
Backend	FastAPI, Python 3.12+, Pydantic v2, Beanie ODM, SSE, WebSockets
Agent	PlanAct pipeline, adaptive model routing, tool efficiency monitoring, reflection & verification
Web Scraping	Scrapling (tiered: HTTP/TLS impersonation → dynamic rendering → stealth browser), Playwright
Sandbox	Ubuntu Docker, Chromium, Playwright, Supervisord, CDP screencast
Messaging	Nanobot multi-channel gateway (Telegram, Slack, Discord, DingTalk, Feishu)
Database	MongoDB 7.0 (sessions & state), Redis 8 (cache & coordination)
Vector Search	Qdrant (semantic memory, hybrid BM25 + dense retrieval)
Object Storage	MinIO (file uploads, artifacts, report assets)
CI/CD	GitHub Actions (lint, test, security scan, Docker build)
Monitoring	Prometheus, Grafana, Loki, Promtail

🧰 Tool Categories

The agent has access to 43+ tools organized into 10 categories:

Category	Tools	What They Do
📁 File	read, write, list, search, upload, download	Full filesystem access within the sandbox
🌐 Browser	navigate, click, type, screenshot, scroll, evaluate JS	Headless Chrome with live CDP streaming
🔍 Search	web search, scrape, extract	Internet research with multiple providers (Tavily, Serper, Exa)
💻 Shell	execute, background, interactive	Full terminal access with real-time output
💬 Message	ask user, notify, report	Communication and deliverable generation
🔌 MCP	external tool servers	Extensible via Model Context Protocol
🧬 Code	analyze, refactor, test	Code intelligence and manipulation
📋 Plan	create plan, update step, checkpoint	Structured task planning and tracking
⚡ Automation	batch operations, workflows	Multi-step automated sequences
🔧 System	health, config, diagnostics	Platform management and monitoring

🚀 Quick Start

Prerequisites

Docker 20.10+ and Docker Compose
An LLM API key (any OpenAI-compatible provider)

1. Clone & Configure

git clone https://github.com/mohamed-elkholy95/Pythinker.git
cd Pythinker
cp .env.example .env

Edit .env with your API credentials:

# LLM Configuration (any OpenAI-compatible API)
LLM_PROVIDER=openai
API_KEY=sk-your-api-key
API_BASE=https://api.openai.com/v1
MODEL_NAME=gpt-4o
TEMPERATURE=0.7
MAX_TOKENS=8192

# Security (required — change these)
SANDBOX_API_SECRET=your-secret-here
JWT_SECRET_KEY=your-jwt-secret

2. Start

docker compose up -d

3. Open

Visit http://localhost:5174 — log in and start your first research task.

💡 That's it! Three commands and you have a fully autonomous AI agent running.

🧠 How It Works

The PlanAct Pipeline

When you send a message, Pythinker's agent follows an intelligent pipeline:

User Message
     │
     v
┌──────────────┐  ┌──────────────────┐  ┌────────────────┐  ┌──────────────┐
│   Planning   │─>│    Execution     │─>│   Reflection   │─>│ Verification │
│              │  │                  │  │                │  │              │
│ • Analyze    │  │ • Run tools      │  │ • Evaluate     │  │ • Hallucina- │
│ • Decompose  │  │ • Browse web     │  │   quality      │  │   tion check │
│ • Route model│  │ • Execute code   │  │ • Critic review│  │ • Citation   │
│ • Checkpoint │  │ • Search & scrape│  │ • Self-correct │  │   integrity  │
└──────────────┘  └──────────────────┘  └────────────────┘  └──────────────┘

Agent Architecture Deep Dive

Pythinker's agent is not a simple prompt-and-respond loop. It is a modular, introspective system with 108 specialized modules across dedicated subsystems:

Planning — The Planner decomposes user requests into structured multi-step plans with checkpoints. A complexity assessor routes tasks to the appropriate model tier before planning begins.
Execution — The Step Executor runs each plan step, selecting from 43+ tools. A parallel executor can run independent steps concurrently. Tool efficiency is monitored to detect and break analysis paralysis (5+ consecutive reads without writes).
Reasoning & Reflection — After execution, a reflection layer evaluates output quality. A critic agent provides adversarial review. Chain-of-verification cross-checks claims against sources.
Memory — Conversation context, research traces, and tool results are managed through a sliding-window context system with role-scoped memory. Qdrant provides semantic search over past sessions using hybrid BM25 + dense retrieval with Reciprocal Rank Fusion (RRF).
Self-Healing — A stuck detector identifies when the agent is looping. A self-healing loop attempts recovery strategies. Error pattern analysis learns from failures to avoid them in future runs.
Verification — Hallucination detection (LLM-as-Judge), citation integrity checking, grounding validation, and output coverage validation ensure the final deliverable is accurate and well-sourced.
Document Intelligence — A context-aware document segmenter uses AST-based boundary detection to chunk long documents without splitting mid-function. An implementation tracker validates multi-file code completeness via AST + pattern analysis.
Security — Content safety gates, security assessors, and compliance gates run before any output reaches the user.

Web Scraping Pipeline

Pythinker uses Scrapling as its primary web scraping engine, providing a three-tier escalation strategy that balances speed with stealth:

Tier	Engine	When Used
Tier 1	HTTP + TLS impersonation (curl_cffi)	Default — fast, low overhead
Tier 2	Dynamic rendering (headless browser)	JavaScript-heavy pages
Tier 3	Stealth browser (Patchright)	Anti-bot protected sites

The agent's scraper automatically escalates through tiers when a lower tier fails (blocked, empty content, CAPTCHA detected). This is implemented via the domain Scraper Protocol with a clean port/adapter boundary — the domain layer never imports Scrapling directly.

Real-Time Features

SSE Event Streaming — Every agent action streams to the UI in real-time
Live Browser View — Watch the agent browse via CDP screencast; click to take over
Live Terminal — See shell commands execute with real-time output via xterm.js
Progress Tracking — Visual planning bar, step indicators, and tool timeline
File Preview — In-app code viewer with Monaco Editor and syntax highlighting

Adaptive Model Routing

Pythinker intelligently routes requests to different model tiers based on task complexity:

Fast tier — Simple queries, quick responses (60-70% latency reduction)
Balanced tier — Standard research and coding tasks
Powerful tier — Complex multi-step reasoning and report generation

📱 Telegram Bot

Pythinker includes a multi-channel messaging gateway powered by Nanobot:

Start research tasks from Telegram with /research
Receive streaming responses with inline action buttons
Upload and download files directly
Share reports and artifacts as PDF
Supports Telegram, Slack, Discord, DingTalk, Feishu, and more

Configure in .env:

CHANNEL_GATEWAY_ENABLED=true
TELEGRAM_BOT_TOKEN=your-bot-token
TELEGRAM_CHANNEL_ENABLED=true

🔒 Security

Sandboxed execution — Each task runs in an isolated Docker container with resource limits
Security-hardened containers — no-new-privileges, cap_drop: ALL, minimal capabilities
No direct sandbox access — All browser/terminal access is proxied through authenticated backend endpoints
JWT authentication — Secure user sessions with configurable auth providers
Secret scanning — TruffleHog + GitHub secret scanning in CI
Dependency auditing — pip-audit and npm audit run automatically
Network isolation — Internal services run on private Docker networks
OWASP security headers — Content-Security-Policy, HSTS, X-Frame-Options on all responses
Container scanning — Trivy vulnerability scanning in CI pipeline

See SECURITY.md for our vulnerability reporting policy.

🧑‍💻 Development

Project Structure

Pythinker/
├── frontend/          # Vue 3 + TypeScript + Vite
├── backend/           # FastAPI + Python 3.12 + DDD architecture
│   ├── app/
│   │   ├── core/           # Configuration, settings, lifespan
│   │   ├── domain/         # Models, services, agents (108 modules), tools
│   │   ├── application/    # Use case orchestration, DTOs
│   │   ├── infrastructure/ # External integrations (LLM, DB, browser, scraper)
│   │   └── interfaces/     # API routes, WebSocket handlers
│   ├── nanobot/       # Multi-channel messaging gateway (vendored)
│   ├── scripts/       # Utility & migration scripts
│   └── tests/         # 3,800+ tests
├── sandbox/           # Ubuntu Docker sandbox with Chrome
├── grafana/           # Dashboards & monitoring configs
├── scripts/           # Utility scripts
└── docs/              # Architecture docs & guides

Running Locally

# Development mode with hot reload
docker compose up --watch

# Ports:
# 5174 -> Frontend (Vite dev server)
# 8000 -> Backend API
# 8083 -> Sandbox API (localhost only)

Testing

# Backend
cd backend
ruff check . && ruff format --check .  # Lint
pytest tests/ -v --tb=short            # Tests

# Frontend
cd frontend
bun install
bun run lint        # ESLint
bun run type-check  # TypeScript
bun run test:run    # Vitest

Monitoring

Pythinker ships with a full observability stack:

Prometheus — Metrics collection (SSE connections, sandbox health, API latency, tool efficiency)
Grafana — Pre-configured dashboards
Loki + Promtail — Log aggregation and search

docker compose -f docker-compose-monitoring.yml up -d
# Grafana: http://localhost:3000

⚙️ Configuration

All configuration is via .env. Key sections:

Category	Variables	Description
LLM	`API_KEY`, `API_BASE`, `MODEL_NAME`	Primary model configuration
Fast Model	`FAST_MODEL`	Optional fast-tier model for simple tasks
Search	`SEARCH_PROVIDER`, `TAVILY_API_KEY`	Web search provider
Auth	`AUTH_PROVIDER`, `JWT_SECRET_KEY`	Authentication settings
Sandbox	`SANDBOX_IMAGE`, `SANDBOX_LIFECYCLE_MODE`	Container lifecycle
Telegram	`TELEGRAM_BOT_TOKEN`	Bot gateway
Storage	`MINIO_ROOT_USER/PASSWORD`	Object storage
Multi-Key	`_API_KEY`, `_API_KEY_2`, `*_API_KEY_3`	API key rotation with failover

See .env.example for the complete reference with documentation.

🙏 Acknowledgments

Pythinker is inspired by and builds upon ideas from these projects:

Manus AI — The original vision of an AI agent that can browse, code, and deliver research autonomously. Pythinker's PlanAct pipeline and live browser streaming are directly inspired by Manus.
OpenManus — The open-source Manus implementation that demonstrated the feasibility of a self-hosted agent with sandbox isolation.
Nanobot — Multi-channel AI agent framework. Pythinker vendors Nanobot as its Telegram/Slack/Discord messaging gateway, bridging the MessageBus to Pythinker's AgentService.
Scrapling — Intelligent web scraping library with TLS fingerprinting, anti-bot evasion, and tiered fetching. Pythinker uses Scrapling as its primary scraping engine with automatic tier escalation (HTTP → dynamic → stealth).
browser-use — Browser automation library used for autonomous multi-step web workflows.
Playwright — Browser control and CDP screencast streaming.
Qdrant — Vector search engine powering Pythinker's hybrid semantic memory system.

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for:

Development setup
Code style and commit conventions
Pull request process
Architecture overview

Please read our Code of Conduct before contributing.

📄 License

If Pythinker helps you, please ⭐ star this repo — it really makes a difference!

⬆ Back to Top

Name		Name	Last commit message	Last commit date
Latest commit History 1,797 Commits
.claude		.claude
.codex		.codex
.cursor/rules		.cursor/rules
.github		.github
.opencode		.opencode
.pythinker-cron		.pythinker-cron
backend		backend
docs		docs
examples		examples
frontend		frontend
grafana		grafana
loki		loki
memory		memory
mockserver		mockserver
monitoring		monitoring
prometheus		prometheus
promtail		promtail
qdrant		qdrant
sandbox		sandbox
scripts		scripts
skills		skills
tests/scripts		tests/scripts
.dev.sh		.dev.sh
.editorconfig		.editorconfig
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Playbook.md		Playbook.md
Pythinker.code-workspace		Pythinker.code-workspace
QUICK_START_2026.md		QUICK_START_2026.md
README.md		README.md
SECURITY.md		SECURITY.md
SOUL.md		SOUL.md
build.sh		build.sh
dev.sh		dev.sh
docker-compose-deploy.yml		docker-compose-deploy.yml
docker-compose-monitoring.yml		docker-compose-monitoring.yml
docker-compose-production.yml		docker-compose-production.yml
docker-compose.plotly.yml		docker-compose.plotly.yml
docker-compose.yml		docker-compose.yml
instructions.md		instructions.md
mcp.json.example		mcp.json.example
monitor_containers.sh		monitor_containers.sh
opencode.json		opencode.json
report_12.md		report_12.md
run.sh		run.sh
show_context_stats.py		show_context_stats.py
test-like-github.sh		test-like-github.sh
test-report-e2e-benchmark-2026-03-22.md		test-report-e2e-benchmark-2026-03-22.md
test_openrouter.py		test_openrouter.py
test_resource_blocking.py		test_resource_blocking.py

Folders and files

Latest commit

History

Repository files navigation

Pythinker

🚀 The Open-Source AI Agent That Does It All

⭐ Give us a star!

🎯 What is Pythinker?

🔥 Why Pythinker?

✨ Key Features

🏗️ Architecture

🛠️ Tech Stack

🧰 Tool Categories

🚀 Quick Start

Prerequisites

1. Clone & Configure

2. Start

3. Open

🧠 How It Works

The PlanAct Pipeline

Agent Architecture Deep Dive

Web Scraping Pipeline

Real-Time Features

Adaptive Model Routing

📱 Telegram Bot

🔒 Security

🧑‍💻 Development

Project Structure

Running Locally

Testing

Monitoring

⚙️ Configuration

🙏 Acknowledgments

🤝 Contributing

📄 License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 12

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages