Awesome AI

A curated list of AI APIs, tools, and platforms. 400+ tools across 40+ categories.

⚠️ = aging (6-12 months since last commit) — may still work but watch for staleness.

How to Choose

I need to...	Start here
Call an LLM API	LLM API Providers
Run open-source models	Open Source LLM Models
Build AI agents	AI Agent Frameworks
Generate images	AI Image Generation APIs
Generate video	AI Video Generation APIs
Add voice/speech	AI Audio & Speech APIs
Search the web with AI	AI Search APIs
Store embeddings	Vector Databases & Embeddings
Build RAG pipelines	RAG Tools & Frameworks
Write code with AI	AI Code Assistants
Evaluate LLM outputs	LLM Evaluation & Monitoring
Deploy models to production	MLOps & Deployment
Connect LLMs to tools	MCP (Model Context Protocol)
Rent GPUs	Cloud GPU Providers

🧠 LLM API Providers

Tool	Description
OpenAI API	GPT-4o and o-series reasoning models with tool use, vision, and multimodal capabilities.
Anthropic Claude API	Claude model family with 200K context windows, vision, and agentic capabilities; safety-focused.
Google Gemini API	Gemini models with 1M+ token context, search integration, and Google Cloud ecosystem.
Mistral AI API	European AI lab with open-weight and commercial models under Apache 2.0 licensing at competitive prices.
DeepSeek API	DeepSeek V3 and R1 reasoning models at a fraction of competitor pricing with 128K context.
Cohere API	Command R+ models optimized for RAG, multilingual support, and enterprise deployment.
AI21 Labs API	Jamba models and Maestro orchestration system for enterprise NLP with strong multilingual capabilities.
xAI Grok API	Grok models with competitive pricing, image generation, and video generation.
Inflection AI	Creators of Pi, focused on empathetic and helpful interactions via enterprise API.
Reka AI	Multimodal models spanning text, images, and video understanding.

(⬆ back to top)

🔓 Open Source LLM Models

Tool	Stars	Description
Llama 4 (Meta)	-	Scout (17B active/109B experts) and Maverick variants, 128K context, strong general performance.
Qwen 3 (Alibaba)	-	MoE family (up to 235B parameters) excelling in multilingual and reasoning tasks with open weights.
DeepSeek V3	20k+	671B MoE model (MIT license) matching frontier models at a fraction of the cost.
DeepSeek R1	38k+	Reasoning model trained via large-scale RL achieving o1-level performance (MIT license).
Mistral Large / Mixtral	-	Mixtral MoE and Small 3 (24B) punching above their weight in efficiency.
Gemma 3 (Google)	-	Available in multiple sizes with strong performance for its parameter count.
Phi-4 (Microsoft)	-	Small language model built on synthetic datasets achieving state-of-the-art performance for its size.
Command R+ (Cohere)	-	Optimized for RAG and enterprise use with strong multilingual capabilities.
Yi (01.AI)	-	Bilingual (Chinese-English) models with strong performance in independent benchmarks.
StarCoder2	3k+	Code LLM in 3B/7B/15B sizes where the 15B matches 33B+ models on code evaluations.
OLMo (Allen AI)	-	Training data, code, and weights all publicly available for reproducible research.
DBRX (Databricks)	-	MoE model with 132B total / 36B active parameters for enterprise use cases.

(⬆ back to top)

🤔 AI Reasoning Models

Tool	Description
OpenAI o3 / o4-mini	Chain-of-thought reasoning with inference-time compute scaling for complex problems.
DeepSeek R1	Achieves o1-level performance at 5-10x lower cost ($0.55/$2.19 per 1M tokens).
Anthropic Claude (Extended Thinking)	Hybrid mode producing instant or deep-thinking responses without switching models.
Google Gemini 2.5 Pro (Thinking)	Built-in thinking capabilities for multi-step problem solving.
QwQ (Qwen with Questions)	Strong math and coding chain-of-thought capabilities in the Qwen family.

(⬆ back to top)

🚀 Open-Source Model Hosting & Inference

Tool	Description
Together AI	200+ models with sub-100ms latency, fine-tuning, and horizontal scaling.
Fireworks AI	Low-latency inference for open-weight models including DeepSeek, Llama, and Qwen.
Groq	Purpose-built LPU hardware delivering ultra-fast inference with extremely low latency.
Cerebras	Wafer-scale chip architecture eliminating GPU communication bottlenecks for fast inference.
SambaNova	Enterprise inference with OpenAI-compatible API and ultra-low latency.
Replicate	Run and deploy models via API with simple scaling across image, video, audio, and text.
Modal	Serverless cloud for AI models with pay-per-use pricing and infrastructure control.
Baseten	Deploy and serve ML models with infrastructure control and optimized performance.
DeepInfra	Cost-effective inference API with managed servers, GPUs, and auto-scaling.
Lepton AI	Pythonic framework for building and deploying AI services with cloud-native inference.
fal.ai	600+ models with competitive pricing, optimized for generative media.
Novita AI	Affordable access to models for text, image, and audio generation.

(⬆ back to top)

🔀 LLM API Routers & Gateways

Tool	Stars	Description
OpenRouter	-	Unified API for 100+ LLMs from multiple providers with transparent pricing comparison.
LiteLLM	18k+	Proxy/gateway providing a unified API to 100+ LLMs with load balancing, fallbacks, and spend tracking.
Martian	-	AI-powered router that automatically selects the best model for each request using Model Mapping.
Portkey	-	Enterprise gateway with load balancing, caching, fallbacks, and observability.
Helicone	-	Rust-based gateway with load balancing, native observability, and cost tracking.

(⬆ back to top)

☁️ Cloud AI Platforms (Managed)

Tool	Description
AWS Bedrock	Multi-vendor model marketplace with AgentCore, supporting Anthropic, Meta, Cohere, and Amazon models.
Azure AI Foundry	11,000+ models with GPT-4/5 access and deep Azure integration.
Google Vertex AI	Gemini family, Model Garden for 3rd-party/OSS models, and data-driven MLOps.
AWS SageMaker	End-to-end ML platform for building, training, and deploying models with managed infrastructure.
Azure Machine Learning	Automated ML, responsible AI dashboard, and managed endpoints.
Databricks	Unified analytics and AI with managed MLflow, Unity Catalog, and Mosaic AI.
IBM watsonx	Foundation models, model training, and AI governance for business workflows.

(⬆ back to top)

🤖 AI Agent Frameworks

Tool	Stars	Description
LangChain	-	LLM application framework with extensive integrations for chains, agents, and retrieval.
LangGraph	-	Cyclical graphs for agent runtimes with state management and human-in-the-loop. Extends LangChain.
LlamaIndex	-	RAG-first framework with 100+ data connectors and purpose-built retrieval infrastructure.
CrewAI	-	Role-based multi-agent systems with structured task delegation.
Microsoft AutoGen	-	Multi-agent conversation framework merging with Semantic Kernel into unified Microsoft Agent Framework.
Microsoft Semantic Kernel	-	LLM integration SDK with deep Azure ecosystem support across C#, Python, and Java.
DSPy	-	Declarative framework for programming (not prompting) language models with automatic prompt optimization.
Haystack	-	AI orchestration framework for LLM applications with modular pipelines.
Instructor	9k+	Structured output extraction using Pydantic models with validation and retry logic.
Pydantic AI	-	Agent framework from the Pydantic team with type-safe structured outputs.
OpenAI Agents SDK	7k+	OpenAI's official framework for agentic workflows with tool use, handoffs, and guardrails.
Google ADK (Agent Development Kit)	-	Google's framework with native Gemini integration and multi-agent orchestration.
Smolagents	15k+	Hugging Face's lightweight agent framework with minimal boilerplate.
CopilotKit	-	Framework for building AI copilots into React applications with AG-UI protocol support.

(⬆ back to top)

💾 AI Memory & Context Management

Each tool takes a different approach to giving agents persistent memory across sessions.

Tool	Stars	Description
Mem0	-	Dedicated memory layer extracting and managing memories across sessions with up to 80% token reduction.
Zep	-	Temporal knowledge graph tracking how facts change over time with structured business data.
Letta (MemGPT)	-	Agent framework with first-class memory management exposing editable memory blocks and stateful runtime.
Cognee	-	Memory engine finding hidden connections in data using knowledge graphs with semantic embeddings.
LangMem	1k+	Memory tools for LangGraph agents extracting information from conversations for long-term context.
Supermemory	-	Universal memory API with time-annotated semantic traces and scalable semantic recall.
Graphlit	-	Semantic content infrastructure with multimodal ingestion linked to a shared knowledge graph.

(⬆ back to top)

📊 Structured Output & Data Extraction

Tool	Stars	Description
Instructor	9k+	Structured output extraction using Pydantic models with validation and retry logic.
BAML	5k+	Domain-specific language for writing and testing LLM functions with type-safe validation.
Marvin	5k+	Lightweight Python library for natural language interfaces that extract structured data from LLMs.
Outlines	10k+	Constrained text generation using regular expressions, JSON schemas, and context-free grammars.
Guidance	19k+	Microsoft's language for controlling LLMs with interleaving generation, prompting, and logical control.
SGLang	8k+	Fast serving framework with built-in structured output via JSON schema, regex, and EBNF.
TypeChat	8k+	Uses TypeScript types to guide LLM outputs into well-typed structured responses.
JSONFormer	4k+	Constrains generation to only produce valid JSON tokens.
LlamaParse	-	Enterprise document parsing with agentic OCR for extracting structured data from complex documents.
Docling	18k+	IBM's document parser with 97.9% accuracy on complex table extraction.

(⬆ back to top)

💻 AI Code Assistants

Tool	Description
GitHub Copilot	20M+ users, deep IDE integration, agent mode, and enterprise features.
Cursor	AI-native IDE built on VS Code with deep codebase understanding, multi-file editing, and Composer agent.
Windsurf (Codeium)	Agentic IDE with Cascade flow combining deep context awareness with autonomous code generation and free tier.
Claude Code	Terminal-based agentic coding excelling at reasoning, multi-file refactoring, and autonomous workflows.
Tabnine	Privacy-first; multi-model support trained only on permissive licenses.
Amazon Q Developer	Autonomous agents, AWS resource management, and console diagnostics.
Sourcegraph Cody	Search-first architecture for analyzing entire multi-repo environments.
Continue.dev	Supports any LLM and deep IDE customization.
Aider	Free terminal-based AI pair programming tool for command-line workflows and automation.
Qodo (formerly CodiumAI)	Focused on generating meaningful test suites and code review.
JetBrains AI	Natively integrated into JetBrains IDEs with context-aware completions.
Google Gemini Code Assist	Powered by Gemini models with Google Cloud integration.

(⬆ back to top)

🎨 AI Image Generation APIs

Tool	Description
OpenAI DALL-E API	Strong prompt understanding and seamless GPT integration.
Stability AI API	Stable Diffusion 3.5 and SDXL models with fine-grained control and open-source weights.
Black Forest Labs (FLUX)	FLUX.2 and Kontext APIs with state-of-the-art photorealism and typography.
Ideogram API	Specializes in accurate text rendering, ideal for logos, posters, and branded content.
Recraft API	Excels at vector outputs (icons, logos) alongside raster images with style controls.
Leonardo AI API	Presets optimized for game art, textures, and product visuals.
Google Imagen API	Imagen 3/4 models via Vertex AI with studio-quality outputs.
Midjourney	Best for artistic and aesthetic outputs; accessible via Discord bot.

(⬆ back to top)

🎬 AI Video Generation APIs

Tool	Description
Runway API	Gen-4.5 with cinematic quality, motion brushes, and scene consistency.
Pika API	PikaSwaps, Pikaffects, and fast rendering optimized for social media.
Luma AI (Dream Machine)	4K HDR video with Ray3 engine and superior physics simulation.
Kling AI	Up to 2-minute videos at 1080p with simultaneous audio-visual generation.
OpenAI Sora	Cinematic-quality videos with realistic physics.
Synthesia API	230+ AI avatars, 140+ languages, and Express-2 engine for enterprise video.
HeyGen API	Realistic language dubbing with voice cloning and lip-sync.
D-ID API	Talking-head videos from images and audio, supporting 120+ languages.
Google Veo API	Native audio integration and high-fidelity output via Vertex AI.
Twelve Labs	Multimodal video understanding: search, classification, and generation from video content.

(⬆ back to top)

🎙️ AI Audio & Speech APIs

Tool	Description
ElevenLabs API	3,000+ voices, 70+ languages, emotional nuance, and voice cloning.
OpenAI Audio API	TTS and STT including gpt-4o-transcribe and Whisper for 50+ languages.
Deepgram API	STT and TTS (Aura-2) optimized for conversational AI with sub-250ms latency.
AssemblyAI API	STT with speaker diarization, sentiment analysis, and content moderation.
Cartesia API	Ultra-low-latency voice generation (<150ms) with rapid voice cloning for interactive use.
Play.ht API	900+ voices across 140+ languages and accents.
Resemble AI API	Expressive speech synthesis, voice cloning, and deepfake detection for enterprise.
Speechify API	Apple Design Award winner with 1,000+ voices in 60+ languages.
OpenAI Realtime API	Speech-to-speech for real-time voice applications with low latency.

(⬆ back to top)

🔍 AI Search APIs

Tool	Description
Perplexity Sonar API	LLM-powered search providing direct answers with cited sources.
Tavily API	Built for agents and RAG workflows with SOC 2 certification.
Exa API	Semantic/neural search scoring 94.9% on SimpleQA, returning meaning-based results.
Brave Search API	Independent index with no tracking and privacy-focused results.
SerpAPI	SERP scraping across search engines for AI features, SEO, and price monitoring.
You.com API	Deep Search, Express Agent, and MCP support.
Serper API	Fast, affordable Google SERP API for high-volume use cases.
WebSearch API	Web search API designed for AI workflow integration.

(⬆ back to top)

🌐 AI Translation APIs

Tool	Description
DeepL API	Known for natural, polished translations especially for European languages.
Google Cloud Translation	130+ languages with AutoML custom model training.
Microsoft Translator	100+ languages with custom translation models and document translation on Azure.
Amazon Translate	Real-time and batch translation integrated into the AWS ecosystem.
ModernMT	Adaptive engine that learns from human corrections in real-time.
Unbabel	Blends AI translation with human editors for enterprise content.
LILT	Adaptive neural models with human-in-the-loop workflow for enterprise localization.
Phrase	Developer-friendly localization with extensive APIs and CLI tools for custom pipelines.
Lokalise	400+ languages with tone, style, and industry customization.
Tolgee	In-context translating features, popular with JavaScript developers.

(⬆ back to top)

✍️ AI Writing & Content APIs

Tool	Description
Jasper API	Content platform for marketing teams with brand voice, campaign management, and generation.
Writer API	Full-stack content generation with brand governance and custom model training.
Copy.ai	Marketing copy, sales emails, and Go-To-Market workflows.
Writesonic	Fact-checked, SEO-friendly article generation pulling real-time data from Google.
Surfer SEO	SERP analysis and real-time content scoring for SEO optimization.
Frase	Analyzes SERPs in real-time with AI-driven content suggestions for SEO.
Grammarly API	Grammar, tone, clarity, and brand-voice checking for integration into applications.
Anyword	Predictive performance scoring that forecasts content performance before publishing.
Wordtune	Rewriting and paraphrasing by AI21 Labs for improving clarity, tone, and conciseness.

(⬆ back to top)

📐 Vector Databases & Embeddings

Vector Databases

Tool	Stars	Description
Pinecone	-	Fully managed with serverless architecture and support for billions of vectors.
Weaviate	-	Excels at hybrid search (vector + keyword) with GraphQL API.
Qdrant	-	High QPS at 99% recall, production-proven at scale.
Chroma	-	Developer-friendly embedding database ideal for prototyping RAG systems.
Milvus	-	Leads in low-latency benchmarks with support for billions of vectors.
Zilliz Cloud	-	Managed cloud service for Milvus with enterprise features and production SLAs.
pgvector	13k+	PostgreSQL extension keeping vectors and relational data together.
LanceDB	-	Serverless, built on Lance columnar format with zero-copy access.

Embedding APIs

Tool	Description
OpenAI Embeddings API	ada-002 and newer models for semantic search, clustering, and classification.
Voyage AI API	Voyage 4 series with shared embedding spaces and domain-specific models.
Cohere Embed API	Multilingual embeddings optimized for search and retrieval with Embed v3.
Google Embedding API	Text and multimodal embeddings via Vertex AI with Gecko and newer models.
Jina AI Embeddings	API-based embedding models with multilingual support and reranking.

(⬆ back to top)

📚 RAG Tools & Frameworks

Tool	Description
LlamaIndex	100+ data connectors, retrieval strategies, and production-ready pipelines.
Unstructured	Converts PDFs, images, HTML, and Word docs into structured data for ingestion.
Vectara	RAG-as-a-service with built-in semantic search, summarization, and hallucination detection.
Ragas	Reference-free evaluation of RAG pipelines with metrics for context relevance and groundedness.
LangChain Retrievers	Modular retrieval components supporting vector stores, multi-query, and contextual compression.
Cohere Rerank	Neural reranking that improves search relevance by reordering results on semantic similarity.
Morphik	RAG infrastructure for building and deploying retrieval-augmented generation systems.

(⬆ back to top)

📄 AI Document Processing & OCR

Tool	Stars	Description
Azure Document Intelligence	-	Extracts structure, relationships, and key-values from documents.
Google Document AI	-	Pre-trained models for invoices, receipts, and custom documents.
Amazon Textract	-	Extracts text, forms, and tables integrated with S3 and Lambda.
Mistral OCR	-	1000 pages/$ with state-of-the-art accuracy.
LlamaParse	-	Agentic OCR extracting structured data from complex documents.
Docling	18k+	IBM's parser with 97.9% accuracy on complex table extraction.
Mathpix	-	Specialized in mathematical equations, scientific documents, and LaTeX conversion.
Reducto	-	API-first parsing for extracting structured data from PDFs for RAG pipelines.
Mindee	-	Developer-friendly extraction for invoices, receipts, IDs, and custom document types.

(⬆ back to top)

🗃️ AI Database & SQL Assistants

Tool	Description
Vanna.ai	Self-hosted AI SQL agent with personalized schema training.
Defog.ai	Text-to-SQL engine achieving 90-95% accuracy, matching GPT-4 on specialized benchmarks.
Text2SQL.ai	Converts natural language to SQL with multiple dialect support.
DBHub	Universal database MCP server enabling AI assistants to query databases.
Outerbase	Spreadsheet-like views with EZQL natural language agent.
SQL Chat	Chat-based SQL client using natural language.
BlazeSQL	Generates SQL from natural language by parsing schema metadata.
DataGPT	Slack-native analytics answering business questions with raw SQL transparency.

(⬆ back to top)

📏 LLM Evaluation & Monitoring

Tool	Description
LangSmith	Tracing, prompt testing, and RAG assessment from the LangChain team.
Langfuse	LLM engineering platform for tracing, prompt management, evaluations, and observability.
Braintrust	Evaluation-first platform for systematic prompt experimentation with side-by-side comparison.
Arize AI	LLM observability with Phoenix tracing and production monitoring.
Promptfoo	Prompt engineering, A/B testing, and LLM-as-a-judge evaluations.
Humanloop	Prompt management and evaluation with human feedback loops.
WhyLabs	Detects anomalies, drift, hallucinations, and bias in production systems.
Galileo AI	Luna-2 evaluation models and real-time guardrail system for agent actions.
TruLens	Evaluating and tracing AI agents and RAG systems with OpenTelemetry tracing.
Maxim AI	Testing, monitoring, and optimizing LLM systems.
Evidently AI	Detects data drift and model performance degradation.
Fiddler AI	Focused on explainability, fairness, and compliance for ML and LLM models.

(⬆ back to top)

💰 AI Gateway & Cost Management

Tool	Description
Helicone	Gateway and observability with request logging, cost tracking, and Rust-based performance.
Portkey	Unified API with load balancing, semantic caching, and multi-provider fallbacks.
Keywords AI	Tracks usage, latency, and spend across providers.
Lunary	LLM monitoring and analytics with cost tracking and prompt management.
OpenRouter	Unified API marketplace for 100+ LLMs with transparent pricing and cost optimization.

(⬆ back to top)

🎯 Prompt Engineering & Management

Tool	Description
Langfuse	Prompt management with versioning, tracing, evaluations, and observability.
PromptLayer	Logging, versioning, visual editing, and regression testing.
Agenta	LLMOps platform for prompt management, evaluation, and A/B testing.
Pezzo	Lightweight prompt management for versioning, testing, and instant deployment.
Promptfoo	CLI for prompt testing, evaluation, and red-teaming with YAML-based configuration.
Maxim AI	Prompt management and evaluation with testing workflows and version control.

(⬆ back to top)

🔧 Fine-Tuning Platforms

Tool	Stars	Description
OpenAI Fine-Tuning	-	Managed fine-tuning for GPT models with reinforcement fine-tuning preview.
Together AI Fine-Tuning	-	Cloud-based fine-tuning for open-source models with LoRA adapter support.
Predibase	-	Low-code fine-tuning and hosting with LoRAX multi-adapter serving and serverless deployment.
LoRAX	3k+	Serves 100s of fine-tuned LoRA adapters on a single GPU.
Modal Fine-Tuning	-	Serverless GPU compute with pay-per-use pricing and zero idle costs.
Lambda Cloud	-	GPU cloud optimized for AI training with zero egress fees and InfiniBand networking.
Anyscale	-	Built on Ray for distributed fine-tuning and serving of LLMs.

(⬆ back to top)

🛡️ AI Safety & Guardrails

Tool	Stars	Description
Guardrails AI	-	LLM output validation with pre-built validators from Guardrails Hub.
NVIDIA NeMo Guardrails	4k+	Programmable safety guardrails using a domain-specific language.
Lakera Guard	-	Real-time prompt injection detection with customizable safeguards and monitoring dashboards.
LLM Guard	-	Scanners for prompt injection, PII, and toxicity in LLM interactions.
Rebuff	1k+	Self-hardening prompt injection detector that learns from attack attempts over time.
Azure AI Content Safety	-	Detects harmful content in text and images with configurable severity levels.

(⬆ back to top)

⚖️ AI Compliance & Governance

Tool	Description
Credo AI	Compliance automation with EU AI Act, NIST AI RMF, and ISO 42001.
Holistic AI	AI lifecycle governance assessing models for safety, fairness, bias, and regulatory alignment.
IBM watsonx.governance	Model lifecycle management, bias detection, and compliance tracking.
Zenity	AI agent compliance: risk detection, guardrail enforcement, and regulatory reporting.
Fairly AI	AI risk management for financial services with model validation and bias detection.
Drata	Compliance automation with AI-powered controls for SOC 2, ISO 27001, and AI regulations.
Securiti AI	Data command center with AI governance, data privacy, and consent management.

(⬆ back to top)

🏭 MLOps & Deployment

Tool	Description
MLflow	Experiment tracking, model registry, and deployment; MLflow 3 adds GenAI ops support.
BentoML	Build, ship, and scale AI model serving with high-performance inference.
KServe	Kubernetes-native serverless model serving with autoscaling.
ZenML	MLOps framework for portable ML pipelines across any infrastructure.
Metaflow	Netflix-originated framework for real-life ML workflows with deep AWS integration.
NVIDIA Triton Inference Server	GPU inference serving supporting multiple ML frameworks and model formats.
DVC (Data Version Control)	Version control for data, models, and experiments alongside Git.
Kubeflow	Google-backed ML toolkit for Kubernetes with pipeline orchestration and LLM fine-tuning.
Seldon Core	ML model serving at scale with A/B testing and canary rollouts.
Weights & Biases	Experiment tracking, model management, and dataset versioning for the full ML lifecycle.
Neptune.ai	Experiment tracking and model registry for logging, comparing, and collaborating on models.
Comet ML	Experiment management with real-time tracking, model comparison, and production monitoring.

(⬆ back to top)

⚡ AI Workflow & Automation

Tool	Description
n8n	AI agent nodes for building LLM-powered automation workflows.
Zapier AI	8,000+ app connections with AI-powered workflows, agents, and chatbots.
Make (Integromat)	Visual automation with detailed branching logic and deep app integrations.
Lindy AI	Agent-first automation for tasks like lead qualification and email follow-ups.
Relevance AI	No-code platform for building collaborating AI agent teams.
Activepieces	MIT-licensed no-code automation with AI workflow integration and self-hosting.
Pipedream	Developer-focused hybrid no-code/serverless platform for API workflows with code at any step.
Inngest	Durable functions replacing queues for reliable multi-step AI workflows.
Trigger.dev	TypeScript-first platform for deploying managed AI agents and background workflows.
Windmill	Workflow engine for internal tools, background jobs, and AI pipelines.
Hatchet	Distributed task queue for reliable background jobs and AI agent workflows.
Relay.app	Human-in-the-loop automation combining AI steps with human approval.

(⬆ back to top)

🔌 AI Function Calling & Tool Use

Tool	Description
Composio	150+ pre-built tool integrations for AI agents with authentication handling.
Toolhouse	Universal tool server enabling any LLM to use tools via a simple API.
Arcade AI	Authenticated API access to third-party services for AI agents.
NPI (Natural Programming Interface)	Tool-use APIs enabling AI agents to operate software tools.
Dify	Platform (93K+ stars) for AI agents with built-in RAG, Function Calling, and ReAct strategies.

(⬆ back to top)

🧪 AI Testing & QA

Tool	Description
testRigor	Generative AI test automation from plain English descriptions.
Applitools	Visual AI detecting UI bugs across browsers and devices.
Testsigma	Natural language test creation for web, mobile, and API.
Katalon	AI-powered test automation for web, mobile, API, and desktop.
mabl	Low-code, AI-driven test automation for Agile and DevOps.
QA Wolf	AI + human-in-the-loop QA with Playwright-based testing and massive parallelization.
Qodo	Generates meaningful test suites and improves code quality.

(⬆ back to top)

🧬 AI Synthetic Data Generation

Tool	Description
Gretel	Fine-tuning capabilities, privacy metrics, and domain-specific generation APIs.
Mostly AI	Transforms production data into privacy-safe versions for AI training.
Tonic.ai	Fabricate (synthetic data), Structural (test data), and Textual (data redaction).
YData	Automated data profiling with synthetic generation for improving AI training quality.
Hazy	Privacy-preserving synthetic data for regulated industries using differential privacy.
Synthesis AI	Specialized in computer vision training with photorealistic 3D-rendered images.

(⬆ back to top)

🏷️ AI Data Labeling & Annotation

Tool	Description
Scale AI	Data labeling with RLHF capabilities, used by major AI labs.
Labelbox	Annotation, data management, and Model Foundry with HIPAA/SOC2 compliance.
Snorkel AI	Programmatic labeling using weak supervision to encode labeling logic as functions.
Argilla	Free data annotation for NLP, LLMs, and multimodal models with Hugging Face integration.
Label Studio	Flexible templates, ML backends, and model-in-the-loop support.
Encord	Computer vision labeling with active learning and model-assisted labeling.
CVAT	Annotation tool for computer vision including image and video labeling.

(⬆ back to top)

🔗 MCP (Model Context Protocol)

Anthropic's open standard (donated to Linux Foundation AAIF) for connecting AI models to external tools and data sources.

Core Protocol

Tool	Stars	Description
Model Context Protocol Specification	-	The protocol spec defining how AI models connect to tools and data.
MCP TypeScript SDK	3k+	Official TypeScript SDK for building MCP servers and clients.
MCP Python SDK	4k+	Official Python SDK for building MCP servers and clients.
MCP Registry	-	Catalog with ~2,000 entries and 407% growth since launch.

Notable MCP Servers

Tool	Stars	Description
MCP Servers Repository	16k+	Official collection of reference server implementations.
Awesome MCP Servers	30k+	Community-curated collection covering 7,260+ servers across diverse integrations.
Playwright MCP	5k+	Microsoft's server enabling LLMs to interact with web pages through accessibility snapshots.
ActionKit by Paragon	-	Connects to 130+ SaaS integrations for AI agent workflows.

(⬆ back to top)

🛠️ AI Developer Tools & SDKs

Tool	Description
Vercel AI SDK	TypeScript toolkit for React/Next.js AI apps with composable agents and MCP support.
Streamlit	Python framework for data-heavy AI apps combining chat interfaces with dashboards.
Gradio	Python library for ML demo UIs with instant public links.
Chainlit	Python framework for conversational AI UIs with step-by-step visualization.
Langflow	Low-code Python-based builder for visually creating agentic and RAG applications.
Flowise	Node.js visual AI workflow builder for agents and chatbots.
Dify	LLM app development platform with visual workflow builder and model management.

(⬆ back to top)

⏱️ Realtime & Streaming AI

Tool	Description
OpenAI Realtime API	Speech-to-speech via WebSocket for real-time voice applications.
Google Gemini Live API	Real-time voice and video interaction with Gemini via WebSocket.
LiveKit	Real-time audio/video AI applications with WebRTC and agent frameworks.
Deepgram Streaming	Real-time STT with sub-250ms latency via WebSocket for live transcription.
AG-UI Protocol	Protocol for streaming agent events between AI backends and frontend UIs.
Decart	Lucy 2 model generating video live with near-zero latency.

(⬆ back to top)

🎭 Multi-Modal AI APIs

Tool	Description
OpenAI GPT-4o API	Natively multimodal: text, images, audio, and video with unified reasoning.
Google Gemini API	1M+ token context with native image/video/audio understanding.
Anthropic Claude API	Vision-capable: analyzes images, documents, charts, and screenshots alongside text.
Mistral Pixtral	Image understanding capabilities via API.
Meta Llama Multimodal	Vision capabilities via various inference providers.
Twelve Labs	Video understanding: search, classification, and generation from video.
Roboflow	Computer vision APIs for training, deploying, and managing detection and segmentation models.

(⬆ back to top)

📱 Edge & On-Device AI

Tool	Stars	Description
Ollama	-	Run LLMs locally with a single command; wide model support with easy management.
LM Studio	-	Desktop app for running LLMs locally with Vulkan GPU offloading and model discovery.
llama.cpp	75k+	C/C++ inference engine optimized for CPU-only inference with extensive quantization.
vLLM	-	High-throughput serving with PagedAttention and continuous batching.
NVIDIA TensorRT-LLM	10k+	30-70% faster throughput on NVIDIA GPUs.
MLX	18k+	Apple's ML framework for Apple Silicon achieving ~230 tokens/second.
ExecuTorch	3k+	Meta's on-device framework with 50KB footprint supporting 12+ hardware backends.
Jan	-	Offline-first desktop AI assistant for running LLMs locally with privacy-first design.
LocalAI	-	Self-hosted OpenAI-compatible API for running models locally.
LlamaEdge	-	Lightweight runtime for deploying LLMs at the edge using WebAssembly.

(⬆ back to top)

🖥️ Cloud GPU Providers

Tool	Description
RunPod	On-demand and serverless compute supporting A100/H100 GPUs at flexible pricing.
Lambda Cloud	H100/A100 instances with zero egress fees and InfiniBand networking.
CoreWeave	Kubernetes-native GPU cloud at hyperscale; serves OpenAI and Microsoft.
Vast.ai	GPU marketplace with competitive bidding, best for cost-sensitive workloads.
Nebius	AI-native cloud with Blackwell Ultra compute and European data centers.
TensorDock	GPU marketplace spanning 100+ global locations.
Paperspace (DigitalOcean)	Notebooks, virtual desktops, and serverless functions for ML development.
Crusoe Cloud	Clean-energy-powered GPU cloud for sustainable AI workloads.
Fluidstack	Distributed GPU cloud aggregating idle compute at competitive prices.

(⬆ back to top)

🧩 AI Browser Extensions & Desktop Apps

Tool	Description
ChatGPT Desktop	Native desktop app with system-wide access via global shortcuts.
Claude Desktop	MCP-based extensions and local tool integration.
Poe	Multi-model platform accessing GPT-4, Claude, Gemini, Llama, and custom bots in one interface.
Perplexity	AI search engine as web app and browser extension providing cited answers.
Raycast AI	macOS launcher with built-in AI and extensions for system-wide access.
Monica	Browser extension accessing GPT, Claude, Gemini with sidebar chat.
Sider	Browser sidebar unifying GPT, DeepSeek, Gemini, Claude, and Grok for comparing outputs.
Pieces for Developers	Captures live context from browsers to IDEs with multiple LLM support.
HARPA AI	Chrome extension combining ChatGPT, Claude, Gemini, Perplexity for web page automation.

(⬆ back to top)

🎮 AI Playgrounds & API Testing

Tool	Stars	Description
OpenAI Playground	-	Official tool for experimenting with OpenAI models and testing prompts.
Vercel AI Playground	-	Compare AI model outputs across providers side-by-side.
TypingMind	-	LLM frontend supporting multiple models with your own API keys and custom plugins.
ChatHub	-	Browser extension for using ChatGPT, Claude, Gemini side-by-side.
Msty	-	Desktop app for managing multiple LLM providers with conversation branching and prompt library.
OpenPlayground	3k+	LLM playground for running and comparing models locally.

(⬆ back to top)

🕷️ Web Scraping & Data Extraction for AI

Tool	Stars	Description
Firecrawl	-	Scrapes, crawls, and extracts structured data into LLM-ready formats.
Crawl4AI	58k+	AI-ready web crawler generating clean Markdown with local LLM support.
Spider	-	Fast web scraping designed for AI with structured data extraction.
Jina Reader	-	Converts any URL into LLM-friendly text.
ScrapeGraphAI	-	Scraping library using LLMs to create pipelines from natural language.

(⬆ back to top)

📖 AI Knowledge Management

Tool	Description
Glean	Enterprise search connecting Google Workspace, Microsoft 365, Slack, and Salesforce.
Notion AI	AI layer combining wiki, project management, and databases with AI search and writing.
Guru	Knowledge verification delivering knowledge at point-of-work via Slack and Teams.
Mem	Note-taking that self-organizes information using machine learning.
Slite	Collaborative docs using AI to categorize content, identify gaps, and surface relevant info.
Confluence AI	Atlassian's enterprise wiki with AI-powered search and content generation.

(⬆ back to top)

🏆 Benchmarks & Leaderboards

Tool	Description
Chatbot Arena (LMArena)	Crowdsourced LLM ranking using 6M+ blind pairwise votes and Elo ratings.
Artificial Analysis	Independent comparison of 100+ models across quality, price, speed, and latency.
Open LLM Leaderboard	Community benchmark tracking open-source model performance across standardized evaluations.
Stanford HELM	Holistic evaluation across 42 scenarios and 7 dimensions including accuracy, bias, and toxicity.
LiveBench	Continuously updated with new questions to prevent contamination.
LLM Stats	Aggregates multiple leaderboards and benchmarks in one place.
MTEB (Massive Text Embedding Benchmark)	Evaluates embedding models across retrieval, classification, and clustering.
EvalPlus	80x more test cases than HumanEval to address overfitting.
BigCodeBench	1,140 tasks requiring composing function calls from 139 libraries.

(⬆ back to top)

📚 Learning Resources

Guides & Tutorials

Resource	Stars	Description
OpenAI API Quickstart	-	Official getting started guide covering chat completions, tool use, and multimodal capabilities.
OpenAI Academy	-	Workshops from foundational AI literacy to advanced integration for engineers.
Anthropic Getting Started	-	Set up your environment, install an SDK, and send your first message to Claude.
Anthropic Quickstarts	1k+	Projects to help build deployable applications using the Claude API.
Google Gemini Quickstart	-	Step-by-step tutorials for getting started with Gemini models.
Google Gemini Cookbook	8k+	Official examples and guides for using the Gemini API.
OpenAI Cookbook	-	Recipes, guides, and notebooks for building with OpenAI models.
OpenAI Prompt Engineering Guide	-	Strategies for getting better results from LLMs.
Anthropic Prompt Engineering Tutorial	3k+	Interactive tutorial for effective prompt design with Claude.
DAIR.AI Prompt Engineering Guide	-	Community-driven guide covering techniques and examples for all skill levels.
Microsoft Generative AI for Beginners	70k+	Free 21-lesson course covering generative AI fundamentals with code examples.
Microsoft AI Agents for Beginners	15k+	Free 12-lesson course for getting started building AI agents.

Courses

Resource	Description
DeepLearning.AI Short Courses	50+ free short courses on LLMs, prompt engineering, RAG, agents, and fine-tuning.
fast.ai Practical Deep Learning	Free course teaching practical deep learning with a top-down approach.
Hugging Face LLM Course	Using, fine-tuning, and deploying LLMs with the Hugging Face ecosystem.
Google ML Crash Course	Free 15-hour self-study course on ML fundamentals.
LangChain Academy	Free courses on LangGraph agent architectures and prompt engineering.
Stanford CS224N	Stanford's flagship NLP course with free lecture videos and assignments.
Stanford CS229	Stanford's ML course.
Generative AI with LLMs (Coursera)	DeepLearning.AI and AWS course on generative models and fine-tuning.

Books

Resource	Description
AI Engineering (Chip Huyen)	Guide to building production AI systems; most-read book on O'Reilly since launch.
Build a Large Language Model From Scratch (Sebastian Raschka)	Step-by-step guide to coding a GPT-like LLM from scratch.
Designing Machine Learning Systems (Chip Huyen)	Holistic approach to designing ML systems for production.
Hands-On Large Language Models (O'Reilly)	Practical guide to understanding and applying LLMs in real-world scenarios.
Deep Learning for Coders with fastai and PyTorch	Hands-on approach to deep learning starting with practical applications.

Newsletters

Resource	Description
The Batch	Andrew Ng's weekly AI news for engineers, executives, and enthusiasts.
TLDR AI	Daily AI newsletter covering technical developments for 1.25M+ readers.
Ben's Bites	Daily newsletter with 120K+ subscribers on product launches and business use cases.
The Neuron	Morning-Brew-style AI newsletter for 550K+ professionals.
Latent Space	Technical AI newsletter and podcast on research and infrastructure.
Simon Willison's Weblog	Blog covering LLMs, AI tools, and open source from a prolific builder.

Podcasts

Resource	Description
Latent Space: The AI Engineer Podcast	Deep technical dives into foundation models, agents, and AI engineering.
Practical AI	Making AI practical, productive, and accessible with real-world discussions.
NVIDIA AI Podcast	Biweekly interviews exploring innovations in AI and deep learning.
Lex Fridman Podcast	Long-form conversations about science, AI, and intelligence.
Dwarkesh Podcast	Deeply researched interviews with leading thinkers in AI and technology.

YouTube Channels

Resource	Description
Andrej Karpathy	Former Tesla AI Director teaching neural networks from scratch in "Zero to Hero.".
3Blue1Brown	Beautifully animated explanations of mathematics, neural networks, and deep learning.
Yannic Kilcher	Detailed breakdowns of the latest ML research papers.
AI Explained	Clear explanations of AI concepts and model comparisons.
Two Minute Papers	Bite-sized summaries of cutting-edge AI research with visual explanations.
Umar Jamil	Highly technical implementations of ML and LLM techniques from scratch.

Communities

Resource	Description
r/LocalLLaMA	266K+ member community focused on running LLMs locally and open-source models.
r/MachineLearning	2.8M+ subscriber academic-leaning community for ML papers and research.
Hugging Face Community	Hub for sharing ML models, datasets, and Spaces.
OpenAI Developer Forum	Official forum for OpenAI API developers.
LangChain Discord	Developer community for building LLM applications with LangChain.

Research Paper Aggregators

Resource	Description
arXiv	Open-access preprint repository with 2M+ papers in AI, ML, and CS.
Semantic Scholar	AI-powered tool for finding and connecting scientific papers.
Papers With Code	ML papers linked with code implementations and benchmark results.
Connected Papers	Explores related papers through interactive citation graphs.
Hugging Face Daily Papers	Curated trending AI papers with links to models and datasets.

Conferences

Resource	Description
NeurIPS	Premier ML conference (December annually).
ICML	International Conference on Machine Learning (July 2026, Seoul).
ICLR	International Conference on Learning Representations (April 2026, Rio de Janeiro).
AI Engineer World's Fair	Practitioner-focused conference (June-July 2026, San Francisco).

(⬆ back to top)

🔗 Related Awesome Lists

List	Stars	Description
awesome-chatgpt-prompts	120k+	Curated ChatGPT prompts for creative and effective use.
awesome-llm-apps	20k+	LLM app examples with code.
awesome-generative-ai	7k+	Generative AI tools and resources.
awesome-langchain	7k+	Tools and projects using LangChain.
awesome-mcp-servers	30k+	Community-curated MCP server collection.
awesome-llm-agents	3k+	LLM agent frameworks and tools.
awesome-vibe-coding	5k+	Tools and resources for AI-assisted coding.
awesome-local-ai	3k+	Tools for running AI models locally.
awesome-scrapers	-	Scrapers, crawlers, and data extraction tools.
awesome-robotics	-	Robotics frameworks, simulators, and platforms.
awesome-web3-ai	-	Web3 x AI tools, agent frameworks, and protocols.

(⬆ back to top)

🪦 Deprecated Tools Graveyard

Dead Tool	Why	Use Instead
OpenAI Codex	Shut down Mar 2023	GPT-4o, Claude Code.
Hugging Face Inference API (free)	Removed free tier	Hugging Face Inference Endpoints (paid).
Jasper Boss Mode	Rebranded/discontinued	Jasper AI platform.
ChatGPT Plugins	Shut down Apr 2024	GPTs, function calling.
Bard	Renamed to Gemini	Google Gemini.
LangChain Agents (legacy)	Deprecated in favor of LangGraph	LangGraph.

(⬆ back to top)

Contributing

Contributions welcome! Please read the contribution guidelines first.

Add tools you've actually used or evaluated
Include star count and language where applicable
Note if a tool is unmaintained (last commit >1 year ago)
Commercial tools/services are fine but must be clearly labeled

License

To the extent possible under law, Edward Tay has waived all copyright and related or neighboring rights to this work.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github		.github
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
social-preview.png		social-preview.png

Folders and files

Latest commit

History

Repository files navigation

Awesome AI

How to Choose

🧠 LLM API Providers

🔓 Open Source LLM Models

🤔 AI Reasoning Models

🚀 Open-Source Model Hosting & Inference

🔀 LLM API Routers & Gateways

☁️ Cloud AI Platforms (Managed)

🤖 AI Agent Frameworks

💾 AI Memory & Context Management

📊 Structured Output & Data Extraction

💻 AI Code Assistants

🎨 AI Image Generation APIs

🎬 AI Video Generation APIs

🎙️ AI Audio & Speech APIs

🔍 AI Search APIs

🌐 AI Translation APIs

✍️ AI Writing & Content APIs

📐 Vector Databases & Embeddings

Vector Databases

Embedding APIs

📚 RAG Tools & Frameworks

📄 AI Document Processing & OCR

🗃️ AI Database & SQL Assistants

📏 LLM Evaluation & Monitoring

💰 AI Gateway & Cost Management

🎯 Prompt Engineering & Management

🔧 Fine-Tuning Platforms

🛡️ AI Safety & Guardrails

⚖️ AI Compliance & Governance

🏭 MLOps & Deployment

⚡ AI Workflow & Automation

🔌 AI Function Calling & Tool Use

🧪 AI Testing & QA

🧬 AI Synthetic Data Generation

🏷️ AI Data Labeling & Annotation

🔗 MCP (Model Context Protocol)

Core Protocol

Notable MCP Servers

🛠️ AI Developer Tools & SDKs

⏱️ Realtime & Streaming AI

🎭 Multi-Modal AI APIs

📱 Edge & On-Device AI

🖥️ Cloud GPU Providers

🧩 AI Browser Extensions & Desktop Apps

🎮 AI Playgrounds & API Testing

🕷️ Web Scraping & Data Extraction for AI

📖 AI Knowledge Management

🏆 Benchmarks & Leaderboards

📚 Learning Resources

Guides & Tutorials

Courses

Books

Newsletters

Podcasts

YouTube Channels

Communities

Research Paper Aggregators

Conferences

🔗 Related Awesome Lists

🪦 Deprecated Tools Graveyard

Contributing

License

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages