Skip to content

edwardtay/awesome-ai

Awesome AI Awesome

Stars Links Last Commit PRs Welcome License: CC0

A curated list of AI APIs, tools, and platforms. 400+ tools across 40+ categories.

⚠️ = aging (6-12 months since last commit) — may still work but watch for staleness.

How to Choose

I need to... Start here
Call an LLM API LLM API Providers
Run open-source models Open Source LLM Models
Build AI agents AI Agent Frameworks
Generate images AI Image Generation APIs
Generate video AI Video Generation APIs
Add voice/speech AI Audio & Speech APIs
Search the web with AI AI Search APIs
Store embeddings Vector Databases & Embeddings
Build RAG pipelines RAG Tools & Frameworks
Write code with AI AI Code Assistants
Evaluate LLM outputs LLM Evaluation & Monitoring
Deploy models to production MLOps & Deployment
Connect LLMs to tools MCP (Model Context Protocol)
Rent GPUs Cloud GPU Providers

🧠 LLM API Providers

Tool Description
OpenAI API GPT-4o and o-series reasoning models with tool use, vision, and multimodal capabilities.
Anthropic Claude API Claude model family with 200K context windows, vision, and agentic capabilities; safety-focused.
Google Gemini API Gemini models with 1M+ token context, search integration, and Google Cloud ecosystem.
Mistral AI API European AI lab with open-weight and commercial models under Apache 2.0 licensing at competitive prices.
DeepSeek API DeepSeek V3 and R1 reasoning models at a fraction of competitor pricing with 128K context.
Cohere API Command R+ models optimized for RAG, multilingual support, and enterprise deployment.
AI21 Labs API Jamba models and Maestro orchestration system for enterprise NLP with strong multilingual capabilities.
xAI Grok API Grok models with competitive pricing, image generation, and video generation.
Inflection AI Creators of Pi, focused on empathetic and helpful interactions via enterprise API.
Reka AI Multimodal models spanning text, images, and video understanding.

(⬆ back to top)

🔓 Open Source LLM Models

Tool Stars Description
Llama 4 (Meta) - Scout (17B active/109B experts) and Maverick variants, 128K context, strong general performance.
Qwen 3 (Alibaba) - MoE family (up to 235B parameters) excelling in multilingual and reasoning tasks with open weights.
DeepSeek V3 20k+ 671B MoE model (MIT license) matching frontier models at a fraction of the cost.
DeepSeek R1 38k+ Reasoning model trained via large-scale RL achieving o1-level performance (MIT license).
Mistral Large / Mixtral - Mixtral MoE and Small 3 (24B) punching above their weight in efficiency.
Gemma 3 (Google) - Available in multiple sizes with strong performance for its parameter count.
Phi-4 (Microsoft) - Small language model built on synthetic datasets achieving state-of-the-art performance for its size.
Command R+ (Cohere) - Optimized for RAG and enterprise use with strong multilingual capabilities.
Yi (01.AI) - Bilingual (Chinese-English) models with strong performance in independent benchmarks.
StarCoder2 3k+ Code LLM in 3B/7B/15B sizes where the 15B matches 33B+ models on code evaluations.
OLMo (Allen AI) - Training data, code, and weights all publicly available for reproducible research.
DBRX (Databricks) - MoE model with 132B total / 36B active parameters for enterprise use cases.

(⬆ back to top)

🤔 AI Reasoning Models

Tool Description
OpenAI o3 / o4-mini Chain-of-thought reasoning with inference-time compute scaling for complex problems.
DeepSeek R1 Achieves o1-level performance at 5-10x lower cost ($0.55/$2.19 per 1M tokens).
Anthropic Claude (Extended Thinking) Hybrid mode producing instant or deep-thinking responses without switching models.
Google Gemini 2.5 Pro (Thinking) Built-in thinking capabilities for multi-step problem solving.
QwQ (Qwen with Questions) Strong math and coding chain-of-thought capabilities in the Qwen family.

(⬆ back to top)

🚀 Open-Source Model Hosting & Inference

Tool Description
Together AI 200+ models with sub-100ms latency, fine-tuning, and horizontal scaling.
Fireworks AI Low-latency inference for open-weight models including DeepSeek, Llama, and Qwen.
Groq Purpose-built LPU hardware delivering ultra-fast inference with extremely low latency.
Cerebras Wafer-scale chip architecture eliminating GPU communication bottlenecks for fast inference.
SambaNova Enterprise inference with OpenAI-compatible API and ultra-low latency.
Replicate Run and deploy models via API with simple scaling across image, video, audio, and text.
Modal Serverless cloud for AI models with pay-per-use pricing and infrastructure control.
Baseten Deploy and serve ML models with infrastructure control and optimized performance.
DeepInfra Cost-effective inference API with managed servers, GPUs, and auto-scaling.
Lepton AI Pythonic framework for building and deploying AI services with cloud-native inference.
fal.ai 600+ models with competitive pricing, optimized for generative media.
Novita AI Affordable access to models for text, image, and audio generation.

(⬆ back to top)

🔀 LLM API Routers & Gateways

Tool Stars Description
OpenRouter - Unified API for 100+ LLMs from multiple providers with transparent pricing comparison.
LiteLLM 18k+ Proxy/gateway providing a unified API to 100+ LLMs with load balancing, fallbacks, and spend tracking.
Martian - AI-powered router that automatically selects the best model for each request using Model Mapping.
Portkey - Enterprise gateway with load balancing, caching, fallbacks, and observability.
Helicone - Rust-based gateway with load balancing, native observability, and cost tracking.

(⬆ back to top)

☁️ Cloud AI Platforms (Managed)

Tool Description
AWS Bedrock Multi-vendor model marketplace with AgentCore, supporting Anthropic, Meta, Cohere, and Amazon models.
Azure AI Foundry 11,000+ models with GPT-4/5 access and deep Azure integration.
Google Vertex AI Gemini family, Model Garden for 3rd-party/OSS models, and data-driven MLOps.
AWS SageMaker End-to-end ML platform for building, training, and deploying models with managed infrastructure.
Azure Machine Learning Automated ML, responsible AI dashboard, and managed endpoints.
Databricks Unified analytics and AI with managed MLflow, Unity Catalog, and Mosaic AI.
IBM watsonx Foundation models, model training, and AI governance for business workflows.

(⬆ back to top)

🤖 AI Agent Frameworks

Tool Stars Description
LangChain - LLM application framework with extensive integrations for chains, agents, and retrieval.
LangGraph - Cyclical graphs for agent runtimes with state management and human-in-the-loop. Extends LangChain.
LlamaIndex - RAG-first framework with 100+ data connectors and purpose-built retrieval infrastructure.
CrewAI - Role-based multi-agent systems with structured task delegation.
Microsoft AutoGen - Multi-agent conversation framework merging with Semantic Kernel into unified Microsoft Agent Framework.
Microsoft Semantic Kernel - LLM integration SDK with deep Azure ecosystem support across C#, Python, and Java.
DSPy - Declarative framework for programming (not prompting) language models with automatic prompt optimization.
Haystack - AI orchestration framework for LLM applications with modular pipelines.
Instructor 9k+ Structured output extraction using Pydantic models with validation and retry logic.
Pydantic AI - Agent framework from the Pydantic team with type-safe structured outputs.
OpenAI Agents SDK 7k+ OpenAI's official framework for agentic workflows with tool use, handoffs, and guardrails.
Google ADK (Agent Development Kit) - Google's framework with native Gemini integration and multi-agent orchestration.
Smolagents 15k+ Hugging Face's lightweight agent framework with minimal boilerplate.
CopilotKit - Framework for building AI copilots into React applications with AG-UI protocol support.

(⬆ back to top)

💾 AI Memory & Context Management

Each tool takes a different approach to giving agents persistent memory across sessions.

Tool Stars Description
Mem0 - Dedicated memory layer extracting and managing memories across sessions with up to 80% token reduction.
Zep - Temporal knowledge graph tracking how facts change over time with structured business data.
Letta (MemGPT) - Agent framework with first-class memory management exposing editable memory blocks and stateful runtime.
Cognee - Memory engine finding hidden connections in data using knowledge graphs with semantic embeddings.
LangMem 1k+ Memory tools for LangGraph agents extracting information from conversations for long-term context.
Supermemory - Universal memory API with time-annotated semantic traces and scalable semantic recall.
Graphlit - Semantic content infrastructure with multimodal ingestion linked to a shared knowledge graph.

(⬆ back to top)

📊 Structured Output & Data Extraction

Tool Stars Description
Instructor 9k+ Structured output extraction using Pydantic models with validation and retry logic.
BAML 5k+ Domain-specific language for writing and testing LLM functions with type-safe validation.
Marvin 5k+ Lightweight Python library for natural language interfaces that extract structured data from LLMs.
Outlines 10k+ Constrained text generation using regular expressions, JSON schemas, and context-free grammars.
Guidance 19k+ Microsoft's language for controlling LLMs with interleaving generation, prompting, and logical control.
SGLang 8k+ Fast serving framework with built-in structured output via JSON schema, regex, and EBNF.
TypeChat 8k+ Uses TypeScript types to guide LLM outputs into well-typed structured responses.
JSONFormer 4k+ Constrains generation to only produce valid JSON tokens.
LlamaParse - Enterprise document parsing with agentic OCR for extracting structured data from complex documents.
Docling 18k+ IBM's document parser with 97.9% accuracy on complex table extraction.

(⬆ back to top)

💻 AI Code Assistants

Tool Description
GitHub Copilot 20M+ users, deep IDE integration, agent mode, and enterprise features.
Cursor AI-native IDE built on VS Code with deep codebase understanding, multi-file editing, and Composer agent.
Windsurf (Codeium) Agentic IDE with Cascade flow combining deep context awareness with autonomous code generation and free tier.
Claude Code Terminal-based agentic coding excelling at reasoning, multi-file refactoring, and autonomous workflows.
Tabnine Privacy-first; multi-model support trained only on permissive licenses.
Amazon Q Developer Autonomous agents, AWS resource management, and console diagnostics.
Sourcegraph Cody Search-first architecture for analyzing entire multi-repo environments.
Continue.dev Supports any LLM and deep IDE customization.
Aider Free terminal-based AI pair programming tool for command-line workflows and automation.
Qodo (formerly CodiumAI) Focused on generating meaningful test suites and code review.
JetBrains AI Natively integrated into JetBrains IDEs with context-aware completions.
Google Gemini Code Assist Powered by Gemini models with Google Cloud integration.

(⬆ back to top)

🎨 AI Image Generation APIs

Tool Description
OpenAI DALL-E API Strong prompt understanding and seamless GPT integration.
Stability AI API Stable Diffusion 3.5 and SDXL models with fine-grained control and open-source weights.
Black Forest Labs (FLUX) FLUX.2 and Kontext APIs with state-of-the-art photorealism and typography.
Ideogram API Specializes in accurate text rendering, ideal for logos, posters, and branded content.
Recraft API Excels at vector outputs (icons, logos) alongside raster images with style controls.
Leonardo AI API Presets optimized for game art, textures, and product visuals.
Google Imagen API Imagen 3/4 models via Vertex AI with studio-quality outputs.
Midjourney Best for artistic and aesthetic outputs; accessible via Discord bot.

(⬆ back to top)

🎬 AI Video Generation APIs

Tool Description
Runway API Gen-4.5 with cinematic quality, motion brushes, and scene consistency.
Pika API PikaSwaps, Pikaffects, and fast rendering optimized for social media.
Luma AI (Dream Machine) 4K HDR video with Ray3 engine and superior physics simulation.
Kling AI Up to 2-minute videos at 1080p with simultaneous audio-visual generation.
OpenAI Sora Cinematic-quality videos with realistic physics.
Synthesia API 230+ AI avatars, 140+ languages, and Express-2 engine for enterprise video.
HeyGen API Realistic language dubbing with voice cloning and lip-sync.
D-ID API Talking-head videos from images and audio, supporting 120+ languages.
Google Veo API Native audio integration and high-fidelity output via Vertex AI.
Twelve Labs Multimodal video understanding: search, classification, and generation from video content.

(⬆ back to top)

🎙️ AI Audio & Speech APIs

Tool Description
ElevenLabs API 3,000+ voices, 70+ languages, emotional nuance, and voice cloning.
OpenAI Audio API TTS and STT including gpt-4o-transcribe and Whisper for 50+ languages.
Deepgram API STT and TTS (Aura-2) optimized for conversational AI with sub-250ms latency.
AssemblyAI API STT with speaker diarization, sentiment analysis, and content moderation.
Cartesia API Ultra-low-latency voice generation (<150ms) with rapid voice cloning for interactive use.
Play.ht API 900+ voices across 140+ languages and accents.
Resemble AI API Expressive speech synthesis, voice cloning, and deepfake detection for enterprise.
Speechify API Apple Design Award winner with 1,000+ voices in 60+ languages.
OpenAI Realtime API Speech-to-speech for real-time voice applications with low latency.

(⬆ back to top)

🔍 AI Search APIs

Tool Description
Perplexity Sonar API LLM-powered search providing direct answers with cited sources.
Tavily API Built for agents and RAG workflows with SOC 2 certification.
Exa API Semantic/neural search scoring 94.9% on SimpleQA, returning meaning-based results.
Brave Search API Independent index with no tracking and privacy-focused results.
SerpAPI SERP scraping across search engines for AI features, SEO, and price monitoring.
You.com API Deep Search, Express Agent, and MCP support.
Serper API Fast, affordable Google SERP API for high-volume use cases.
WebSearch API Web search API designed for AI workflow integration.

(⬆ back to top)

🌐 AI Translation APIs

Tool Description
DeepL API Known for natural, polished translations especially for European languages.
Google Cloud Translation 130+ languages with AutoML custom model training.
Microsoft Translator 100+ languages with custom translation models and document translation on Azure.
Amazon Translate Real-time and batch translation integrated into the AWS ecosystem.
ModernMT Adaptive engine that learns from human corrections in real-time.
Unbabel Blends AI translation with human editors for enterprise content.
LILT Adaptive neural models with human-in-the-loop workflow for enterprise localization.
Phrase Developer-friendly localization with extensive APIs and CLI tools for custom pipelines.
Lokalise 400+ languages with tone, style, and industry customization.
Tolgee In-context translating features, popular with JavaScript developers.

(⬆ back to top)

✍️ AI Writing & Content APIs

Tool Description
Jasper API Content platform for marketing teams with brand voice, campaign management, and generation.
Writer API Full-stack content generation with brand governance and custom model training.
Copy.ai Marketing copy, sales emails, and Go-To-Market workflows.
Writesonic Fact-checked, SEO-friendly article generation pulling real-time data from Google.
Surfer SEO SERP analysis and real-time content scoring for SEO optimization.
Frase Analyzes SERPs in real-time with AI-driven content suggestions for SEO.
Grammarly API Grammar, tone, clarity, and brand-voice checking for integration into applications.
Anyword Predictive performance scoring that forecasts content performance before publishing.
Wordtune Rewriting and paraphrasing by AI21 Labs for improving clarity, tone, and conciseness.

(⬆ back to top)

📐 Vector Databases & Embeddings

Vector Databases

Tool Stars Description
Pinecone - Fully managed with serverless architecture and support for billions of vectors.
Weaviate - Excels at hybrid search (vector + keyword) with GraphQL API.
Qdrant - High QPS at 99% recall, production-proven at scale.
Chroma - Developer-friendly embedding database ideal for prototyping RAG systems.
Milvus - Leads in low-latency benchmarks with support for billions of vectors.
Zilliz Cloud - Managed cloud service for Milvus with enterprise features and production SLAs.
pgvector 13k+ PostgreSQL extension keeping vectors and relational data together.
LanceDB - Serverless, built on Lance columnar format with zero-copy access.

Embedding APIs

Tool Description
OpenAI Embeddings API ada-002 and newer models for semantic search, clustering, and classification.
Voyage AI API Voyage 4 series with shared embedding spaces and domain-specific models.
Cohere Embed API Multilingual embeddings optimized for search and retrieval with Embed v3.
Google Embedding API Text and multimodal embeddings via Vertex AI with Gecko and newer models.
Jina AI Embeddings API-based embedding models with multilingual support and reranking.

(⬆ back to top)

📚 RAG Tools & Frameworks

Tool Description
LlamaIndex 100+ data connectors, retrieval strategies, and production-ready pipelines.
Unstructured Converts PDFs, images, HTML, and Word docs into structured data for ingestion.
Vectara RAG-as-a-service with built-in semantic search, summarization, and hallucination detection.
Ragas Reference-free evaluation of RAG pipelines with metrics for context relevance and groundedness.
LangChain Retrievers Modular retrieval components supporting vector stores, multi-query, and contextual compression.
Cohere Rerank Neural reranking that improves search relevance by reordering results on semantic similarity.
Morphik RAG infrastructure for building and deploying retrieval-augmented generation systems.

(⬆ back to top)

📄 AI Document Processing & OCR

Tool Stars Description
Azure Document Intelligence - Extracts structure, relationships, and key-values from documents.
Google Document AI - Pre-trained models for invoices, receipts, and custom documents.
Amazon Textract - Extracts text, forms, and tables integrated with S3 and Lambda.
Mistral OCR - 1000 pages/$ with state-of-the-art accuracy.
LlamaParse - Agentic OCR extracting structured data from complex documents.
Docling 18k+ IBM's parser with 97.9% accuracy on complex table extraction.
Mathpix - Specialized in mathematical equations, scientific documents, and LaTeX conversion.
Reducto - API-first parsing for extracting structured data from PDFs for RAG pipelines.
Mindee - Developer-friendly extraction for invoices, receipts, IDs, and custom document types.

(⬆ back to top)

🗃️ AI Database & SQL Assistants

Tool Description
Vanna.ai Self-hosted AI SQL agent with personalized schema training.
Defog.ai Text-to-SQL engine achieving 90-95% accuracy, matching GPT-4 on specialized benchmarks.
Text2SQL.ai Converts natural language to SQL with multiple dialect support.
DBHub Universal database MCP server enabling AI assistants to query databases.
Outerbase Spreadsheet-like views with EZQL natural language agent.
SQL Chat Chat-based SQL client using natural language.
BlazeSQL Generates SQL from natural language by parsing schema metadata.
DataGPT Slack-native analytics answering business questions with raw SQL transparency.

(⬆ back to top)

📏 LLM Evaluation & Monitoring

Tool Description
LangSmith Tracing, prompt testing, and RAG assessment from the LangChain team.
Langfuse LLM engineering platform for tracing, prompt management, evaluations, and observability.
Braintrust Evaluation-first platform for systematic prompt experimentation with side-by-side comparison.
Arize AI LLM observability with Phoenix tracing and production monitoring.
Promptfoo Prompt engineering, A/B testing, and LLM-as-a-judge evaluations.
Humanloop Prompt management and evaluation with human feedback loops.
WhyLabs Detects anomalies, drift, hallucinations, and bias in production systems.
Galileo AI Luna-2 evaluation models and real-time guardrail system for agent actions.
TruLens Evaluating and tracing AI agents and RAG systems with OpenTelemetry tracing.
Maxim AI Testing, monitoring, and optimizing LLM systems.
Evidently AI Detects data drift and model performance degradation.
Fiddler AI Focused on explainability, fairness, and compliance for ML and LLM models.

(⬆ back to top)

💰 AI Gateway & Cost Management

Tool Description
Helicone Gateway and observability with request logging, cost tracking, and Rust-based performance.
Portkey Unified API with load balancing, semantic caching, and multi-provider fallbacks.
Keywords AI Tracks usage, latency, and spend across providers.
Lunary LLM monitoring and analytics with cost tracking and prompt management.
OpenRouter Unified API marketplace for 100+ LLMs with transparent pricing and cost optimization.

(⬆ back to top)

🎯 Prompt Engineering & Management

Tool Description
Langfuse Prompt management with versioning, tracing, evaluations, and observability.
PromptLayer Logging, versioning, visual editing, and regression testing.
Agenta LLMOps platform for prompt management, evaluation, and A/B testing.
Pezzo Lightweight prompt management for versioning, testing, and instant deployment.
Promptfoo CLI for prompt testing, evaluation, and red-teaming with YAML-based configuration.
Maxim AI Prompt management and evaluation with testing workflows and version control.

(⬆ back to top)

🔧 Fine-Tuning Platforms

Tool Stars Description
OpenAI Fine-Tuning - Managed fine-tuning for GPT models with reinforcement fine-tuning preview.
Together AI Fine-Tuning - Cloud-based fine-tuning for open-source models with LoRA adapter support.
Predibase - Low-code fine-tuning and hosting with LoRAX multi-adapter serving and serverless deployment.
LoRAX 3k+ Serves 100s of fine-tuned LoRA adapters on a single GPU.
Modal Fine-Tuning - Serverless GPU compute with pay-per-use pricing and zero idle costs.
Lambda Cloud - GPU cloud optimized for AI training with zero egress fees and InfiniBand networking.
Anyscale - Built on Ray for distributed fine-tuning and serving of LLMs.

(⬆ back to top)

🛡️ AI Safety & Guardrails

Tool Stars Description
Guardrails AI - LLM output validation with pre-built validators from Guardrails Hub.
NVIDIA NeMo Guardrails 4k+ Programmable safety guardrails using a domain-specific language.
Lakera Guard - Real-time prompt injection detection with customizable safeguards and monitoring dashboards.
LLM Guard - Scanners for prompt injection, PII, and toxicity in LLM interactions.
Rebuff 1k+ Self-hardening prompt injection detector that learns from attack attempts over time.
Azure AI Content Safety - Detects harmful content in text and images with configurable severity levels.

(⬆ back to top)

⚖️ AI Compliance & Governance

Tool Description
Credo AI Compliance automation with EU AI Act, NIST AI RMF, and ISO 42001.
Holistic AI AI lifecycle governance assessing models for safety, fairness, bias, and regulatory alignment.
IBM watsonx.governance Model lifecycle management, bias detection, and compliance tracking.
Zenity AI agent compliance: risk detection, guardrail enforcement, and regulatory reporting.
Fairly AI AI risk management for financial services with model validation and bias detection.
Drata Compliance automation with AI-powered controls for SOC 2, ISO 27001, and AI regulations.
Securiti AI Data command center with AI governance, data privacy, and consent management.

(⬆ back to top)

🏭 MLOps & Deployment

Tool Description
MLflow Experiment tracking, model registry, and deployment; MLflow 3 adds GenAI ops support.
BentoML Build, ship, and scale AI model serving with high-performance inference.
KServe Kubernetes-native serverless model serving with autoscaling.
ZenML MLOps framework for portable ML pipelines across any infrastructure.
Metaflow Netflix-originated framework for real-life ML workflows with deep AWS integration.
NVIDIA Triton Inference Server GPU inference serving supporting multiple ML frameworks and model formats.
DVC (Data Version Control) Version control for data, models, and experiments alongside Git.
Kubeflow Google-backed ML toolkit for Kubernetes with pipeline orchestration and LLM fine-tuning.
Seldon Core ML model serving at scale with A/B testing and canary rollouts.
Weights & Biases Experiment tracking, model management, and dataset versioning for the full ML lifecycle.
Neptune.ai Experiment tracking and model registry for logging, comparing, and collaborating on models.
Comet ML Experiment management with real-time tracking, model comparison, and production monitoring.

(⬆ back to top)

⚡ AI Workflow & Automation

Tool Description
n8n AI agent nodes for building LLM-powered automation workflows.
Zapier AI 8,000+ app connections with AI-powered workflows, agents, and chatbots.
Make (Integromat) Visual automation with detailed branching logic and deep app integrations.
Lindy AI Agent-first automation for tasks like lead qualification and email follow-ups.
Relevance AI No-code platform for building collaborating AI agent teams.
Activepieces MIT-licensed no-code automation with AI workflow integration and self-hosting.
Pipedream Developer-focused hybrid no-code/serverless platform for API workflows with code at any step.
Inngest Durable functions replacing queues for reliable multi-step AI workflows.
Trigger.dev TypeScript-first platform for deploying managed AI agents and background workflows.
Windmill Workflow engine for internal tools, background jobs, and AI pipelines.
Hatchet Distributed task queue for reliable background jobs and AI agent workflows.
Relay.app Human-in-the-loop automation combining AI steps with human approval.

(⬆ back to top)

🔌 AI Function Calling & Tool Use

Tool Description
Composio 150+ pre-built tool integrations for AI agents with authentication handling.
Toolhouse Universal tool server enabling any LLM to use tools via a simple API.
Arcade AI Authenticated API access to third-party services for AI agents.
NPI (Natural Programming Interface) Tool-use APIs enabling AI agents to operate software tools.
Dify Platform (93K+ stars) for AI agents with built-in RAG, Function Calling, and ReAct strategies.

(⬆ back to top)

🧪 AI Testing & QA

Tool Description
testRigor Generative AI test automation from plain English descriptions.
Applitools Visual AI detecting UI bugs across browsers and devices.
Testsigma Natural language test creation for web, mobile, and API.
Katalon AI-powered test automation for web, mobile, API, and desktop.
mabl Low-code, AI-driven test automation for Agile and DevOps.
QA Wolf AI + human-in-the-loop QA with Playwright-based testing and massive parallelization.
Qodo Generates meaningful test suites and improves code quality.

(⬆ back to top)

🧬 AI Synthetic Data Generation

Tool Description
Gretel Fine-tuning capabilities, privacy metrics, and domain-specific generation APIs.
Mostly AI Transforms production data into privacy-safe versions for AI training.
Tonic.ai Fabricate (synthetic data), Structural (test data), and Textual (data redaction).
YData Automated data profiling with synthetic generation for improving AI training quality.
Hazy Privacy-preserving synthetic data for regulated industries using differential privacy.
Synthesis AI Specialized in computer vision training with photorealistic 3D-rendered images.

(⬆ back to top)

🏷️ AI Data Labeling & Annotation

Tool Description
Scale AI Data labeling with RLHF capabilities, used by major AI labs.
Labelbox Annotation, data management, and Model Foundry with HIPAA/SOC2 compliance.
Snorkel AI Programmatic labeling using weak supervision to encode labeling logic as functions.
Argilla Free data annotation for NLP, LLMs, and multimodal models with Hugging Face integration.
Label Studio Flexible templates, ML backends, and model-in-the-loop support.
Encord Computer vision labeling with active learning and model-assisted labeling.
CVAT Annotation tool for computer vision including image and video labeling.

(⬆ back to top)

🔗 MCP (Model Context Protocol)

Anthropic's open standard (donated to Linux Foundation AAIF) for connecting AI models to external tools and data sources.

Core Protocol

Tool Stars Description
Model Context Protocol Specification - The protocol spec defining how AI models connect to tools and data.
MCP TypeScript SDK 3k+ Official TypeScript SDK for building MCP servers and clients.
MCP Python SDK 4k+ Official Python SDK for building MCP servers and clients.
MCP Registry - Catalog with ~2,000 entries and 407% growth since launch.

Notable MCP Servers

Tool Stars Description
MCP Servers Repository 16k+ Official collection of reference server implementations.
Awesome MCP Servers 30k+ Community-curated collection covering 7,260+ servers across diverse integrations.
Playwright MCP 5k+ Microsoft's server enabling LLMs to interact with web pages through accessibility snapshots.
ActionKit by Paragon - Connects to 130+ SaaS integrations for AI agent workflows.

(⬆ back to top)

🛠️ AI Developer Tools & SDKs

Tool Description
Vercel AI SDK TypeScript toolkit for React/Next.js AI apps with composable agents and MCP support.
Streamlit Python framework for data-heavy AI apps combining chat interfaces with dashboards.
Gradio Python library for ML demo UIs with instant public links.
Chainlit Python framework for conversational AI UIs with step-by-step visualization.
Langflow Low-code Python-based builder for visually creating agentic and RAG applications.
Flowise Node.js visual AI workflow builder for agents and chatbots.
Dify LLM app development platform with visual workflow builder and model management.

(⬆ back to top)

⏱️ Realtime & Streaming AI

Tool Description
OpenAI Realtime API Speech-to-speech via WebSocket for real-time voice applications.
Google Gemini Live API Real-time voice and video interaction with Gemini via WebSocket.
LiveKit Real-time audio/video AI applications with WebRTC and agent frameworks.
Deepgram Streaming Real-time STT with sub-250ms latency via WebSocket for live transcription.
AG-UI Protocol Protocol for streaming agent events between AI backends and frontend UIs.
Decart Lucy 2 model generating video live with near-zero latency.

(⬆ back to top)

🎭 Multi-Modal AI APIs

Tool Description
OpenAI GPT-4o API Natively multimodal: text, images, audio, and video with unified reasoning.
Google Gemini API 1M+ token context with native image/video/audio understanding.
Anthropic Claude API Vision-capable: analyzes images, documents, charts, and screenshots alongside text.
Mistral Pixtral Image understanding capabilities via API.
Meta Llama Multimodal Vision capabilities via various inference providers.
Twelve Labs Video understanding: search, classification, and generation from video.
Roboflow Computer vision APIs for training, deploying, and managing detection and segmentation models.

(⬆ back to top)

📱 Edge & On-Device AI

Tool Stars Description
Ollama - Run LLMs locally with a single command; wide model support with easy management.
LM Studio - Desktop app for running LLMs locally with Vulkan GPU offloading and model discovery.
llama.cpp 75k+ C/C++ inference engine optimized for CPU-only inference with extensive quantization.
vLLM - High-throughput serving with PagedAttention and continuous batching.
NVIDIA TensorRT-LLM 10k+ 30-70% faster throughput on NVIDIA GPUs.
MLX 18k+ Apple's ML framework for Apple Silicon achieving ~230 tokens/second.
ExecuTorch 3k+ Meta's on-device framework with 50KB footprint supporting 12+ hardware backends.
Jan - Offline-first desktop AI assistant for running LLMs locally with privacy-first design.
LocalAI - Self-hosted OpenAI-compatible API for running models locally.
LlamaEdge - Lightweight runtime for deploying LLMs at the edge using WebAssembly.

(⬆ back to top)

🖥️ Cloud GPU Providers

Tool Description
RunPod On-demand and serverless compute supporting A100/H100 GPUs at flexible pricing.
Lambda Cloud H100/A100 instances with zero egress fees and InfiniBand networking.
CoreWeave Kubernetes-native GPU cloud at hyperscale; serves OpenAI and Microsoft.
Vast.ai GPU marketplace with competitive bidding, best for cost-sensitive workloads.
Nebius AI-native cloud with Blackwell Ultra compute and European data centers.
TensorDock GPU marketplace spanning 100+ global locations.
Paperspace (DigitalOcean) Notebooks, virtual desktops, and serverless functions for ML development.
Crusoe Cloud Clean-energy-powered GPU cloud for sustainable AI workloads.
Fluidstack Distributed GPU cloud aggregating idle compute at competitive prices.

(⬆ back to top)

🧩 AI Browser Extensions & Desktop Apps

Tool Description
ChatGPT Desktop Native desktop app with system-wide access via global shortcuts.
Claude Desktop MCP-based extensions and local tool integration.
Poe Multi-model platform accessing GPT-4, Claude, Gemini, Llama, and custom bots in one interface.
Perplexity AI search engine as web app and browser extension providing cited answers.
Raycast AI macOS launcher with built-in AI and extensions for system-wide access.
Monica Browser extension accessing GPT, Claude, Gemini with sidebar chat.
Sider Browser sidebar unifying GPT, DeepSeek, Gemini, Claude, and Grok for comparing outputs.
Pieces for Developers Captures live context from browsers to IDEs with multiple LLM support.
HARPA AI Chrome extension combining ChatGPT, Claude, Gemini, Perplexity for web page automation.

(⬆ back to top)

🎮 AI Playgrounds & API Testing

Tool Stars Description
OpenAI Playground - Official tool for experimenting with OpenAI models and testing prompts.
Vercel AI Playground - Compare AI model outputs across providers side-by-side.
TypingMind - LLM frontend supporting multiple models with your own API keys and custom plugins.
ChatHub - Browser extension for using ChatGPT, Claude, Gemini side-by-side.
Msty - Desktop app for managing multiple LLM providers with conversation branching and prompt library.
OpenPlayground 3k+ LLM playground for running and comparing models locally.

(⬆ back to top)

🕷️ Web Scraping & Data Extraction for AI

Tool Stars Description
Firecrawl - Scrapes, crawls, and extracts structured data into LLM-ready formats.
Crawl4AI 58k+ AI-ready web crawler generating clean Markdown with local LLM support.
Spider - Fast web scraping designed for AI with structured data extraction.
Jina Reader - Converts any URL into LLM-friendly text.
ScrapeGraphAI - Scraping library using LLMs to create pipelines from natural language.

(⬆ back to top)

📖 AI Knowledge Management

Tool Description
Glean Enterprise search connecting Google Workspace, Microsoft 365, Slack, and Salesforce.
Notion AI AI layer combining wiki, project management, and databases with AI search and writing.
Guru Knowledge verification delivering knowledge at point-of-work via Slack and Teams.
Mem Note-taking that self-organizes information using machine learning.
Slite Collaborative docs using AI to categorize content, identify gaps, and surface relevant info.
Confluence AI Atlassian's enterprise wiki with AI-powered search and content generation.

(⬆ back to top)

🏆 Benchmarks & Leaderboards

Tool Description
Chatbot Arena (LMArena) Crowdsourced LLM ranking using 6M+ blind pairwise votes and Elo ratings.
Artificial Analysis Independent comparison of 100+ models across quality, price, speed, and latency.
Open LLM Leaderboard Community benchmark tracking open-source model performance across standardized evaluations.
Stanford HELM Holistic evaluation across 42 scenarios and 7 dimensions including accuracy, bias, and toxicity.
LiveBench Continuously updated with new questions to prevent contamination.
LLM Stats Aggregates multiple leaderboards and benchmarks in one place.
MTEB (Massive Text Embedding Benchmark) Evaluates embedding models across retrieval, classification, and clustering.
EvalPlus 80x more test cases than HumanEval to address overfitting.
BigCodeBench 1,140 tasks requiring composing function calls from 139 libraries.

(⬆ back to top)

📚 Learning Resources

Guides & Tutorials

Resource Stars Description
OpenAI API Quickstart - Official getting started guide covering chat completions, tool use, and multimodal capabilities.
OpenAI Academy - Workshops from foundational AI literacy to advanced integration for engineers.
Anthropic Getting Started - Set up your environment, install an SDK, and send your first message to Claude.
Anthropic Quickstarts 1k+ Projects to help build deployable applications using the Claude API.
Google Gemini Quickstart - Step-by-step tutorials for getting started with Gemini models.
Google Gemini Cookbook 8k+ Official examples and guides for using the Gemini API.
OpenAI Cookbook - Recipes, guides, and notebooks for building with OpenAI models.
OpenAI Prompt Engineering Guide - Strategies for getting better results from LLMs.
Anthropic Prompt Engineering Tutorial 3k+ Interactive tutorial for effective prompt design with Claude.
DAIR.AI Prompt Engineering Guide - Community-driven guide covering techniques and examples for all skill levels.
Microsoft Generative AI for Beginners 70k+ Free 21-lesson course covering generative AI fundamentals with code examples.
Microsoft AI Agents for Beginners 15k+ Free 12-lesson course for getting started building AI agents.

Courses

Resource Description
DeepLearning.AI Short Courses 50+ free short courses on LLMs, prompt engineering, RAG, agents, and fine-tuning.
fast.ai Practical Deep Learning Free course teaching practical deep learning with a top-down approach.
Hugging Face LLM Course Using, fine-tuning, and deploying LLMs with the Hugging Face ecosystem.
Google ML Crash Course Free 15-hour self-study course on ML fundamentals.
LangChain Academy Free courses on LangGraph agent architectures and prompt engineering.
Stanford CS224N Stanford's flagship NLP course with free lecture videos and assignments.
Stanford CS229 Stanford's ML course.
Generative AI with LLMs (Coursera) DeepLearning.AI and AWS course on generative models and fine-tuning.

Books

Resource Description
AI Engineering (Chip Huyen) Guide to building production AI systems; most-read book on O'Reilly since launch.
Build a Large Language Model From Scratch (Sebastian Raschka) Step-by-step guide to coding a GPT-like LLM from scratch.
Designing Machine Learning Systems (Chip Huyen) Holistic approach to designing ML systems for production.
Hands-On Large Language Models (O'Reilly) Practical guide to understanding and applying LLMs in real-world scenarios.
Deep Learning for Coders with fastai and PyTorch Hands-on approach to deep learning starting with practical applications.

Newsletters

Resource Description
The Batch Andrew Ng's weekly AI news for engineers, executives, and enthusiasts.
TLDR AI Daily AI newsletter covering technical developments for 1.25M+ readers.
Ben's Bites Daily newsletter with 120K+ subscribers on product launches and business use cases.
The Neuron Morning-Brew-style AI newsletter for 550K+ professionals.
Latent Space Technical AI newsletter and podcast on research and infrastructure.
Simon Willison's Weblog Blog covering LLMs, AI tools, and open source from a prolific builder.

Podcasts

Resource Description
Latent Space: The AI Engineer Podcast Deep technical dives into foundation models, agents, and AI engineering.
Practical AI Making AI practical, productive, and accessible with real-world discussions.
NVIDIA AI Podcast Biweekly interviews exploring innovations in AI and deep learning.
Lex Fridman Podcast Long-form conversations about science, AI, and intelligence.
Dwarkesh Podcast Deeply researched interviews with leading thinkers in AI and technology.

YouTube Channels

Resource Description
Andrej Karpathy Former Tesla AI Director teaching neural networks from scratch in "Zero to Hero.".
3Blue1Brown Beautifully animated explanations of mathematics, neural networks, and deep learning.
Yannic Kilcher Detailed breakdowns of the latest ML research papers.
AI Explained Clear explanations of AI concepts and model comparisons.
Two Minute Papers Bite-sized summaries of cutting-edge AI research with visual explanations.
Umar Jamil Highly technical implementations of ML and LLM techniques from scratch.

Communities

Resource Description
r/LocalLLaMA 266K+ member community focused on running LLMs locally and open-source models.
r/MachineLearning 2.8M+ subscriber academic-leaning community for ML papers and research.
Hugging Face Community Hub for sharing ML models, datasets, and Spaces.
OpenAI Developer Forum Official forum for OpenAI API developers.
LangChain Discord Developer community for building LLM applications with LangChain.

Research Paper Aggregators

Resource Description
arXiv Open-access preprint repository with 2M+ papers in AI, ML, and CS.
Semantic Scholar AI-powered tool for finding and connecting scientific papers.
Papers With Code ML papers linked with code implementations and benchmark results.
Connected Papers Explores related papers through interactive citation graphs.
Hugging Face Daily Papers Curated trending AI papers with links to models and datasets.

Conferences

Resource Description
NeurIPS Premier ML conference (December annually).
ICML International Conference on Machine Learning (July 2026, Seoul).
ICLR International Conference on Learning Representations (April 2026, Rio de Janeiro).
AI Engineer World's Fair Practitioner-focused conference (June-July 2026, San Francisco).

(⬆ back to top)


🔗 Related Awesome Lists

List Stars Description
awesome-chatgpt-prompts 120k+ Curated ChatGPT prompts for creative and effective use.
awesome-llm-apps 20k+ LLM app examples with code.
awesome-generative-ai 7k+ Generative AI tools and resources.
awesome-langchain 7k+ Tools and projects using LangChain.
awesome-mcp-servers 30k+ Community-curated MCP server collection.
awesome-llm-agents 3k+ LLM agent frameworks and tools.
awesome-vibe-coding 5k+ Tools and resources for AI-assisted coding.
awesome-local-ai 3k+ Tools for running AI models locally.
awesome-scrapers - Scrapers, crawlers, and data extraction tools.
awesome-robotics - Robotics frameworks, simulators, and platforms.
awesome-web3-ai - Web3 x AI tools, agent frameworks, and protocols.

(⬆ back to top)

🪦 Deprecated Tools Graveyard

Dead Tool Why Use Instead
OpenAI Codex Shut down Mar 2023 GPT-4o, Claude Code.
Hugging Face Inference API (free) Removed free tier Hugging Face Inference Endpoints (paid).
Jasper Boss Mode Rebranded/discontinued Jasper AI platform.
ChatGPT Plugins Shut down Apr 2024 GPTs, function calling.
Bard Renamed to Gemini Google Gemini.
LangChain Agents (legacy) Deprecated in favor of LangGraph LangGraph.

(⬆ back to top)


Contributing

Contributions welcome! Please read the contribution guidelines first.

  • Add tools you've actually used or evaluated
  • Include star count and language where applicable
  • Note if a tool is unmaintained (last commit >1 year ago)
  • Commercial tools/services are fine but must be clearly labeled

License

CC0

To the extent possible under law, Edward Tay has waived all copyright and related or neighboring rights to this work.

Releases

No releases published

Packages

 
 
 

Contributors