Awesome Vector Databases

A curated list of vector database solutions, libraries, and resources for AI applications.

🔥 Acknowledgements

This directory was built and is maintained using the Ever Works Directory Builder platform.
The public-facing website is based on the open-source Directory Website Template.

📑 Table of Contents

Machine Learning Models (35)
Concepts & Definitions (74)
Vector Database Engines (62)
Managed Vector Databases (14)
Curated Resource Lists (55)
LLM Tools (5)
Multi Model & Hybrid Databases (9)
SDKs & Libraries (71)
Vector Database Engines (10)
Cloud Services (5)
Cloud Services (7)
Curated Resource Lists (4)
Managed Vector Databases (11)
Sdks & Libraries (11)
Vector Database Extensions (8)
Benchmarks & Evaluation (9)
Data Integration & Migration (4)
Llm Frameworks (3)
Llm Tools (6)
Machine Learning Models (2)
Multi Model & Hybrid Databases (6)
Vector Database Extensions (8)
Benchmarks & Evaluation (9)
Commerce (7)
Concepts & Definitions (12)
Data Integration & Migration (12)
LLM Frameworks (12)
LLM Frameworks (1)
LLM Tools (24)
Multi-Model & Hybrid Databases (3)
Open Sources (31)
Relational Databases (2)
Research Papers & Surveys (25)
Research Papers & Surveys (7)
SDKs & Libraries (9)
Security & Governance (1)
Security & Governance (1)

Machine Learning Models

BGE-VL - State-of-the-art multimodal embedding model from BAAI supporting text-to-image, image-to-text, and compositional visual search. Trained on the MegaPairs dataset with over 26 million retrieval triplets. (Read more) Multimodal Open Source Visual Search
ColBERT - Late interaction architecture for efficient and effective passage search. Encodes queries and documents independently using BERT, then performs token-level similarity via maxsim operator for strong generalization. (Read more) Retrieval Open Source Nlp
ColBERTv2 - Advanced multi-vector retrieval model creating token-level embeddings with late interaction mechanism, featuring denoised supervision and improved memory efficiency over original ColBERT. (Read more) Late Interaction Embeddings Retrieval
Jina Embeddings v4 - Universal multimodal embedding model from Jina AI supporting text and images through unified pathway. Built on Qwen2.5-VL-3B-Instruct, outperforms proprietary models on visually rich document retrieval. This is a commercial API with free tier, though OSS weights available. (Read more) Commercial Multimodal Open Source
Nomic Embed Text - First fully reproducible open-source text embedding model with 8,192 context length. v2 introduces Mixture-of-Experts architecture for multilingual embeddings. Outperforms OpenAI models on benchmarks. This is an OSS model under Apache 2.0 license. (Read more) Open Source Embedding Multilingual
NV-Embed - NVIDIA's generalist embedding model achieving record 69.32 score on MTEB benchmark. Fine-tuned from Llama architecture with improved techniques for training LLMs as embedding models. (Read more) Embeddings Nvidia Llm
pinecone-sparse-english-v0 - Learned sparse embedding model built on DeepImpact architecture, outperforming BM25 by up to 44% on TREC benchmarks for high-precision keyword search and hybrid retrieval. (Read more) Sparse Embeddings Hybrid Search
Qwen3 Embedding - Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions. (Read more) Multilingual Open Source Embeddings
voyage-3-large - State-of-the-art general-purpose and multilingual embedding model from Voyage AI that ranks first across eight domains spanning 100 datasets, outperforming OpenAI and Cohere models by significant margins. (Read more) Embeddings Multilingual Api
BGE Reranker Base - Open-source cross-encoder reranking model from BAAI that enhances RAG retrieval quality by examining query-document pairs individually. Self-hostable with Apache 2.0 licensing for cost-effective production deployments. (Read more) Reranking Open Source Rag
BGE-reranker-v2-m3 - Open-source multilingual reranking model from BAAI supporting 100+ languages with Apache 2.0 licensing, matching Cohere's latency on GPU with zero ongoing costs for production deployments. (Read more) Reranking Multilingual Open Source
Cohere Embed v3 - Commercial text embedding model from Cohere with multilingual support and 1,024-dimensional vectors. Optimized for semantic search and retrieval tasks. This is a commercial API service with pay-per-use pricing. (Read more) Commercial Embedding Api
Cohere Embed v4 - Multilingual, multimodal enterprise embedding model supporting over 100 programming languages and primary business languages with advanced quantization for cost optimization. (Read more) Embeddings Multilingual Multimodal
ColPali - Vision Language Model trained to produce high-quality multi-vector embeddings from document page images for efficient retrieval, eliminating need for OCR pipelines with ColBERT-style late interaction. (Read more) Multimodal Document Retrieval Vision
E5 Embeddings - Open-source text embedding models from Microsoft supporting 100+ languages. Features small, base, and large variants with weakly-supervised contrastive pre-training. This is an OSS model family released by Microsoft Research. (Read more) Open Source Microsoft Multilingual
E5-Mistral-7B-Instruct - Open-source embeddings model from Microsoft initialized from Mistral-7B-v0.1, achieving state-of-the-art BEIR score of 56.9 for English text embedding and retrieval tasks with 4096-dimensional vectors. (Read more) Embeddings Open Source Instruction Based
Gemini Embedding 2 - Google's first natively multimodal embedding model that maps text, images, video, audio and documents into a single embedding space. Supports over 100 languages with flexible output dimensions using Matryoshka Representation Learning. (Read more) Multimodal Embeddings Google
GTE Embeddings - General Text Embeddings from Alibaba DAMO Academy trained on large-scale relevance pairs. Available in three sizes (large, base, small) with GTE-v1.5 supporting 8192 context length. (Read more) Embeddings Open Source Multilingual
INSTRUCTOR - Instruction-finetuned text embedding model that generates task-specific embeddings based on natural language instructions. One embedder for any task, achieving state-of-the-art results across 70 diverse datasets without additional fine-tuning. (Read more) Instruction Based Embeddings Task Specific
Jina ColBERT v2 - Groundbreaking multilingual information retrieval model supporting 89 languages with token-level embeddings and late interaction. Features Matryoshka embeddings for flexible efficiency-precision tradeoffs and 8192 token input context. (Read more) Embedding Multilingual Colbert
jina-embeddings-v5 - Jina AI's latest embedding model achieving the highest multilingual performance among models under 1B parameters with 71.7 average MTEB score and 67.7 MMTEB score. (Read more) Embeddings Multilingual Open Source
Llama-Embed-Nemotron-8B - Universal text embedding model from NVIDIA achieving state-of-the-art performance on MMTEB leaderboard, optimized for retrieval, reranking, semantic similarity, and classification with 4,096-dimensional embeddings. (Read more) Embeddings Multilingual Nvidia
Mixedbread AI - AI startup providing state-of-the-art embedding and reranking models through accessible APIs, offering both open-source and proprietary models optimized for various use cases. (Read more) Embeddings Re Ranking Api
ModernBERT Embed - Open-source embedding model from Nomic AI based on ModernBERT-base with 149M parameters. Supports 8192 token sequences and Matryoshka Representation Learning for 3x memory reduction. (Read more) Open Source Embeddings Nlp
mxbai-embed-large - State-of-the-art large embedding model from Mixedbread AI, ranked first among similar-sized models, supporting Matryoshka Representation Learning and binary quantization with 700M+ training pairs. (Read more) Embeddings Open Source Matryoshka
Nomic Embed Text v2 - Open-source multilingual embedding model using Mixture-of-Experts architecture, achieving excellent semantic performance with efficient inference and full offline support. (Read more) Embeddings Multilingual Open Source
nomic-embed-text-v2-moe - Multilingual MoE text embedding model excelling at multilingual retrieval with SoTA performance compared to ~300M parameter models, supporting ~100 languages with Matryoshka Embeddings trained on 1.6B pairs. (Read more) Embeddings Multilingual Local
Reranking Models - Cross-encoder models that rerank initial retrieval results for improved relevance. More accurate than bi-encoders but slower, typically applied to top-k candidates. (Read more) Reranking Cross Encoder Rag
SPLADE - Learned sparse embedding model using BERT for term weighting and expansion. Outperforms BM25 in information retrieval by identifying semantic connections between words while maintaining interpretability. (Read more) Sparse Embeddings Learned Interpretable
text-embedding-3-large - OpenAI's flagship text embedding model with up to 3,072 dimensions, offering best-in-class performance and accuracy for English tasks with adjustable output sizes to optimize storage costs. (Read more) Openai Embeddings Api
UForm - Pocket-sized multimodal AI for content understanding across multilingual texts, images, and video. Up to 5x faster than OpenAI CLIP with quantization-aware embeddings and support for 20+ languages. (Read more) Multimodal Embeddings Multilingual
Voyage AI Embeddings - Commercial embedding models built for enterprise-grade semantic search and RAG applications. Features voyage-3 and voyage-3-large models with multimodal support. This is a commercial API service with usage-based pricing. (Read more) Commercial Embedding Multimodal
Voyage Multimodal 3.5 - Next-generation multimodal embedding model built for retrieval over text, images, and videos, supporting Matryoshka embeddings with 4.56% higher accuracy than Cohere Embed v4 on visual document retrieval. (Read more) Multimodal Embeddings Video
voyage-4 - Latest Voyage AI embedding model family featuring shared embedding space with MoE architecture, supporting flexible output dimensions and advanced quantization options for cost optimization. (Read more) Embeddings Multilingual Quantization
voyage-multimodal-3 - Voyage AI's first all-in-one multimodal embedding model supporting interleaved text and content-rich images including screenshots, PDFs, slide decks, tables, and figures. (Read more) Multimodal Embeddings Visual Search

Concepts & Definitions

Cascading Retrieval - Advanced retrieval approach combining dense vectors, sparse vectors, and reranking in a multi-stage pipeline, achieving up to 48% better performance than single-method retrieval. (Read more) Hybrid Search Rag Retrieval
HNSW-IF - Hybrid billion-scale vector search method combining HNSW with inverted file indexes, enabling cost-efficient search by keeping centroids in memory while storing vectors on disk. (Read more) Hnsw Disk Based Scalability
Matryoshka Embeddings - Representation learning approach encoding information at multiple granularities, allowing embeddings to be truncated while maintaining performance. Enables 14x smaller sizes and 5x faster search. (Read more) Embeddings Optimization Research
RecursiveCharacterTextSplitter - LangChain's hierarchical text chunking strategy achieving 85-90% accuracy by recursively splitting using progressively finer separators to preserve semantic boundaries. (Read more) Chunking Text Processing Rag
Vector Index Comparison Guide (Flat, HNSW, IVF) - Comprehensive comparison of vector indexing strategies including Flat, HNSW, and IVF approaches. Covers performance characteristics, memory requirements, and use case recommendations for 2026. (Read more) Indexing Comparison Best Practices
ACORN Algorithm - Performant and predicate-agnostic search algorithm for vector embeddings with structured data. Uses two-hop graph expansion to maintain high recall under selective filters in Weaviate. (Read more) Ann Graph Based Filtering
Approximate Nearest Neighbors (ANN) - Family of algorithms trading perfect accuracy for speed in high-dimensional similarity search. Enables sub-linear query time with 90%+ recall on billion-scale datasets. (Read more) Algorithm Ann Search
Ball-Tree - Tree-based spatial data structure organizing vectors using spherical regions instead of axis-aligned splits, making it better suited for high-dimensional data compared to KD-trees. (Read more) Tree Based Indexing High Dimensional
Binary Quantization - Extreme vector compression technique converting each dimension to a single bit (0 or 1), achieving 32x memory reduction and enabling ultra-fast Hamming distance calculations with acceptable accuracy trade-offs. (Read more) Quantization Compression Optimization
BM25 - Best Matching 25 ranking function for information retrieval that ranks documents based on query term frequency with length normalization. Core component of hybrid search RAG systems combining keyword and semantic search. (Read more) Information Retrieval Ranking Keyword Search
BM25 (Okapi BM25) - Probabilistic ranking function for estimating document relevance to search queries. Industry standard for keyword search, combining term frequency, rarity, and length normalization into a single scoring model. (Read more) Ranking Information Retrieval Keyword Search
BM42 - Experimental sparse embedding approach combining exact keyword search with transformer intelligence, integrating sparse and dense vector searches for improved RAG results, developed by Qdrant. (Read more) Sparse Hybrid Search Experimental
Chunk Overlap Strategy - Text chunking technique using 10-20% overlap between consecutive chunks to preserve context continuity and prevent information loss at chunk boundaries for improved retrieval. (Read more) Chunking Rag Text Processing
Chunking Strategies for RAG - Methods for splitting documents into optimal pieces for vector embedding and retrieval. Includes fixed-size, recursive, semantic, and agentic chunking approaches. (Read more) Rag Document Processing Chunking
Context Precision - RAG evaluation metric assessing retriever's ability to rank relevant chunks higher than irrelevant ones, measuring context relevance and ranking quality for optimal retrieval. (Read more) Rag Evaluation Metrics
Context Recall - RAG evaluation metric measuring whether retrieved context contains all information required to produce ideal output, assessing completeness and sufficiency of retrieval. (Read more) Rag Evaluation Retrieval
Context Window - Maximum number of tokens an embedding model or LLM can process in a single input. Critical parameter for vector databases affecting chunk sizes, with modern models supporting 512 to 32,000+ tokens for long-document understanding. (Read more) Llm Embeddings Architecture
Contextual Retrieval - Anthropic's RAG technique that prepends chunk-specific explanatory context before embedding, reducing failed retrievals by 49% (67% with reranking). Uses Contextual Embeddings and Contextual BM25. (Read more) Rag Retrieval Context
Cosine Similarity - Fundamental similarity metric for vector search measuring the cosine of the angle between vectors. Range from -1 to 1, with 1 indicating identical direction regardless of magnitude. (Read more) Similarity Distance Metric Vector Search
Cross-Encoder - Neural reranking architecture that examines full query-document pairs simultaneously for deeper semantic understanding, achieving higher accuracy than bi-encoders at the cost of computational efficiency. (Read more) Reranking Neural Networks Nlp
Dot Product - Vector similarity metric measuring both directional similarity and magnitude of vectors. Used by many LLMs for training and equivalent to cosine similarity for normalized data. Reports both angle and magnitude information. (Read more) Similarity Distance Metric Llm
Dot Product (Inner Product) - Similarity metric computing sum of element-wise products between vectors. Efficient for normalized vectors, equivalent to cosine similarity when vectors are unit length. (Read more) Similarity Distance Metric Vector Search
Dot Product Similarity - Vector similarity metric combining both angle and magnitude information for comprehensive similarity measurement, equivalent to cosine similarity when vectors are normalized. (Read more) Similarity Search Metrics Algorithm
Embedding Dimensionality - The size of vector embeddings, typically ranging from 384 to 4096 dimensions. Higher dimensions capture more information but increase storage, compute, and latency costs. (Read more) Embeddings Optimization Dimensions
Embedding Fine-Tuning - Process of adapting pre-trained embedding models to specific domains or tasks for improved performance. Techniques include supervised fine-tuning, contrastive learning, and domain adaptation to optimize embeddings for particular use cases. (Read more) Embeddings Fine Tuning Machine Learning
Embedding Models Overview - Neural networks that convert text, images, or other data into dense vector representations. Enable semantic understanding by mapping similar concepts to nearby points in vector space. (Read more) Embeddings Models Neural Networks
Euclidean Distance - Straight-line distance metric between vectors in multidimensional space, sensitive to both magnitude and direction, ideal when embedding magnitude carries important information. (Read more) Similarity Search Metrics Algorithm
Euclidean Distance (L2 Distance) - Distance metric measuring straight-line distance between vectors in multi-dimensional space. Lower values indicate higher similarity, with 0 meaning identical vectors. (Read more) Distance Metric Similarity Vector Search
Faithfulness - RAG evaluation metric measuring whether generated answers accurately align with retrieved context without hallucination, ensuring factual grounding of LLM responses. (Read more) Rag Evaluation Llm
Filtered Vector Search - Combining vector similarity search with metadata filtering. Enables queries like find similar documents published after 2023 in category Technology. (Read more) Filtering Metadata Hybrid Search
Filtered Vector Search Guide - Complete guide to metadata filtering in vector search covering pre-filtering, post-filtering, and hybrid approaches. Addresses the Achilles heel of vector search with modern solutions. (Read more) Filtering Metadata Best Practices
Hamming Distance - Distance metric for binary vectors counting the number of positions at which corresponding bits differ, computed efficiently using XOR and popcount operations for ultra-fast similarity search. (Read more) Distance Metric Binary Similarity
HCNNG - Hierarchical Clustering-based Nearest Neighbor Graph using MST to connect dataset points through multiple hierarchical clusters. Performs efficient guided search instead of traditional greedy routing. (Read more) Ann Graph Based Clustering
HNSW (Hierarchical Navigable Small World) - Graph-based approximate nearest neighbor algorithm with logarithmic search complexity. Industry standard for high-dimensional vector search with excellent recall-speed tradeoff. (Read more) Algorithm Graph Based Ann
Hybrid Search - Search approach combining keyword-based (BM25) and semantic (vector) search for best of both worlds. Uses fusion techniques like RRF to merge results. (Read more) Search Hybrid Fusion
Hybrid Search Best Practices - Comprehensive guide to combining BM25 keyword search with vector semantic search using reciprocal rank fusion and reranking. Essential pattern for production RAG systems in 2026. (Read more) Hybrid Search Rag Best Practices
IVF - Inverted File Index vector search algorithm that partitions high-dimensional vectors into clusters using k-means, enabling efficient nearest neighbor search by restricting searches to relevant clusters and dramatically reducing search space. (Read more) Algorithm Indexing Ann
IVF (Inverted File Index) - Clustering-based approximate nearest neighbor algorithm that partitions vector space into Voronoi cells. Fast search through coarse-to-fine strategy, often combined with Product Quantization (IVF-PQ). (Read more) Algorithm Clustering Ann
IVF-FLAT - Inverted File index with FLAT (uncompressed) vectors, partitioning the vector space into clusters with centroids, offering a balance between search speed and accuracy for approximate nearest neighbor search. (Read more) Indexing Ivf Clustering
IVF-FLAT Index - Inverted File Index with flat vectors using K-means clustering to partition high-dimensional space into regions, enhancing search efficiency by narrowing search area through neighbor partitions. (Read more) Indexing Algorithm Ann
IVF-PQ (Inverted File with Product Quantization) - Vector indexing method combining inverted file index with product quantization for memory-efficient search. Reduces storage from 128x4 bytes to 32x1 bytes (1/16th) while maintaining search quality. (Read more) Quantization Indexing Compression
KD-Tree - Tree-based data structure for organizing vectors through recursive axis-aligned partitioning, enabling logarithmic time complexity searches for balanced data but struggling with high-dimensional spaces. (Read more) Tree Based Indexing Data Structure
Late Interaction - Retrieval paradigm where query and document tokens are encoded separately and interactions computed at search time, combining efficiency of bi-encoders with expressiveness of cross-encoders. (Read more) Retrieval Colbert Neural Search
Locality Sensitive Hashing (LSH) - Algorithmic technique for approximate nearest neighbor search in high-dimensional spaces using hash functions to map similar items to the same buckets with high probability. (Read more) Hashing Ann Algorithm
Manhattan Distance - Vector distance metric calculating the sum of absolute differences between vector components. Measures grid-like distance and is robust to outliers, with faster calculation as data dimensionality increases. (Read more) Similarity Distance Metric High Dimensional
Matryoshka Representation Learning - Training technique enabling flexible embedding dimensions by learning representations where truncated vectors maintain good performance, achieving 75% cost savings when using smaller dimensions. (Read more) Embeddings Optimization Machine Learning
MaxSim - Maximum Similarity late interaction function introduced by ColBERT for ranking. Calculates cosine similarity between query and document token embeddings, keeping maximum score per query token for highly effective long-document retrieval. (Read more) Colbert Ranking Late Interaction
MaxSim Operator - Scoring function used in late interaction models like ColBERT that computes query-document relevance by finding maximum similarity between each query token and document tokens, then summing. (Read more) Late Interaction Colbert Ranking
MSTG (Multi-Stage Tree Graph) - Hierarchical vector index developed by MyScale overcoming IVF limitations through multi-layered design, creating multiple layers unlike IVF's single layer of cluster vectors for improved search performance. (Read more) Indexing Tree Based Hierarchical
Multimodal Embeddings - Vector representations mapping different data types (text, images, audio, video) into a shared embedding space. Enables cross-modal search and understanding. (Read more) Multimodal Embeddings Cross Modal
NSW (Navigable Small World) - Graph-based algorithm for approximate nearest neighbor search where vertices represent vectors and edges are constructed heuristically. Foundation for HNSW with (poly/)logarithmic search complexity using greedy routing. (Read more) Ann Graph Based Algorithm
Product Quantization (PQ) - Vector compression technique that splits high-dimensional vectors into subvectors and quantizes each independently, achieving significant memory reduction while enabling approximate similarity search. (Read more) Quantization Compression Optimization
Product Quantization Compression - Lossy vector compression dividing vectors into subvectors for independent quantization. Achieves 8-64x storage reduction while enabling fast approximate distance computation via lookup tables. (Read more) Compression Quantization Pq
RAG (Retrieval-Augmented Generation) - AI technique combining information retrieval with LLM generation. Retrieves relevant context from knowledge base before generating responses, reducing hallucinations and enabling grounded answers. (Read more) Rag Llm Retrieval
Reciprocal Rank Fusion - Method for combining ranked lists from multiple retrieval systems in hybrid search. Standard technique in RAG pipelines for fusing BM25 and dense vector results before reranking, creating diverse high-confidence candidate sets. (Read more) Hybrid Search Ranking Fusion
Reciprocal Rank Fusion (RRF) - Hybrid search algorithm combining results from multiple ranking systems by computing reciprocal ranks, commonly used to merge dense vector search with sparse keyword search for improved retrieval. (Read more) Hybrid Search Ranking Fusion
Reranking - Two-stage retrieval pattern where initial candidates from vector/keyword search are re-scored using more sophisticated models. Combines fast initial retrieval with accurate final ranking using cross-encoders or ColBERT for 15-40% accuracy improvements. (Read more) Rag Ranking Retrieval
Scalar Quantization - Vector compression technique reducing precision of each vector component from 32-bit floats to 8-bit integers, achieving 4x memory reduction with minimal accuracy loss for vector search. (Read more) Quantization Compression Optimization
Semantic Caching - AI caching pattern that stores vector embeddings of LLM queries and responses, serving cached results when new queries are semantically similar. Cuts LLM costs by 50%+ with millisecond response times versus seconds for fresh calls. (Read more) Caching Optimization Llm
Semantic Chunking - Advanced text splitting technique using embeddings to divide documents based on semantic content instead of arbitrary positions, preserving cohesive ideas within chunks for improved RAG performance. (Read more) Chunking Rag Text Processing
Semantic Search - Search technique understanding meaning and context rather than exact keyword matching. Uses vector embeddings to find semantically similar content even with different wording. (Read more) Search Embeddings Semantics
TreeAH - Vector index type based on Google's ScaNN algorithm combining tree-like structure with Asymmetric Hashing quantization, optimized for batch queries with 10x faster index generation and smaller memory footprint. (Read more) Indexing Quantization Google
Vamana - Graph-based indexing algorithm powering Microsoft's DiskANN. Uses flat graph structure with minimized search diameter for efficient disk-based nearest neighbor search with 40x GPU speedup available via NVIDIA cuVS. (Read more) Ann Graph Based Algorithm
Vector Database Backup and Recovery Guide - Best practices for backup and disaster recovery in vector databases. Covers full/incremental backups, replication strategies, and cloud-native approaches for safeguarding high-dimensional embeddings. (Read more) Backup Disaster Recovery Best Practices
Vector Database Cost Optimization Guide - Comprehensive strategies for reducing vector database costs including storage management, compute optimization, and monitoring. Covers cloud pricing trends and hidden costs in 2026. (Read more) Cost Optimization Cloud Best Practices
Vector Database Monitoring - Observability practices for vector databases including query latency, recall metrics, storage utilization, and index health monitoring. (Read more) Monitoring Observability Operations
Vector Database Performance Tuning Guide - Comprehensive guide covering index optimization, quantization, caching, and parameter tuning for vector databases. Includes techniques for balancing performance, cost, and accuracy at scale. (Read more) Performance Optimization Best Practices
Vector Database Sharding - Distributing vector data across multiple nodes for horizontal scaling. Enables handling billions of vectors by partitioning data and parallelizing queries. (Read more) Sharding Scalability Distributed
Vector Database Sharding Strategies - Comprehensive guide to sharding approaches for distributed vector databases including range-based, hash-based, geographic, and vector-aware clustering methods for horizontal scaling. (Read more) Scalability Distributed Sharding
Vector Database Use Cases - Applications of vector databases across industries including semantic search, RAG systems, recommendations, anomaly detection, and multimodal search. (Read more) Use Cases Applications Ai
Vector Dimensionality - Number of components in an embedding vector, typically ranging from 128 to 4096 dimensions. Higher dimensions can capture more information but increase storage, computation, and costs. Critical design parameter for vector databases. (Read more) Embeddings Optimization Architecture
Vector Index Types - Overview of indexing structures for approximate nearest neighbor search including HNSW (graph-based), IVF (clustering), LSH (hashing), and tree-based approaches. (Read more) Indexing Algorithms Ann
Vector Quantization Techniques - Methods for compressing vector embeddings to reduce storage and memory costs. Includes scalar quantization, product quantization, and binary quantization with varying compression-accuracy tradeoffs. (Read more) Compression Optimization Cost Reduction
Vector Similarity Search - Finding nearest vectors in high-dimensional space based on distance or similarity metrics. Core operation of vector databases enabling semantic search, recommendations, and RAG. (Read more) Similarity Search Vectors

Vector Database Engines

Data Cloud Vector Database - Built into the Salesforce platform, Data Cloud Vector Database ingests various large datasets from customer interactions, classifies and organizes unstructured data, and merges it with structured data to enrich customer profiles and store as metadata in Data Cloud. It enhances generative AI by providing more relevant, accurate, and up-to-date responses through improved data retrieval and semantic search capabilities. (Read more) Enterprise Cloud Native vector database
Instaclustr - Instaclustr offers comprehensive managed services for vector databases, handling deployment, configuration, ongoing maintenance, tuning, optimization, scalability, security, and data protection. This allows organizations to offload the complexities of managing their vector database infrastructure and focus on their core business objectives. (Read more) Managed Service Cloud Enterprise
Qdrant Vector Database - Qdrant is an open‑source vector database designed for high‑performance similarity search and AI applications such as RAG, recommendation systems, advanced semantic search, anomaly detection, and AI agents. It provides scalable storage and retrieval of vector embeddings with features like filtering, hybrid search, and production‑grade APIs for integrating with machine learning workloads. (Read more) vector database Open Source Hybrid Search
Qwak - A platform designed to simplify the building, management, and deployment of Large Language Model (LLM) applications, enabling rapid operationalization of context-aware LLMs and offering integration with its Vector Store. (Read more) MLOps Llm platform
vector engine for OpenSearch Serverless - An on-demand serverless configuration for OpenSearch Service that simplifies the operational complexities of managing OpenSearch domains, integrated with Knowledge Bases for Amazon Bedrock to support generative AI applications. (Read more) Cloud Native Serverless OpenSearch
Aerospike - A multi-model AI database designed for high-throughput vector processing at scale, supporting real-time AI use cases with a patented Hybrid Memory Architecture and efficient infrastructure usage, capable of handling large volumes of data and concurrent users. (Read more) multi-model Real Time Scalable
AllegroGraph - A database that incorporates neuro-symbolic AI and offers a managed service (AllegroGraph Cloud) for neuro-symbolic AI knowledge graphs, indicating its relevance to advanced AI applications, likely including vector capabilities. (Read more) Graph Database Ai Knowledge Graph
Amazon Web Services Vector Search - AWS has introduced vector search in several of its managed database services, including OpenSearch, Bedrock, MemoryDB, Neptune, and Amazon Q, making it a comprehensive platform for vector search solutions. (Read more) Cloud Native Vector Search Managed Service Enterprise
Apache Cassandra - Apache Cassandra is a distributed NoSQL database that is adding native support for high-dimensional vector storage and approximate nearest neighbor search, making it a scalable choice for AI and vector search workloads. (Read more) Nosql Distributed Vector Search Scalable
AstraDB - AstraDB (also known as Astra DB by DataStax) is a cloud-native vector database built on Apache Cassandra, supporting real-time AI applications with scalable vector search. It is designed for large-scale deployments and features a user-friendly Data API, robust vector capabilities, and automation for AI-powered applications. (Read more) Cloud Native Vector Search Scalable Ai
Blaze - An emerging solution diversifying the options available to data engineers in the vector database landscape. (Read more) vector database emerging data engineering
ChromaDB - ChromaDB (also known as Chroma or chroma-core) is an open-source vector database focused on LLM applications, emphasizing simplicity and in-memory HNSW-based dense vector search. It is suited for prototyping, metadata filtering, and offers a user-friendly interface for building and testing vector search applications, though it currently lacks hybrid and distributed features. (Read more) Open Source In Memory Vector Search Llm
citrus - A distributed vector database designed for scalable and efficient vector similarity search. It is purpose-built for handling large-scale vector data and search workloads. (Read more) Open Source Distributed Vector Search Scalable
ClickHouse - ClickHouse is an open-source column-oriented database that supports vectorized computation and now offers vector search features. Its architecture enables efficient real-time analytics and vector operations, making it a relevant choice for vector database use cases. (Read more) Open Source Analytics Vector Search Real Time
Cottontail DB - Cottontail DB is an open-source vector database for storing and searching high-dimensional data, with features geared towards research and production environments. (Read more) Open Source vector databases High Dimensional Vector Search
DataFusion - A general-purpose analytical engine with built-in vector processing capabilities, excelling at traditional analytical workloads and efficient handling of vector operations. It is an example of a vector engine. (Read more) analytical engine vector processing Open Source
Datastax - Datastax offers a vector search solution integrated with its database platform, enabling approximate similarity search and hybrid queries for enterprise use cases. (Read more) Enterprise Vector Search Hybrid Search Similarity Search
Deep Lake - Deep Lake is a vector database designed as a data lake for AI, capable of storing and managing vector embeddings, text, images, and videos. It utilizes a tensor format for efficient querying and integration with AI algorithms, making it suitable for similarity search and machine learning workflows. It is open-source and tailored for handling unstructured and multimodal data, with seamless integration with frameworks like PyTorch and TensorFlow. (Read more) Open Source Vector Search Ai Multimodal
Elasticsearch - Elasticsearch is a distributed search engine supporting various data types, including vectors, and provides scalable vector search capabilities, making it a popular choice for modern AI-powered applications. It can be extended with the k-NN plugin to provide scalable vector search using HNSW and Lucene, enabling hybrid semantic and keyword search capabilities. (Read more) Open Source Vector Search Hybrid Search Scalable
Google Cloud Vertex AI Vector Search - Google Cloud Platform offers vector search as part of its Vertex AI suite, enabling scalable and integrated vector search capabilities for AI-driven applications. (Read more) Cloud Native Vector Search Ai Scalable
Google Vertex AI - Google Vertex AI offers managed vector search capabilities as part of its AI platform, supporting hybrid and semantic search for text, image, and other embeddings. (Read more) Managed Service Vector Search Hybrid Search Semantic Search Cloud Native
HAKES - HAKES is a system designed for efficient data search using embedding vectors at scale, making it a relevant solution for vector database applications. (Read more) Vector Search Scalable Embeddings
Infinity - Infinity is an AI-native database built for LLM applications, offering fast hybrid search of dense vectors, sparse vectors, tensors, and full-text data. (Read more) Ai Llm Hybrid Search Vector Search
JaguarDB - JaguarDB is a database solution, identified as a vector database in the context of the provided research. (Read more) vector database Commercial High Performance
KDB - KDB is a high-performance vector database supporting billion-scale vector search, with features aimed at enterprises needing large-scale vector storage and retrieval. (Read more) Enterprise Scalable Vector Search High Performance
KDB.AI - KDB.AI is a proprietary vector database and search engine designed for real-time AI applications. It offers advanced vector search, integrates with popular ML tools, and supports temporal and semantic context for embeddings. KDB.AI Server is a high-performance vector database and search engine from KX, designed for real-time analytics and AI applications requiring rapid similarity search. (Read more) proprietary Real Time Ai Vector Search
LanceDB - LanceDB is a columnar vector database optimized for real-time AI use cases and analytics workloads, providing efficient vector storage and fast similarity search. (Read more) Vector Search Real Time Analytics Ai
Manu - A cloud-native vector database management system designed for efficient storage and retrieval of vector embeddings. Directly relevant as a vector database platform. (Read more) vector databases Cloud Native Vector Search Scalable
Marqo - Marqo is an open-source neural search engine that leverages vector representations to enable semantic search over textual data. It abstracts vector database complexity and provides a high-level interface for building advanced search applications. (Read more) Open Source Semantic Search Vector Search Ai
Microsoft Azure AI Search - Azure AI Search provides vector search capabilities as a managed service, supporting approximate KNN, hybrid search, and integration with other Azure AI tools. (Read more) Managed Service Vector Search Hybrid Search Cloud Native
Microsoft Azure Vector Database - Microsoft Azure offers vector search support across multiple database services, enabling developers to leverage vector search in cloud-native and enterprise scenarios. (Read more) Cloud Native Vector Search Enterprise Scalable
Milvus - Milvus is a mature, open-source vector database maintained by Zilliz, supporting large-scale similarity search with multiple indexing strategies and GPU acceleration. It includes variants such as Milvus Lite (lightweight version), Milvus Standalone (single-machine deployment), and Milvus Distributed (Kubernetes-based deployment for large scale). (Read more) Open Source Vector Search Scalable Gpu Acceleration
Milvus Distributed - Milvus Distributed is a horizontally scalable, distributed deployment of the Milvus vector database designed for enterprise workloads, offering high reliability and the ability to handle billions of vectors with a comprehensive management toolkit. (Read more) Distributed Scalable vector database
Milvus Standalone - Milvus Standalone is a single-machine deployment option of the Milvus vector database that provides a complete, production-ready vector search engine suitable for datasets up to millions of vectors. (Read more) vector database single-node Similarity Search
MongoDB - MongoDB is a general-purpose database that now includes vector search capabilities, enabling light vector workloads alongside traditional database functionality. MongoDB Atlas, the managed cloud offering, includes vector search built on Lucene, supporting ANN queries and hybrid search. MongoDB Atlas Search integrates powerful vector search capabilities directly within MongoDB. (Read more) Vector Search Hybrid Search Nosql Managed Service
MongoDB Atlas Vector Search - A vector search capability integrated within MongoDB Atlas, enabling vector-based retrieval and similarity search over unstructured data. Relevant for users seeking vector search in a popular database platform. MongoDB Vector Search is an integrated feature in MongoDB Atlas that enables efficient vector-based search within a comprehensive document database, supporting up to 2,048 dimensions and hybrid search capabilities. (Read more) Cloud Native Vector Search document database Hybrid Search
MyScale - A relational database engine extended with native vector search capabilities, allowing for scalable and efficient similarity search in combination with SQL queries. (Read more) Vector Search Sql Scalable Hybrid Search
Neo4j - Neo4j is a graph database that has added vector search capabilities, providing unique and effective approaches for retrieval augmented generation (RAG) and other AI applications. (Read more) Graph Database Vector Search Rag Ai
NucliaDB - NucliaDB is a commercial vector database that enables semantic and vector search across unstructured data, supporting advanced AI and ML-powered applications. (Read more) Commercial Vector Search Semantic Search Ai
ObjectBox - A high-performance embedded database for edge devices and mobile, offering vector search capabilities for AI applications. (Read more) Embedded Edge Mobile
OpenSearch - OpenSearch is a fully open-source, community-driven search and analytics suite that supports vector search, providing a transparent and flexible alternative for organizations seeking advanced search features. (Read more) Open Source Vector Search Analytics Scalable
Oracle Database Vector Search - Oracle's core database now includes vector search capabilities, enabling enterprises to perform scalable vector queries natively as part of their data management workflows. Oracle includes vector search capabilities in its database platform, supporting approximate KNN and hybrid search for enterprise-scale use cases. (Read more) Enterprise Vector Search Hybrid Search KNN
orama - Orama is a lightweight search engine that supports vector and hybrid search functionalities, suitable for browser, server, or edge environments. (Read more) Open Source Vector Search Hybrid Search Lightweight
Photon Engine - A general-purpose analytical engine with built-in vector processing capabilities, excelling at traditional analytical workloads and efficient handling of vector operations. It is an example of a vector engine. (Read more) analytical engine vector processing Performance
Qdrant - Qdrant is a dedicated vector database and similarity search engine supporting advanced filtering and efficient retrieval, suitable for faceted search and retrieval-augmented generation. It offers self-hosted and cloud deployment options, making it highly relevant for vector search applications. (Read more) Open Source Vector Search Similarity Search Rag
Quokka - An emerging solution diversifying the options available to data engineers in the vector database landscape. (Read more) vector database emerging data engineering
Qwak Vector Store - Qwak provides a vector store solution engineered for optimized storage and querying of vector embeddings, offering efficient search capabilities, high performance, scalability, and data retrieval by identifying similarities among data points. (Read more) vector store Scalable Embeddings
Redis - Redis, while primarily an in-memory data store, offers vector search capabilities through its RediSearch and RedisAI modules, enabling vector similarity searches and deep learning model management for existing Redis users. With the RediSearch module, Redis extends its functionality to support native vector search, indexing, and hybrid queries, making it suitable for real-time AI and semantic search applications. (Read more) Vector Search In Memory Hybrid Search Real Time
seekdb - seekdb is OceanBase’s experimental vector database component for high-performance nearest neighbor search over embedding vectors. (Read more) Ann vector database High Performance
Solr - Solr is a mature open-source search engine that has incorporated vector search capabilities, making it relevant for enterprises looking to implement vector-based search alongside traditional keyword search. (Read more) Open Source Vector Search Hybrid Search Enterprise
tinyvector - tinyvector is a minimal vector database / ANN engine focused on simplicity and compact implementation for educational and small-scale similarity search uses. (Read more) Ann Similarity Search Lightweight
Transwarp Hippo - Transwarp Hippo is an enterprise-grade, cloud-native distributed vector database designed for scalable vector operations, including similarity search and clustering, targeting massive datasets and real-time recommendation systems. (Read more) Enterprise Cloud Native Distributed Vector Search
Trieve - Trieve provides an all-in-one infrastructure for vector search, recommendations, retrieval-augmented generation (RAG), and analytics, accessible via API for seamless integration. (Read more) Open Source Vector Search Rag Analytics
Typesense - Typesense is an open-source search engine that supports hybrid search, including vector search capabilities, providing an alternative to proprietary vector search solutions. (Read more) Open Source Hybrid Search Vector Search full-text search
Vald - Vald is an open-source, highly scalable distributed vector search engine known for its asynchronous auto-indexing and ability to efficiently handle large-scale vector data in real time, making it suitable for demanding vector search applications. (Read more) Open Source Distributed Scalable Real Time
vearch - Vearch is a distributed vector search engine designed for AI-native applications, enabling scalable and efficient similarity search across large datasets. (Read more) Open Source Distributed Vector Search Ai
Vector Databases - A critical emerging technology focused on processing, storing, and retrieving vast amounts of high-dimensional vector data rapidly and efficiently. Unlike traditional databases, they offer unique advantages for use cases such as image and video recognition, natural language processing (NLP), and Retrieval-Augmented Generation (RAG). (Read more) vector databases Ai Rag
Vector.ai - Vector.ai offers commercial vector database solutions for efficient high-dimensional similarity search and machine learning applications. (Read more) Commercial Vector Search Machine Learning Similarity Search
Vespa.ai - Vespa.ai is a scalable open-source platform for real-time big data serving and vector search. It supports vector similarity search and is used for applications like retrieval augmented generation and e-commerce search, making it highly relevant for vector database and vector search use cases. (Read more) Open Source Vector Search Real Time Scalable
Vexvault - Vexvault is an open-source vector database designed for efficient storage, management, and similarity search of high-dimensional vector data. (Read more) Open Source vector databases Similarity Search High Dimensional
Weaviate - Weaviate is an open-source, cloud-native vector database that supports fast semantic search, modular extensions, and graph-like querying, making it an ideal solution for building scalable, modern vector search applications. (Read more) Open Source Cloud Native Semantic Search Scalable
Zilliz Cloud - Zilliz Cloud is a fully managed vector database service powered by Milvus, offering hassle-free deployment, scalability, and high performance for vector search applications. (Read more) Cloud Native Managed Service Vector Search Milvus

Managed Vector Databases

AlloyDB - Google Cloud's fully managed, PostgreSQL-compatible database service that offers vector capabilities, leveraging the power of PostgreSQL and pgvector for AI applications. (Read more) Managed Service Postgresql Cloud
Azure Database for PostgreSQL - Microsoft Azure's managed service for PostgreSQL, which supports the pgvector extension, enabling robust vector database capabilities in the cloud for AI and machine learning workloads. (Read more) Managed Service Cloud Native Postgresql
DataRobot Vector Databases - The DataRobot vector databases feature provides FAISS-based internal vector databases and connections to external vector databases such as Pinecone, Elasticsearch, and Milvus. It supports creating and configuring vector databases, adding internal and external data sources, versioning internal and connected databases, and registering and deploying vector databases within the DataRobot AI platform to power retrieval-augmented generation and other AI use cases. (Read more) vector databases Rag Managed Service
Pinecone - Pinecone is a fully managed vector database designed for high‑performance semantic search and AI applications. It provides scalable, low-latency storage and retrieval of vector embeddings, allowing developers to build semantic search, recommendation, and RAG (Retrieval-Augmented Generation) systems without managing infrastructure. (Read more) Managed Service vector database Semantic Search
Amazon DocumentDB (with MongoDB compatibility) - An AWS document database service compatible with MongoDB, identified as a great choice for vector database needs. (Read more) Managed Service document database MongoDB
Amazon RDS for PostgreSQL - A managed relational database service from AWS that can host PostgreSQL, including specific community versions, and is a suitable choice for deploying the pgvector extension for vector storage. (Read more) Managed Service Cloud Native Postgresql
Aurora PostgreSQL-Compatible - An AWS database service compatible with PostgreSQL, identified as a great choice for vector database needs. (Read more) Managed Service Cloud Native Postgresql
Azure Cosmos DB - A vector database solution provided by Microsoft Azure. (Read more) Managed Service Cloud Native Azure
Cloudflare Vectorize - Cloudflare Vectorize is a managed vector database/indexing service integrated with Cloudflare Workers AI. It stores and searches high-dimensional vector embeddings (such as text embeddings) using configurable dimensions and distance metrics like cosine and euclidean, automatically handling index optimization and regeneration when new data is inserted. (Read more) Managed Service vector database Cloud Native
DataRobot Vector Database - DataRobot Vector Database is a managed vector store capability within the DataRobot AI Platform that allows users to create, register, deploy, and update vector databases for AI workloads, including RAG and semantic search. It integrates with NVIDIA NIM embeddings and supports both built-in and bring-your-own embeddings for building production-grade vector search solutions. (Read more) Managed Service Rag Semantic Search
DataRobot Vector Databases (GenAI) - A premium vector database capability within the DataRobot Generative AI platform that stores chunked unstructured text and their embeddings for retrieval-augmented generation (RAG). Users can create vector database objects, connect supported data sources from the DataRobot Data Registry, configure embeddings and chunking, and attach these vector databases to LLM blueprints in the playground to ground model responses in proprietary data. (Read more) Rag vector store Enterprise
Instaclustr for Managed Apache Cassandra 5.0 - A managed service offering Apache Cassandra 5.0, which can be utilized as a vector database for AI applications. (Read more) Managed Service Cassandra Nosql
Instaclustr for PostgreSQL - A managed service for PostgreSQL that includes support for pgvector, enabling PostgreSQL to function as a vector database for AI workloads. (Read more) Managed Service Postgresql Ai
Weaviate Cloud - Weaviate Cloud is the fully managed cloud deployment of the Weaviate vector database, providing a hosted environment for building and operating AI applications with scalable vector search, without managing infrastructure. (Read more) Managed Service Cloud Native Vector Search

Curated Resource Lists

MongoDB Vector Search - MongoDB Vector Search turns MongoDB into a full-featured vector database, enabling approximate and exact nearest neighbor search over vector embeddings stored alongside operational data. It supports semantic similarity search, retrieval-augmented generation (RAG) for AI applications, and lets you combine vector search with full‑text search and structured filters in the same query. Available on supported MongoDB Atlas clusters, it integrates with popular AI frameworks and services for building intelligent, agentic systems. (Read more)
Survey of Vector Database Management Systems - A comprehensive 2023 survey that systematically analyzes the design, architecture, indexing techniques, and system implementations of modern vector database management systems, serving as a foundational reference for understanding the vector database ecosystem used in AI applications. (Read more)
Vector DB Feature Matrix - A collaboratively maintained Google Sheets matrix comparing features, capabilities, and characteristics of many vector databases and approximate nearest neighbor libraries, useful for selecting solutions for AI and similarity search applications. (Read more)
Algolia Vector Search - Algolia’s vector search capability that augments its search-as-a-service platform with semantic and similarity search using embeddings. (Read more)
Alibaba Cloud OpenSearch Vector Search - Alibaba Cloud’s OpenSearch service with vector search support for semantic retrieval and intelligent search applications. (Read more)
Awesome papers and technical blogs on vector DB - A curated collection of papers and technical blogs focused on vector databases, semantic-based vector search, and approximate nearest neighbor search (ANN Search). These resources are essential for understanding and building large-scale information retrieval systems and vector databases. (Read more) vector databases Research blogs Ann Semantic Search
Awesome Vector Databases - A curated list of vector database solutions, libraries, and resources tailored for AI applications. Categorizes items by license and type, providing a valuable directory for those seeking vector database technologies. (Read more) awesome list resources vector databases Open Source
awesome-vector-database - A curated awesome list compiling resources, tools, vector databases, and research relevant to vector search and storage. Serves as a meta-resource for exploring the vector database ecosystem. (Read more) vector databases resources tools awesome list
awesome-vector-databases-data - A data repository that powers the 'Awesome Vector Databases' curated list, collecting structured information about vector database solutions, libraries, and resources for AI applications. Directly supports the discovery and categorization of vector database tools. (Read more) resources awesome list vector databases Open Source
awesome-vector-search - A curated collection of libraries, services, and research papers focused on vector search, including vector database technologies and related resources. (Read more) Vector Search libraries resources papers
Chroma - Chroma is an open-source AI-native vector database that provides semantic, full-text, and regex search as a memory layer for LLM and RAG applications. (Read more)
Databricks Vector Search - Databricks Vector Search is a managed vector search capability in Databricks that lets you create and maintain vector search indexes over Delta tables. It supports multiple modes for providing vector embeddings, including Databricks-computed embeddings (Delta Sync Index with managed embeddings), self-managed precomputed embeddings (Delta Sync Index with self-managed embeddings), and Direct Vector Access Index where clients directly manage vector updates via REST APIs. It is designed for AI and RAG-style applications built on top of the Databricks Lakehouse, enabling similarity search with metadata filters and tight integration with Unity Catalog and Delta Lake. (Read more)
Efficient Multi-vector Dense Retrieval with Bit Vectors (emvb) - emvb is an open-source implementation of the "Efficient Multi-vector Dense Retrieval with Bit Vectors" method, providing a specialized vector-search index for multi-vector dense retrieval using compact bit-vector representations to accelerate ANN search and reduce memory usage in vector database and retrieval systems. (Read more)
Foundations of Vector Retrieval - A comprehensive survey/tutorial paper that formalizes the principles, models, and system designs for vector retrieval, offering theoretical and practical foundations for modern vector databases and vector search engines. (Read more)
GaussDB-Vector: A Large-Scale Persistent Real-Time Vector Database for LLM Applications - GaussDB-Vector is a large-scale, persistent, real-time vector database system designed specifically for LLM and AI applications. It provides native vector storage and similarity search capabilities, supporting low-latency, high-throughput vector operations and integration with large language model workloads. (Read more)
Hashing - A set of libraries and methods focused on hashing for similarity search in vector databases, directly impacting the performance of large-scale vector search systems. (Read more) Hashing Similarity Search resources Vector Search
Image Retrieval in the Wild - A CVPR 2020 tutorial on large-scale image retrieval in unconstrained environments, including methods and system considerations for vector-based image search relevant to vector database and ANN applications. (Read more) Tutorials Multimodal Vector Search
Implement two-tower retrieval for large-scale candidate generation - A Google Cloud reference architecture demonstrating an end-to-end two-tower retrieval system for large-scale candidate generation that uses Vertex AI and vector similarity search concepts to learn and serve semantic similarity between entities. (Read more) Rag Semantic Search architectures
IntelLabs's Vector Search Datasets - A collection of datasets curated by Intel Labs specifically for evaluating and benchmarking vector search algorithms and databases. (Read more) Datasets Vector Search Benchmark Evaluation
Introduction to Information Retrieval - Foundational IR textbook that includes content on vector‑space models and retrieval, providing essential background for understanding vector search and hybrid retrieval in modern vector databases. (Read more) resources Search Learning
Kinomoto.Mag AI - Kinomoto.Mag AI is a blog focused on AI tools, news, and tutorials, including curated lists of vector databases for AI applications. It serves as a resource hub for those interested in the latest innovations in vector databases and AI technologies. (Read more) blog Ai resources vector databases
KShivendu/awesome-vector-search - A curated list of awesome projects and research related to vector search, including dedicated vector databases, vector search libraries, performance benchmarks, and cost analysis resources. (Read more) awesome list Vector Search resources Open Source
LibHunt Vector Database Projects - A curated collection of open-source vector database projects, providing a centralized list for exploring and comparing solutions designed for vector search and AI applications. (Read more) Open Source vector databases resources Ai
Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search - Research paper proposing lossless compression techniques for vector identifiers in approximate nearest neighbor (ANN) search systems, aiming to reduce memory footprint and improve efficiency in large-scale vector databases and similarity search engines. (Read more)
Mastering Multimodal RAG - A course focused on mastering multimodal Retrieval Augmented Generation (RAG) and embeddings, which are fundamental components often stored and managed by vector databases. (Read more) Rag Multimodal Embeddings Tutorials
Mosaic AI Vector Search - Mosaic AI Vector Search is Databricks’ managed vector database and similarity search service for AI applications, providing high‑capacity, high‑performance vector indexing and querying with configurable endpoint types, including standard and storage‑optimized endpoints that scale to over one billion 768‑dimensional vectors. (Read more)
Multidimensional data / Vectors - A collection of resources, libraries, and databases focused on handling and searching multidimensional vector data, directly relevant for storing and querying vector embeddings in AI-powered applications. (Read more) resources vector data vector embeddings awesome list
MyScale Vector Database Benchmark - Benchmark framework and results from MyScale for comparing vector database and ANN index performance using large‑scale datasets and common query workloads relevant to AI applications. (Read more)
Neural Search in Action - A CVPR 2023 tutorial that demonstrates neural search systems in practice, including vector representations, similarity search, and scalable retrieval architectures closely related to vector databases. (Read more) Tutorials Neural Search Vector Search
OpenAI Cookbook - A collection of examples and guides from OpenAI, including best practices for working with embeddings, which are fundamental to vector search and vector database applications. (Read more) Openai Embeddings resources
Oracle AI Vector Search - Oracle AI Vector Search is Oracle’s integrated vector search capability within Oracle AI Database 26ai, enabling storage and querying of vector embeddings alongside traditional business data. It introduces a native VECTOR data type and supports high‑dimensional semantic similarity search for AI workloads such as chatbots, recommendation systems, anomaly detection, and multimedia search, while allowing embeddings to be used directly with Oracle machine learning algorithms. (Read more)
Passing the Baton: High Throughput Distributed Disk-Based Vector Search with BatANN - BatANN is a distributed, disk-based vector search system designed for high-throughput approximate nearest neighbor queries at scale, providing an architecture and methods applicable to large-scale vector databases that need efficient storage beyond memory. (Read more)
PDX: A Data Layout for Vector Similarity Search - PDX is a proposed data layout optimized for vector similarity search, focusing on memory and access efficiency for high-dimensional embeddings, making it relevant for the internal storage design of vector databases and ANN indexes. (Read more)
Quantization - Resources and tools on quantization techniques for vectors, which are essential for optimizing storage and retrieval in vector databases. (Read more) Quantization resources vector data Optimization
Systems - A focused category on complete vector database systems, their architectures, and implementations, directly relevant to anyone seeking production-ready vector database solutions. (Read more) vector database systems resources awesome list
Tree-based Methods - A curated list of tree-based approaches and systems for vector indexing and search, foundational for certain types of vector databases. (Read more) Tree Based vector indexing resources Vector Search
Typesense Cloud - Fully managed cloud service for the open-source Typesense search engine, including support for vector search and hybrid search use cases. (Read more) Managed Service Vector Search Hybrid Search
Understanding and Applying Text Embeddings (Vertex AI Short Course) - Short course by DeepLearning.AI and Google Cloud that teaches how to generate and use text embeddings with the Vertex AI Embeddings API for semantic search, classification, and question-answering systems, providing foundational knowledge for working with vector databases and retrieval. (Read more)
Vector Database Cloud - Vector Database Cloud is a managed cloud platform and ecosystem for building, deploying, and operating applications that use vector databases such as Qdrant and Milvus. It provides APIs, dashboards, and tooling tailored for AI and embedding-based workloads, enabling use cases like content recommendation and real-time fraud detection. (Read more)
Vector Search - Vector Search is Google Cloud Vertex AI’s managed vector search engine built on the ScaNN algorithm. It provides scalable, high‑performance vector similarity search for semantic search, recommendations, and generative AI applications, offering enterprise‑grade availability and the same underlying technology used in Google products like Search, YouTube, and Google Play. (Read more)
Vector Search and Embeddings (Google Cloud Skills Boost Course) - Google Cloud Skills Boost course that covers the fundamentals of vector search and text embeddings and shows how to build a vector search application on Vertex AI, including conceptual lessons, demos, and a practice lab. (Read more)
vector-io - Comprehensive vector data tooling library focused on working with vector embeddings and ANN data, useful for building, evaluating, and managing datasets and pipelines for vector databases and similarity search systems. (Read more)
vector-search-papers - A curated GitHub repository of research papers and technical blogs focused on vector search, approximate nearest neighbor search (ANN Search), and vector databases. This resource serves as a comprehensive directory for foundational and cutting-edge research, making it highly relevant for anyone building or exploring vector database technologies. (Read more) Vector Search Research papers Ann vector databases
VectorDB.Works - A web-based directory of vector database solutions, libraries, and resources for AI applications, serving as an accessible resource for exploring and comparing vector databases. (Read more) resources vector databases directory Ai
VectorHub - VectorHub is a resource and learning platform for developers and ML architects interested in integrating vector retrieval and search capabilities into their machine learning stacks, directly supporting vector database adoption and usage. (Read more) resources Vector Search Learning Open Source
Vertex AI Embeddings - Google Cloud’s managed embeddings service that generates text and multimodal vector representations for search, retrieval, and other AI applications. Frequently used alongside vector databases or vector search services to populate and update vector indexes. (Read more)
Vertex AI Feature Store - A managed feature store on Google Cloud that serves real-time feature data, often used alongside vector search to enrich or filter results returned from vector indexes in production recommendation and search systems. (Read more)
Vertex AI Pipelines - A serverless ML orchestration service on Google Cloud used to build automated pipelines that can generate embeddings and create or update vector search indexes, supporting MLOps workflows for vector database–backed search and recommendation systems. (Read more)
Vertex AI Search ranking API - A Google Cloud API that reranks documents based on semantic relevance using pretrained language models. It complements vector search by improving result ordering for content retrieved from vector databases or vector indexes. (Read more)
VLDB - New Trends in High-D Vector Similarity Search (Tutorial) - A VLDB conference tutorial focused on new trends and techniques for high-dimensional vector similarity search, covering core algorithms and system designs that underpin modern vector databases and large-scale ANN search. (Read more)
WARP: An Efficient Engine for Multi-Vector Retrieval - WARP is a research engine for efficient multi-vector retrieval, designed to improve performance of systems that store and search multiple embeddings per document—such as modern vector databases for RAG and semantic search workloads. (Read more)
Weaviate Recipes (Python) - Weaviate Python Recipes is a collection of Jupyter notebook examples showing how to use Weaviate as a vector database from Python, including ingestion, vector search, hybrid search, and integrations for AI and RAG workloads. (Read more)
Weaviate Recipes (TypeScript) - Weaviate TypeScript Recipes is a curated set of TypeScript code examples demonstrating how to interact with the Weaviate vector database, covering vector ingestion, querying, and AI-focused search patterns for JavaScript/TypeScript environments. (Read more)
weaviate-examples - Examples and resources for Weaviate, a popular open-source vector database optimized for storing and searching vector embeddings at scale. (Read more) Weaviate examples resources vector embeddings
XiaomingX/awesome-vector-database - A curated directory of resources, tools, tutorials, and libraries dedicated to vector databases, focusing on efficient data retrieval, similarity search, and machine learning applications. (Read more) vector databases resources Tutorials Similarity Search

LLM Tools

Cohere's re-ranker - A re-ranking tool provided by Cohere, which can be integrated into LLM applications via frameworks like LangChain to improve the relevance and order of retrieved documents from search systems, including those utilizing vector databases. (Read more) Re Ranking Llm Search
HuggingFace Text Embedding Server - A server that provides text embeddings, serving as a backend for embedding functions used with vector databases. (Read more) Embeddings Hugging Face Api
Ollama - A tool that allows users to run large language models locally, providing an easy way to set up and interact with various models, including integrations for generating and managing embeddings with vector databases. (Read more) Llm Local Tool
Elysia - Elysia is an open-source, decision-tree-based agentic system built on top of Weaviate that orchestrates tools and vector-search workflows, demonstrating how to build complex AI agents that leverage a vector database as a core component. (Read more) Rag tools Vector Search
Verba - Verba is a community-driven, open-source Retrieval-Augmented Generation (RAG) application that provides an end-to-end, user-friendly interface for building RAG workflows on top of a vector database, showcasing practical semantic search and retrieval patterns with Weaviate. (Read more) Rag Semantic Search Open Source

Multi Model & Hybrid Databases

Apache Cassandra Vector Search - Distributed NoSQL database with vector search capabilities via Storage-Attached Indexes (SAI) in Cassandra 5.0+. Uses Lucene HNSW for approximate nearest neighbor search. This is an OSS database under Apache 2.0 license. (Read more) Open Source Distributed Nosql
Elasticsearch Vector Search - Search and analytics engine with k-nearest neighbor (kNN) search for semantic similarity. Features approximate and exact kNN, HNSW indexing, and advanced quantization. This is commercial with OSS version available. (Read more) Commercial Open Source Search Engine
Rockset - Real-time analytics database with vector search capabilities, built on RocksDB with converged indexing. Acquired by OpenAI in 2024 to power retrieval infrastructure. This was a commercial service. (Read more) Commercial Real Time Analytics
Activeloop Deep Lake - Multi-modal vector database with tensor storage for vectors, images, texts, videos, and more. Features columnar storage format, time travel, ACID transactions, and terabyte-scale visualization for AI data management. (Read more) Multimodal Tensor Data Lake
Couchbase Vector Search - NoSQL database with vector search capabilities through Search Vector Indexes. Couchbase 8.0 introduces Hyperscale Vector Index for billion+ scale searches. This is a commercial database with free community edition. (Read more) Commercial Nosql Hybrid Search
CozoDB - General-purpose, transactional, relational-graph-vector database that uses Datalog for queries. Embeddable but capable of handling large amounts of data and concurrency with HNSW indices for high-performance vector similarity searches. (Read more) Graph Database Vector Search Datalog
NebulaGraph - Open-source distributed graph database designed for super large-scale graphs with billions of vertices and trillions of edges. Outperforms Neo4j on larger datasets while providing graph database capabilities for AI applications. (Read more) Graph Database Distributed Scalable
SingleStore - Distributed SQL database with built-in vector capabilities. Features SingleStore-V integrated vector system with credit-based pricing at $3.96 per compute credit. This is a commercial database. (Read more) Commercial Sql Distributed
StarRocks - Open-source high-performance analytical database with vector search capabilities. Features IVFPQ and HNSW indexing for approximate nearest neighbor search in v3.4+. This is an OSS database under Apache 2.0, a Linux Foundation project. (Read more) Open Source Analytics Hybrid Search

SDKs & Libraries

AutoTokenizer (Hugging Face Transformers) - A utility class from the Hugging Face Transformers library that automatically loads the correct tokenizer for a given pre-trained model. It is crucial for consistent text preprocessing and tokenization, a vital step before generating embeddings for vector database storage. (Read more) Nlp tokenization Hugging Face
Sentence-Transformers - A Python library for creating sentence, text, and image embeddings, enabling the conversion of text into high-dimensional numerical vectors that capture semantic meaning. It is essential for tasks like semantic search and Retrieval Augmented Generation (RAG), which often leverage vector databases. (Read more) Python Embeddings Semantic Search
SentenceTransformer - A Python library for generating high-quality sentence, text, and image embeddings. It simplifies the process of converting text into dense vector representations, which are fundamental for similarity search and storage in vector databases. (Read more) Python Embeddings Nlp
AHPQ.jl - AHPQ.jl is a Julia library providing training and inference for anisotropic hierarchical product quantization, compatible with ScaNN-style vector quantization and useful for building high-performance vector search pipelines. (Read more) product quantization Julia Vector Search
Amazon OpenSearch k-NN - Amazon OpenSearch's k-NN plugin enables scalable, efficient vector search using ANN algorithms (IVF, HNSW) directly within a managed OpenSearch cluster. It is directly relevant for building, querying, and scaling vector databases on AWS. (Read more) Vector Search Ann Managed Service OpenSearch
Annoy - An open-source library for approximate nearest neighbor search in high-dimensional spaces, often used as a backend for vector databases and search engines. (Read more) Open Source Ann High Dimensional Vector Search
Deep Searcher - Deep Searcher is a local open-source deep research solution that integrates Milvus and LangChain to provide advanced vector search and retrieval capabilities using open-source models. (Read more) Open Source Milvus Langchain Vector Search
DiskANN - DiskANN is a graph-based approximate nearest neighbor search (ANNS) system optimized for fast and accurate billion-point nearest neighbor search on a single node, leveraging SSD storage. It is highly relevant for large-scale vector database applications requiring efficient vector search at scale. (Read more) Ann High Performance Scalable Vector Search
EFANNA - EFANNA is an extremely fast approximate nearest neighbor search algorithm based on kNN graphs and randomized KD-trees. The provided implementation offers a high-performance ANN index suitable as a building block in custom vector search and retrieval infrastructure. (Read more) Ann High Performance vector indexing
FAISS - FAISS (Facebook AI Similarity Search) is a popular open-source library for efficient similarity search and clustering of dense vectors. Developed by Facebook/Meta, it supports billions of vectors and is widely used to power vector search engines and databases, especially where raw speed and scalability are needed. (Read more) Open Source Ann Similarity Search Scalable
FastText - FastText is an open-source library by Facebook for efficient learning of word representations and text classification. It generates high-dimensional vector embeddings used in vector databases for tasks like semantic search and document clustering. (Read more) Open Source vector embeddings Semantic Search Machine Learning
Gensim - Gensim is a Python library for topic modeling and vector space modeling, providing tools to generate high-dimensional vector embeddings from text data. These embeddings can be stored and efficiently searched in vector databases, making Gensim directly relevant to vector search use cases. (Read more) Python vector embeddings Open Source topic modeling
GloVe - GloVe is a widely used method for generating word embeddings using co-occurrence statistics from text corpora. These embeddings are commonly used as input to vector databases for semantic search and other vector-based information retrieval tasks. (Read more) vector embeddings Machine Learning Open Source Semantic Search
HNSW (Go) - A Go implementation of the HNSW approximate nearest neighbor search algorithm, enabling developers to embed efficient vector similarity search directly into Go services and custom vector database solutions. (Read more) Ann Go Vector Search
HNSW (Rust) - A Rust implementation of the HNSW (Hierarchical Navigable Small World) approximate nearest neighbor search algorithm, useful for building high-performance, memory-safe vector search components in Rust-based AI and retrieval systems. (Read more) Ann Rust Vector Search
hora - Hora is an efficient, open-source library for approximate nearest neighbor search, written in Rust. It offers high-performance vector search capabilities for AI and machine learning applications. (Read more) Open Source Ann Rust High Performance
Hugging Face Sentence Transformers Embedding Function for ChromaDB Java Client - An embedding function implementation within the ChromaDB Java client (tech.amikos.chromadb.embeddings.hf.HuggingFaceEmbeddingFunction) that utilizes Hugging Face's cloud-based inference API to generate vector embeddings for documents. (Read more) Embeddings Java Hugging Face
Hugging Face Tokenizers - A library from Hugging Face providing fast and customizable tokenization, a fundamental step for preparing text data for embedding models used with vector databases. (Read more) Nlp tokenization Hugging Face
IDEA - IDEA is an inverted, deduplication-aware index structure designed to improve storage efficiency and query performance for similarity search workloads. It is implemented as research code and targets high-dimensional vector and content-addressable data, making it relevant to large-scale vector database and ANN indexing systems. (Read more) Similarity Search Indexing High Dimensional
iRangeGraph - iRangeGraph is an ANN indexing approach and accompanying implementation for range-filtering nearest neighbor search. It provides a specialized graph-based index that supports vector similarity search under range constraints, making it directly useful as a component or reference implementation for advanced vector database indexing and retrieval. (Read more) Ann graph index Similarity Search
JinaEmbeddingFunction - A wrapper embedding function for Jina Embedding models, used to generate vector embeddings. (Read more) Embeddings Jina Api
jvector - jvector is a high-performance Java-based library and engine for vector search and approximate nearest neighbor indexing. (Read more) Ann Vector Search High Performance
LangChain - LangChain is an open-source framework that integrates with various vector databases, including Pinecone, Weaviate, and Chroma, to facilitate retrieval-augmented generation (RAG) and advanced AI workflows. (Read more) Open Source Rag Ai Integration
Langflow - Langflow is a platform that simplifies building AI agents by connecting models, vector stores, memory, and other AI building blocks. It is relevant to vector databases as it supports integration with vector stores for AI-powered agents. (Read more) Ai vector stores Integration Open Source
LibVQ - LibVQ is an open-source toolkit for optimizing vector quantization and efficient neural retrieval, offering training and indexing components that can serve as the core of high-performance approximate nearest neighbor search and vector database systems. (Read more) vector quantization Neural Search Ann
Milvus CLI - Milvus CLI is a command-line interface for managing and interacting with Milvus vector databases, allowing users to perform database operations and manage collections efficiently. (Read more) Milvus CLI Management vector databases
Milvus Lite - Milvus Lite is a lightweight, pip-installable variant of the Milvus vector database that runs as a library in notebooks or laptops, ideal for learning, experimentation, and rapid prototyping of AI and vector search applications. (Read more) Vector Search Lightweight Python
NearestNeighbors.jl - NearestNeighbors.jl is a Julia package implementing various nearest neighbor search algorithms and index structures for high-dimensional vector data. (Read more) Ann Julia Vector Search
Neighbor - Ruby gem for approximate nearest neighbor search that can integrate with pgvector and other backends to power vector similarity search in Ruby applications. (Read more) Ann Ruby Similarity Search
NMSLIB - NMSLIB is an efficient similarity search library and toolkit for high-dimensional vector spaces, supporting a variety of indexing algorithms for vector database use cases. (Read more) Open Source Ann Similarity Search High Dimensional
NSG - NSG is an approximate nearest neighbor search algorithm based on a sparse navigable graph structure designed for high-dimensional vector similarity search. The reference implementation provides a graph-based ANN index that can be integrated into custom vector retrieval systems. (Read more) Ann graph index Similarity Search
NVIDIA CAGRA - NVIDIA CAGRA is a GPU-accelerated graph-based library for approximate nearest neighbor searches, optimized for high-performance vector search leveraging modern GPU parallelism. It is suitable for scenarios requiring rapid, large-scale vector retrieval. (Read more) Gpu Acceleration Ann High Performance Vector Search
OpenAIEmbeddingFunction - An embedding function that utilizes the OpenAI API to compute vector embeddings, commonly used with vector databases. (Read more) Embeddings Openai Api
ParlayANN - ParlayANN is a scalable and deterministic parallel graph-based approximate nearest neighbor (ANN) search library. It provides parallel algorithms and implementations for high-dimensional vector similarity search, suitable as a core search component in large-scale vector database and retrieval systems. (Read more) Ann Parallel Scalable
PGVector - PostgreSQL supports vector indexing and similarity search via the PGVector extension, allowing relational databases to manage and retrieve vector embeddings efficiently. (Read more) Open Source Vector Search Postgresql Similarity Search
pgvector-cobol - COBOL bindings and examples for pgvector, letting legacy COBOL systems interact with PostgreSQL as a vector database. (Read more) SDK Pgvector vector store
pgvector-crystal - Crystal language client for pgvector, providing idiomatic Crystal access to vector operations in PostgreSQL. (Read more) SDK Pgvector vector store
pgvector-dotnet - .NET (C#, F#, Visual Basic) library for pgvector that exposes vector storage and similarity queries on PostgreSQL to .NET applications. (Read more) SDK Pgvector vector store
pgvector-elixir - Elixir wrapper and examples for pgvector, integrating PostgreSQL-based vector search into Elixir ecosystems like Phoenix. (Read more) SDK Pgvector vector store
pgvector-erlang - Erlang client and examples for pgvector, providing tools to run vector operations against PostgreSQL from Erlang systems. (Read more) SDK Pgvector vector store
pgvector-gleam - Gleam language client and examples for pgvector, allowing Gleam applications to perform vector similarity search using PostgreSQL. (Read more) SDK Pgvector vector store
pgvector-haskell - Haskell bindings and examples for pgvector, enabling Haskell applications to treat PostgreSQL as a vector database. (Read more) SDK Pgvector vector store
pgvector-lisp - Lisp bindings and examples for pgvector, allowing Common Lisp projects to leverage PostgreSQL as a vector store. (Read more) SDK Pgvector vector store
pgvector-node - JavaScript/TypeScript (Node.js) client for pgvector, enabling server-side JS apps to run vector queries on PostgreSQL. (Read more) SDK Pgvector vector store
pgvector-ocaml - OCaml client and examples for pgvector that provide access to vector indexing and nearest-neighbor search in PostgreSQL from OCaml code. (Read more) SDK Pgvector vector store
pgvector-pascal - Pascal bindings and examples for pgvector, supporting PostgreSQL-powered vector search from Pascal applications. (Read more) SDK Pgvector vector store
pgvector-perl - Perl client and examples for pgvector, exposing vector data types and similarity queries in PostgreSQL to Perl scripts and apps. (Read more) SDK Pgvector vector store
pgvector-prolog - Prolog client and examples for pgvector, enabling logic programs to interact with vector search capabilities in PostgreSQL. (Read more) SDK Pgvector vector store
pgvector-python - Python library and examples for pgvector, integrating Python AI/ML pipelines with PostgreSQL vector storage and similarity queries. (Read more) SDK Pgvector vector store
pgvector-ruby - Ruby client and examples for pgvector, integrating Ruby applications (including Rails) with PostgreSQL vector operations for AI use cases. (Read more) SDK Pgvector vector store
pgvector-rust - Rust client and examples for pgvector, offering idiomatic Rust APIs for embedding storage and similarity queries in PostgreSQL. (Read more) SDK Pgvector vector store
pgvector-swift - Swift bindings and examples for pgvector, allowing Swift and server-side Swift apps to use PostgreSQL as a vector database. (Read more) SDK Pgvector vector store
PilotANN - PilotANN is a memory-bounded GPU-accelerated framework for large-scale vector search, designed to improve performance and efficiency of approximate nearest neighbor (ANN) search workloads, making it relevant as a high-performance engine/component in vector database and vector search systems. (Read more) Gpu Acceleration Ann High Performance
Product-Quantization - Product-Quantization is a GitHub repository implementing the inverted multi-index structure for product-quantization-based approximate nearest neighbor search, providing building blocks for scalable vector search engines. (Read more) product quantization Ann vector indexing
pymilvus - pymilvus is the official Python SDK for Milvus, allowing developers to interact programmatically with the Milvus vector database. It provides utilities for transforming unstructured data into vector embeddings and supports advanced features such as reranking for optimized search results. The pymilvus[model] variant includes utilities for generating vector embeddings from text using built-in models. Python Milvus vector embeddings SDK
Qinco - Qinco is an open-source implementation from Facebook Research for Residual Quantization with Implicit Neural Codebooks. It provides quantization and indexing methods for compact vector representations to accelerate similarity and nearest neighbor search, making it relevant as a low-level vector indexing and compression component for vector databases and large-scale AI retrieval systems. (Read more) vector compression Similarity Search Open Source
RaBitQ - RaBitQ is an open-source library implementing the "Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search" method, providing vector quantization and compression techniques designed to improve efficiency and accuracy of ANN search engines and vector databases operating in high-dimensional spaces. (Read more) Ann vector compression High Dimensional
Reconfigurable Inverted Index - Reconfigurable Inverted Index (Rii) is a research project and open-source library for approximate nearest neighbor and similarity search over high-dimensional vectors. It focuses on flexible, reconfigurable inverted index structures that support efficient vector search, making it directly relevant as a vector-search engine component for AI and multimedia retrieval applications. (Read more) Ann vector indexing Similarity Search
RediSearch - RediSearch is a Redis module that provides high-performance vector search and similarity search capabilities on top of Redis, enabling advanced search and retrieval features for AI and data applications. (Read more) Vector Search Redis Open Source Similarity Search
RETA-LLM - RETA-LLM is a toolkit designed for retrieval-augmented large language models. It is directly relevant to vector databases as it involves retrieval-based methods that typically leverage vector search and vector databases to enhance language model capabilities through external knowledge retrieval. (Read more) Rag Llm Retrieval Vector Search
RTNN - RTNN is a research prototype system and codebase that accelerates high-dimensional nearest neighbor search using hardware ray tracing units on modern GPUs. It targets vector similarity search workloads common in AI applications, exploring ray-tracing hardware as an alternative acceleration path to traditional CPU- or CUDA-based ANN indexes. (Read more) Gpu Acceleration Ann Similarity Search
ScaNN - A library by Google Research for efficient vector similarity search, suitable for large-scale nearest neighbor applications in AI. (Read more) Open Source Ann Vector Search Ai
SimSIMD - Open‑source library providing fast SIMD‑accelerated implementations of similarity and distance computations (e.g., vector inner products and distances), serving as an efficient alternative to scipy.spatial.distance and numpy.inner for vector search and vector database workloads. (Read more) Similarity Search Optimization vector processing
spaCy - spaCy is an industrial-strength NLP library in Python that provides advanced tools for generating word, sentence, and document embeddings. These embeddings are commonly stored and searched in vector databases for NLP and semantic search applications. (Read more) Python vector embeddings Nlp Open Source
SPTAG - SPTAG is a distributed approximate nearest neighbor (ANN) library for building and searching large-scale vector indexes, supporting efficient and scalable vector search scenarios. (Read more) Open Source Ann Distributed Scalable
SymphonyQG - SymphonyQG is a research codebase and method that integrates vector quantization with graph-based indexing to build efficient approximate nearest neighbor (ANN) indexes for high-dimensional vector search. It targets vector database and similarity search scenarios where combining compact codes with navigable graphs can improve recall–latency tradeoffs and memory footprint. (Read more) Ann vector quantization graph index
Tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene, offering fast and scalable similarity search capabilities. While primarily focused on text, it supports efficient vector-based similarity searches, making it useful for vector search tasks. (Read more) Open Source full-text search Vector Search Scalable
usearch - usearch is a fast, open-source search and clustering engine designed for efficient vector search across multiple programming languages. (Read more) Open Source Vector Search Clustering multi-language
Voyager - Voyager is a Spotify open-source vector search library and service for efficient nearest neighbor search on large-scale vector datasets. (Read more) Ann Vector Search Open Source
vsag - vsag is an Alibaba open-source library implementing efficient vector search algorithms, including approximate nearest neighbor search for high-dimensional vectors. (Read more) Ann High Dimensional Vector Search
Word2vec - Word2vec is a popular machine learning technique for generating vector embeddings based on the distributional properties of words in large corpora. It is directly relevant to vector databases as it produces the high-dimensional vector representations stored and indexed by these databases for vector search and similarity tasks. (Read more) vector embeddings Machine Learning Open Source Python

Vector Database Engines

Deep Lake 4.0 - AI data lake with revolutionary index-on-the-lake technology enabling sub-second queries from S3. Features 10x cost efficiency vs in-memory DBs and 2x faster than alternatives. This is a commercial platform with OSS components. (Read more) Commercial Data Lake Multimodal
Vespa - Open-source AI search platform combining vector search, keyword retrieval, structured filtering, and ML ranking. Powers applications at Spotify, Yahoo, and Wix with sub-100ms response times. This is an OSS platform under Apache 2.0 with managed cloud option. (Read more) Open Source Enterprise Hybrid Search
YugabyteDB with pgvector - PostgreSQL-compatible distributed database with pgvector support and USearch integration, proven to handle billions of vectors with 96.56% recall and sub-second query latency. (Read more) Postgresql Distributed Open Source
Actian VectorAI DB - Edge-native vector database designed for deployment at remote locations and edge devices with no cloud dependency. Supports real-time decision making with sub-15ms query latency and operates independently during network outages. (Read more) Edge On Premises Offline
Couchbase Lite Vector - Developer-friendly, full-featured embedded NoSQL database with vector search for offline-first GenAI apps running on mobile, IoT devices, and web browsers with no internet dependencies. (Read more) Embedded Offline Mobile
ObjectBox Vector - On-device vector database with out-of-the-box data sync, designed for resource-efficiency on mobile, IoT, and embedded devices, enabling offline-first AI applications without internet dependency. (Read more) Edge Embedded Offline
Qdrant Edge - Lightweight embedded vector search engine designed for real-time vector search on edge devices like robots, kiosks, and mobile phones with limited computational resources and offline capabilities. (Read more) Edge Embedded Offline
ScyllaDB Vector Search - High-performance NoSQL database with vector search capabilities built on USearch library and shard-per-core architecture, storing vector embeddings alongside structured data in unified tables. (Read more) Nosql Distributed High Performance
Turso - SQLite-based database with native vector search capabilities built directly into the database without extensions. Based on libSQL fork of SQLite with support for DiskANN algorithm for approximate nearest neighbor search. This is a commercial solution with free tier available. (Read more) Commercial Sqlite Edge
Zvec - Lightweight embedded vector database for RAG systems useful in edge environments, running directly on devices with local vector search and no network latency or cloud dependencies. (Read more) Embedded Edge Lightweight

Cloud Services

Azure Cosmos DB NoSQL Vector Search - Microsoft's globally distributed multi-model database with native vector search using DiskANN algorithm. Features <20ms query latency and 43x lower cost vs Pinecone. This is a commercial managed service. (Read more) Commercial Azure Nosql
Vertex AI Vector Search - Google Cloud's vector search engine (formerly Matching Engine) built on ScaNN algorithm. Version 2.0 features unified data model with hybrid search and auto-generated embeddings. This is a commercial managed service. (Read more) Commercial Google Cloud managed service
AlloyDB for PostgreSQL with Vector Search - Google Cloud's PostgreSQL-compatible database with state-of-the-art HNSW index build performance, achieving 9x speedup over pgvector on CPU for vector workloads. (Read more) Postgresql Google Cloud Managed Service
AWS OpenSearch k-NN - Managed OpenSearch service with k-nearest neighbor search capabilities. Uses HNSW, Faiss, and Lucene libraries for approximate nearest neighbor searches. This is a commercial managed service. (Read more) Commercial Aws managed service
Turbopuffer - Serverless vector and full-text search database built on object storage with sub-10ms p50 latency. 10x cheaper than alternatives while hosting 2.5T+ documents and serving 10k+ queries per second. (Read more) Serverless Object Storage Cost Effective

Cloud Services

Amazon Aurora Machine Learning - A feature of Amazon Aurora that enables making calls to ML models like Amazon Bedrock or Amazon SageMaker through SQL functions, allowing direct generation of embeddings within the database and abstracting the vectorization process. (Read more) Machine Learning Embeddings Aws
Amazon Aurora Serverless v2 - An on-demand, auto-scaling configuration for Amazon Aurora DB instances that automatically adjusts compute and memory capacity based on load, integrated with Knowledge Bases for Amazon Bedrock to simplify vectorization and database capacity management. (Read more) Cloud Native Serverless Aws
Instaclustr Vector Database Management - A managed service and tooling offering from Instaclustr that helps teams operate and optimize vector databases for GenAI and Retrieval-Augmented Generation (RAG) workloads, providing expertise and infrastructure management for production deployments. (Read more) Managed Service Rag vector databases
MotherDuck - A cloud data warehouse that can be leveraged to store vector embeddings as List data types, enabling semantic search capabilities through SQL-based similarity functions within an existing data pipeline. (Read more) Cloud data warehousing vector embeddings
Nextbrick Managed Vector Database Service - A fully managed vector database infrastructure and operations service provided by Nextbrick. It focuses on deployment, configuration, tuning, scaling, security, and maintenance of vector databases for AI and similarity search workloads. The service handles sharding, replication, query optimization, backups, and disaster recovery so organizations can offload operational management and focus on building AI applications. (Read more) Managed Service vector database services
Qdrant Cloud Inference - Qdrant Cloud Inference is a managed inference service integrated with the Qdrant vector database, allowing users to generate embeddings and work with vector search pipelines directly in the cloud environment. (Read more) Managed Service Embeddings Vector Search
Snowflake - A cloud data platform that offers capabilities for storing and querying various data types, including vector embeddings, often used in conjunction with its data warehousing features. (Read more) Cloud data warehousing vector embeddings

Curated Resource Lists

Building Applications with Vector Databases - DeepLearning.AI course teaching six practical vector database applications using Pinecone, including RAG for LLMs, recommender systems, and hybrid search combining images and text. (Read more) Learning Tutorials Rag
Vector Database Market Trends 2026 - Comprehensive overview of vector database evolution in 2026, including the shift to vectors as data types, PostgreSQL dominance, 400% adoption surge, and $10.6B projected market by 2032. (Read more) Market Trends Survey
LangChain & Vector Databases in Production - Free comprehensive course from Activeloop with 60+ lessons and 10+ practical projects, teaching production-ready LLM applications with vector databases, trusted by 10,000+ engineers. (Read more) Learning Langchain Rag
Vector Database Fundamentals (Coursera) - IBM's comprehensive specialization providing job-ready vector database skills in one month, covering foundational knowledge for LLM-powered AI similarity searches, available for free enrollment. (Read more) Learning Tutorials Certification

Managed Vector Databases

Qdrant Cloud - Managed vector database service with 1GB free forever cluster (no credit card required). Fully managed with multi-cloud support across AWS, GCP, and Azure. This is a commercial managed service. (Read more) Commercial managed service Multi Cloud
Qdrant Hybrid Cloud - Industry-first managed vector database deployable in any environment - cloud, on-premise, or edge. Kubernetes-native with complete data sovereignty while maintaining managed service convenience. (Read more) Hybrid Cloud Kubernetes Enterprise
BagelDB - Collaborative vector database platform described as 'GitHub for AI data'. Features distributed storage, HNSW indexing, and supports private, collaborative, and public vector datasets. This is a commercial platform with open collaboration features. (Read more) Commercial Collaborative Distributed
DataStax Astra DB - Serverless vector database built on Apache Cassandra that empowers developers to build AI applications with real-time data handling. Features 20% higher relevance and 74x faster responses with advanced vector and knowledge graph capabilities. (Read more) Cassandra Serverless Enterprise
LanceDB Cloud - Fully managed serverless vector database service with automatic scaling and infrastructure management. Seamless transition from LanceDB OSS with the same SDK, starting at $16.03/month with $100 free credits. (Read more) Serverless Managed Cloud Native
Momento Vector Index - Serverless vector indexing service designed for real-time storage and retrieval of vector data. Developer-friendly with just 5 API calls to create complete indexes, featuring transparent pricing. This is a commercial managed service. (Read more) Commercial Serverless Real Time
Neon - Serverless Postgres with native pgvector support for vector embeddings and similarity search. Features instant provisioning, autoscaling, and scale-to-zero with separated compute and storage. This is a commercial managed service with free tier. (Read more) Commercial Serverless Postgresql
Nuclia - AI Search and RAG-as-a-Service platform with semantic search capabilities. Features NucliaDB open-source database. Acquired by Progress in 2025, now part of Progress Agentic RAG. This is a commercial service with OSS core (NucliaDB). (Read more) Commercial Open Source Rag
Supabase Vector - Open-source toolkit for developing AI applications using Postgres and pgvector. Provides managed PostgreSQL with built-in vector support, Python client (vecs), and AI features. This is a commercial managed service with OSS components. (Read more) Commercial Open Source Postgresql
Upstash Vector - Serverless vector database with pay-per-use pricing and scale-to-zero capability. Fully managed service that scales to billions of vectors with simple per-request pricing. This is a commercial managed service. (Read more) Commercial Serverless managed service
Vespa Cloud - Unified search and AI engine with seamless scaling, intelligent retrieval, and precision ranking. Goes beyond simple vector search with tensor support, multi-phase ranking, and hybrid retrieval blending semantic, textual, and structured signals at scale. (Read more) Hybrid Search Ranking Scalable

Sdks & Libraries

HNSWlib - Header-only C++/Python library for fast approximate nearest neighbor search implementing the HNSW algorithm. Used by Spotify and others, offers 10x speed increase over Annoy. This is an OSS library. (Read more) Open Source Hnsw Cpp
NVIDIA cuVS - GPU-accelerated vector search and clustering library from NVIDIA RAPIDS. Provides 8-12x faster index building and queries with multiple language support (C, C++, Python, Rust). This is an OSS library. (Read more) Open Source Gpu Acceleration Nvidia
ELPIS - Graph-based similarity search algorithm achieving 0.99 recall, building indexes 3-8x faster than competitors with 40% less memory. Answers 1-NN queries up to 10x faster than serial scan. (Read more) Ann Graph Based Research
GLASS - Leading graph-based ANN library optimized for approximate nearest neighbor search, offering competitive performance especially at lower recall levels across diverse datasets. (Read more) Ann Graph Based Cpp
hnswlib-rs - Pure-Rust implementation of HNSW algorithm for approximate nearest neighbor search. Decouples graph from vector storage for flexible deployment. Supports dense floating point and quantized int8 vectors. This is an OSS library. (Read more) Open Source Rust Hnsw
OdinANN - Billion-scale graph-based ANNS index with direct insertion capabilities. Achieves <1ms search latency with >10x less memory than in-memory indexes through GC-free design and update combining. (Read more) Ann Disk Based High Performance
PageANN - Disk-based approximate nearest neighbor search framework with page-aligned graph structure. Achieves 1.85x-10.83x higher throughput than state-of-the-art methods through optimized SSD utilization. (Read more) Ann Disk Based Open Source
PipeANN - Low-latency, billion-scale updatable graph-based vector store on SSD. Achieves <1ms search latency with 10x less memory than in-memory indexes through alignment of best-first search with SSD characteristics. (Read more) Ann Disk Based Open Source
PyNNDescent - Python implementation of Nearest Neighbor Descent for k-neighbor-graph construction and ANN search. Targets 80%-100% accuracy with fast performance and supports wide variety of distance metrics. This is an OSS library. (Read more) Open Source Python Ann
SPANN - Highly-efficient billion-scale approximate nearest neighbor search system from Microsoft Research. Uses a memory-disk hybrid architecture storing centroid points in memory and large posting lists on disk. (Read more) Ann Hybrid Search Microsoft
VectorDB - Lightweight Python package for storing and retrieving text using chunking, embeddings, and vector search. Powers AI features in Kagi Search with low latency and small memory footprint. This is an OSS library. (Read more) Open Source Python Lightweight

Vector Database Extensions

pgvecto.rs - PostgreSQL extension for scalable, low-latency vector search written in Rust. Features 20x faster HNSW than pgvector, with support for FP16, INT8, and binary vectors. This is an OSS extension. (Read more) Open Source Postgresql Rust
VectorChord - PostgreSQL extension for scalable, high-performance vector search, successor to pgvecto.rs. Features RaBitQ quantization enabling 6x cost savings vs Pinecone. Fully compatible with pgvector. This is an OSS extension. (Read more) Open Source Postgresql Quantization
Neo4j Vector Index - Vector search capabilities in Neo4j graph database using HNSW indexing. Enables combining knowledge graphs with semantic similarity search for hybrid retrieval that leverages both graph relationships and vector embeddings. (Read more) Graph Database Hnsw Knowledge Graph
pgai - Open-source PostgreSQL extension and Python library that automates embedding generation and synchronization for RAG and semantic search applications. Features pgai Vectorizer for declarative embedding pipelines. This is an OSS solution. (Read more) Open Source Postgresql Embedding
pgvectorscale - PostgreSQL extension that builds on pgvector for higher-performance embedding search with DiskANN indexing. Achieves 28x lower latency and 16x higher throughput than Pinecone at 75% less cost on 50M embeddings. (Read more) Postgresql Open Source Performance
PlanetScale Vectors - Vector search and storage for MySQL, now generally available. PlanetScale Vectors brings native vector capabilities to MySQL, allowing you to store and query vector embeddings alongside relational data without requiring a separate vector database. (Read more) Mysql Cloud Native Vector Search
Redis Vector Search - Native vector database capabilities in Redis combining ultra-low latency in-memory operations with vector similarity search. Redis 8.0 introduced vector sets as native data type for semantic search, RAG pipelines, and recommendations. (Read more) Redis In Memory Real Time
Timescale Vector - PostgreSQL-based vector search solution built on Timescale Cloud with pgai extensions including pgvector, pgvectorscale, and pgai. Features StreamingDiskANN index for high-performance embedding search at scale. (Read more) Postgresql Pgvector Time Series

Benchmarks & Evaluation

MTEB Leaderboard - Massive Text Embedding Benchmark leaderboard covering 58 datasets across 112 languages and 8 embedding tasks. Industry-standard benchmark for comparing text embedding models. (Read more) Benchmark Embeddings Evaluation
Big-ANN Benchmarks - Billion-scale approximate nearest neighbor search benchmark competition. Features datasets like SIFT1B, Deep1B with standardized evaluation metrics for comparing vector search algorithms at scale. (Read more) Benchmark Ann Competition
BigANN Benchmarks - Main competition for large-scale vector database algorithms held at NeurIPS conferences. Framework for evaluating approximate nearest neighbor search algorithms on billion-scale datasets with standardized metrics and datasets. (Read more) Benchmark Competition Ann
Deep1B Dataset - Billion-scale benchmark dataset containing 96-dimensional deep learning image embeddings. Provides real-world proxy for testing distributed systems and GPU-accelerated vector search at scale. (Read more) Benchmark Datasets Deep Learning
MMTEB - Massive Multilingual Text Embedding Benchmark covering over 500 quality-controlled evaluation tasks across 250+ languages, representing the largest multilingual collection of embedding model evaluation tasks. (Read more) Benchmark Multilingual Evaluation
MTEB (Massive Text Embedding Benchmark) - Comprehensive benchmark evaluating universal text embedding models across 1000+ languages and 58 datasets over 8 tasks. The industry standard for comparing embedding model performance. (Read more) Benchmark Evaluation Embeddings
SIFT1B Dataset - Billion-scale benchmark dataset containing 128-dimensional SIFT descriptors of one billion images. Widely used standard for evaluating approximate nearest neighbor search algorithms at scale. (Read more) Benchmark Datasets Ann
VectorDBBench - Open-source vector database benchmarking tool testing databases across production-critical scenarios including static collection, filtering, and streaming cases with modern embedding model datasets. (Read more) Benchmark Open Source Performance
ViDoRe - Visual Document Retrieval Benchmark defining standard evaluation protocols for vision-centric document and video retrieval with 26,000 pages and 3,099 queries across 6 languages from 12,000 man-hours of annotations. (Read more) Benchmark Multimodal Rag

Data Integration & Migration

Unstructured.io - Deep document parsing platform with strong OCR capabilities excelling at extracting structured data from complex layouts including multi-column PDFs, scanned documents, and forms. (Read more) Data Integration Ocr Document Parsing
Kanister for Vector Database Backup - Open-source CNCF Sandbox project enabling efficient and secure backup and restore strategies for vector databases on Kubernetes with cloud-native integration. (Read more) Backup Kubernetes Disaster Recovery
LlamaHub - Open-source repository with 160+ community-created data loaders, readers, tools, and connectors for LlamaIndex applications, covering formats from PDFs to Notion databases. (Read more) Data Integration Loaders Open Source
VectorFlow - Open-source high-throughput vector embedding pipeline for ingesting raw data, transforming into vectors, and loading into vector databases. Technology-agnostic with automatic retry and fault tolerance. (Read more) Etl Pipeline Open Source

Llm Frameworks

Haystack - Mature, modular open-source Python framework for building production-grade RAG pipelines, AI agents, and semantic search systems, trusted by The European Commission and The Economist. (Read more) Rag Python Enterprise
DSPy - Programming framework for RAG and AI applications with cutting-edge optimization capabilities, featuring the lowest framework overhead and automatic improvement based on example data. (Read more) Rag Python Optimization
txtai - All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows. Combines vector indexes (sparse/dense), graph networks and relational databases. This is an OSS framework. (Read more) Open Source Python Rag

Llm Tools

RAGAS - Research-backed RAG evaluation framework providing metrics for context precision, recall, faithfulness, and response relevancy to objectively measure LLM application performance. (Read more) Rag Evaluation Metrics
Datadog Vector Database Monitoring - Comprehensive observability solution for vector databases through Zilliz Cloud integration, providing metrics for QPS, latency, slow queries, and failure rates alongside full stack monitoring. (Read more) Observability Monitoring Integration
Langfuse - Open-source LLM engineering platform providing observability, metrics, evaluations, and prompt management. Integrates with OpenTelemetry, LangChain, OpenAI SDK, and vector databases for RAG pipeline monitoring. (Read more) Observability Open Source Prompt Management
Langtrace - Open-source LLM observability tool built on OpenTelemetry standards. Automatically captures traces from LLM APIs, vector databases, and frameworks with support for over 30 popular providers. (Read more) Observability Open Source Opentelemetry
Monte Carlo Vector Database Observability - Data observability platform specifically supporting vector databases including Pinecone, providing comprehensive monitoring across the five pillars of data observability. (Read more) Observability Data Quality Monitoring
VectorAdmin - Universal vector database management UI and tool suite supporting multiple platforms including Pinecone, Chroma, Qdrant, and Weaviate for centralized administration. (Read more) Gui Management Tool

Machine Learning Models

all-MiniLM-L6-v2 - A compact and efficient pre-trained sentence embedding model, widely used for generating vector representations of text. It's a popular choice for applications requiring fast and accurate semantic search, often integrated with vector databases. (Read more) Embeddings Nlp Ai
OpenAI’s text-embedding-ada-002 - A pre-trained model used for extracting embeddings from content like PDFs, videos, and transcripts, which are then stored in vector databases for faster search. (Read more) Embeddings Ai Openai

Multi Model & Hybrid Databases

DuckDB - An in-memory, open-source, and free analytical database that speaks SQL, heavily based on vectorization. It can store and process vector embeddings using Array and List data types to enable vector search, bridging the gap between data engineering and AI workflows with fast response times. (Read more) In Memory Open Source Analytics Sql
Azure Cosmos DB Vector Indexing - Native vector indexing capability in Azure Cosmos DB that supports flat, quantizedFlat, and diskANN index types for efficient vector similarity search using the VectorDistance function. It enables low-latency, high-throughput, and cost-efficient vector search directly in Cosmos DB collections, with options for brute-force exact search (flat), compressed brute-force search (quantizedFlat), and approximate nearest neighbor search (diskANN). (Read more) Vector Search Diskann Cloud Native
Couchbase - A database platform that includes vector support, aiming to enhance developer productivity with AI tools like Capella IQ. (Read more) Nosql vector data Ai
OpenSearch Vector Search - OpenSearch Vector Search is the vector similarity search and AI search capability within the OpenSearch engine, supporting vector indices, ingestion of embedding data, and search methods including raw vector search, semantic search, hybrid search, multimodal search, and neural sparse search. It enables building RAG and conversational search applications using either user-provided embeddings or embeddings generated automatically by OpenSearch. (Read more) Vector Search Hybrid Search Semantic Search
SingleStoreDB (formerly MemSQL) - SingleStoreDB is an enterprise database that has supported vectors since 2017, in addition to exact keyword match, and recently announced support for additional vector indexes. (Read more) Enterprise Sql vector indexes
SurrealDB - A multi-model database that supports various data types and query languages, including capabilities for handling vector data. (Read more) multi-model vector data Nosql

Vector Database Extensions

k-NN plugin - An OpenSearch plugin that expands its capabilities with the custom knn_vector data type, enabling storage of embeddings and providing methods for k-NN similarity searches, including Approximate k-NN, Script Score k-NN, and Painless extensions. (Read more) OpenSearch k-NN Vector Search
faiss-quickeradc - faiss-quickeradc is an extension of FAISS that implements the Quicker ADC approach to accelerate product-quantization-based approximate nearest neighbor search using SIMD, improving performance in vector database retrieval. (Read more) Ann product quantization Optimization
HeatWave - A feature for MySQL that integrates vector store capabilities, allowing users to store and process vector embeddings for AI applications. (Read more) Mysql vector store extension
Lantern - Lantern is a PostgreSQL extension that enables efficient vector search capabilities, allowing users to perform similarity searches directly within their PostgreSQL databases. (Read more) Postgresql Vector Search extension
MariaDB Vector - MariaDB Vector is an extension or feature of MariaDB, providing capabilities for handling and querying vector data within the MariaDB ecosystem. (Read more) relational database Vector Search extension
Neo4j Vector Search - An enhancement to the Neo4j graph database providing vector search capabilities through dedicated indexes. (Read more) Graph Database Vector Search extension
OpenSearch Neural Search / Hybrid Search - Neural and hybrid search capability in OpenSearch that combines lexical queries with vector-based neural search using a pipeline of normalization and score combination techniques. It enables semantic (vector) search and hybrid search over indices such as neural_search_pqa, suitable for AI and vector database-style retrieval use cases. (Read more) Hybrid Search Semantic Search Vector Search
SuperDuperDB - Open-source AI-native database layer that adds vector search, model integration, and AI workflows on top of existing databases like MongoDB and Postgres. (Read more) Vector Search MongoDB Postgresql

Benchmarks & Evaluation

ANN-Benchmarks - ANN-Benchmarks is a benchmarking platform specifically for evaluating the performance of approximate nearest neighbor (ANN) search algorithms, which are foundational to vector database evaluation and comparison. (Read more) Benchmark Ann Evaluation Performance
BEIR - BEIR (Benchmarking IR) is a benchmark suite for evaluating information retrieval and vector search systems across multiple tasks and datasets. Useful for comparing vector database performance. (Read more) Benchmark Evaluation Vector Search Datasets
Billion-scale ANNS Benchmarks - A benchmarking resource for evaluating approximate nearest neighbor search (ANNS) methods on billion-scale datasets, highly relevant for assessing the scalability of vector databases. (Read more) Benchmark ANNS Scalability Performance
Milvus Sizing Tool - Milvus Sizing Tool helps users estimate the hardware and resource requirements needed to deploy Milvus based on their anticipated data scale and workload. (Read more) Milvus sizing Performance resource estimation
MyScale's Vector Database Benchmark - Benchmark results and tools by MyScale aimed at measuring the performance of vector databases in various search and retrieval tasks. (Read more) Benchmark vector databases Performance Retrieval
Qdrant's Vector Database Benchmarks - A set of benchmarks provided by Qdrant for evaluating vector databases, focusing on speed, scalability, and accuracy of vector search operations. (Read more) Benchmark vector databases Performance Scalability
SISAP Indexing Challenge - An annual competition focused on similarity search and indexing algorithms, including approximate nearest neighbor methods and high-dimensional vector indexing, providing benchmarks and results relevant to vector database research. (Read more) Benchmark Similarity Search Evaluation
WEAVESS - WEAVESS is an open-source benchmarking and evaluation framework for graph-based approximate nearest neighbor (ANN) search methods, providing code and experiments for large-scale vector similarity search. It is useful for researchers and practitioners comparing vector indexing algorithms for vector databases and AI search applications. (Read more) Ann Benchmark Similarity Search
Zeng, Xianzhi, et al. "CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion." - A 2024 paper introducing CANDY, a benchmark for continuous ANN search with a focus on dynamic data ingestion, crucial for next-generation vector databases. (Read more) Benchmark Ann dynamic data Vector Search

Commerce

Denser Retriever - Denser Retriever is a vector-based retrieval system designed for efficient similarity search and information access in AI and ML workloads. (Read more) Vector Search Similarity Search Ai Commercial
LiquidMetal AI - LiquidMetal AI is a platform providing intelligent storage with built-in AI capabilities, including vector database features for building advanced AI applications. (Read more) Ai vector databases Commercial intelligent storage
Meilisearch Vector Search - Meilisearch offers vector search capabilities as part of its search engine, enabling hybrid and semantic search for AI applications. (Read more) Vector Search Semantic Search Hybrid Search Commercial Ai
Qdrant Enterprise Solutions - Qdrant Enterprise Solutions provide enterprise‑grade deployments and support for the Qdrant vector database, including advanced security, high availability, SLAs, and integration services for large‑scale AI search and recommendation use cases. (Read more) Enterprise vector database services
QdrantCloud - QdrantCloud is the managed cloud version of Qdrant, a vector database tailored for AI-powered similarity search and matching. (Read more) Managed Service vector database Similarity Search Ai
Vectara - Vectara is a commercial vector database and search platform that enables semantic and hybrid AI-powered search using vector embeddings. (Read more) Commercial Vector Search Semantic Search Ai
vector-admin - A universal tool suite for managing vector databases such as Pinecone, Chroma, Qdrant, and Weaviate. Facilitates straightforward management and integration of multiple vector database systems. (Read more) Management tools vector databases Integration

Concepts & Definitions

Deep Learning for Search - Applied book on using deep learning for search, including dense vector representations, semantic search, and neural ranking, all directly relevant to building applications on top of vector databases. (Read more) Semantic Search Machine Learning resources
Foundations of Multidimensional and Metric Data Structures - Technical book covering theory and practice of multidimensional and metric data structures for similarity search, forming a theoretical basis for index structures used in vector databases. (Read more) Similarity Search metric space Data Structure
K-means Tree - K-means Tree is a clustering-based data structure that organizes high-dimensional vectors for fast similarity search and retrieval. It is used as an indexing method in some vector databases to optimize performance for vector search operations. (Read more) Clustering Data Structure Similarity Search High Dimensional
Locality-Sensitive Hashing - Locality-Sensitive Hashing (LSH) is an algorithmic technique for approximate nearest neighbor search in high-dimensional vector spaces, commonly used in vector databases to speed up similarity search while reducing memory footprint. (Read more) Ann Similarity Search High Dimensional Optimization
M-tree - M-tree is a dynamic index structure for organizing and searching large data sets in metric spaces, enabling efficient nearest neighbor queries and dynamic updates, which are important features for vector databases handling high-dimensional vectors. (Read more) Data Structure metric space nearest neighbor Dynamic Updates
Machine Learning Crash Course: Embeddings - Module of Google’s Machine Learning Crash Course that explains word and text embeddings, how they are obtained, and the difference between static and contextual embeddings, giving essential background for using vector representations in vector databases and similarity search systems. (Read more) Embedding Machine Learning Learning
Online Product Quantization (O-PQ) - Online Product Quantization (O-PQ) is a variant of product quantization designed to support dynamic or streaming data. It enables adaptive updating of quantization codebooks and codes in real-time, making it suitable for vector databases that handle evolving datasets. Ann dynamic data Vector Search Real Time
Optimized Product Quantization (OPQ) - Optimized Product Quantization (OPQ) enhances Product Quantization by optimizing space decomposition and codebooks, leading to lower quantization distortion and higher accuracy in vector search. OPQ is widely used in advanced vector databases for improving recall and search quality. Ann Optimization Vector Search accuracy
PQ (Product Quantization) - Product Quantization is a compression and indexing technique for vector search that splits vectors into subspaces and quantizes each part separately, allowing vector databases to store large-scale embeddings compactly while supporting efficient ANN search. (Read more) Quantization Ann vector compression
R-tree - R-tree is a tree data structure widely used for indexing multi-dimensional information such as vectors, supporting efficient spatial queries like nearest neighbor and range queries, which are essential in vector databases. (Read more) Data Structure spatial indexing Vector Search nearest neighbor
Spectral Hashing - Spectral Hashing is a method for approximate nearest neighbor search that uses spectral graph theory to generate compact binary codes, often applied in vector databases to enhance retrieval efficiency on large-scale, high-dimensional data. Ann Similarity Search Compression Optimization
Vector Database - A vector database is a specialized database designed to store, index, and retrieve unstructured data represented as high-dimensional vectors, enabling efficient semantic search, similarity search, and powering applications such as LLM long-term memory, semantic search, and recommendation systems. (Read more) vector databases definition Semantic Search Similarity Search

Data Integration & Migration

Airbyte Milvus Connector - The Airbyte Milvus connector lets users sync data from various Airbyte-supported sources into Milvus as a destination, enabling low-code vector data ingestion pipelines. (Read more) Integration migration vector data
Attu - Attu is a graphical user interface (GUI) tool for managing and administering Milvus vector databases. It simplifies tasks such as data exploration, schema management, and monitoring, making Milvus more accessible for a wide range of users. (Read more) Gui Management Milvus Open Source
Birdwatcher - Birdwatcher is a system debugging tool designed for the Milvus vector database. It provides advanced diagnostics to help developers and operators understand and troubleshoot Milvus deployments, ensuring robust vector search operations. (Read more) debugging Milvus Management Open Source
Kafka Connect Milvus Connector - The Kafka Connect Milvus Connector is a plugin for Kafka Connect that streams data into and out of Milvus, supporting real-time vector data ingestion pipelines. (Read more) Integration Real Time vector data
Milvus Backup Tool - Milvus Backup Tool provides backup and restore functionalities for Milvus vector databases, ensuring data safety and disaster recovery capabilities. Also referred to as Milvus Backup. (Read more) Milvus Backup restore Disaster Recovery
Milvus CDC - Milvus CDC (Change Data Capture) is a component of the Milvus ecosystem that enables data synchronization between Milvus and other systems. It is useful for maintaining up-to-date vector data pipelines and supporting real-time vector search applications. (Read more) Milvus data synchronization Real Time vector databases
Milvus Connectors - Milvus Connectors, such as the Spark-Milvus Connector, enable seamless integration of Milvus vector databases with third-party tools like Apache Spark for machine learning and data processing workflows. (Read more) Milvus Integration Machine Learning Apache Spark
Milvus Destination for Fivetran - The Milvus destination in Fivetran enables automated ELT pipelines that load data into Milvus as a vector database, supporting AI and similarity search workloads. (Read more) Integration Etl vector data
MindsDB Milvus Integration - MindsDB provides an integration with Milvus, enabling users to connect and manage vector data using SQL-like queries. This integration brings federated AI query capabilities across structured and unstructured data with Milvus as the vector database backend. (Read more) Milvus Integration Ai Sql
Spark-Milvus Connector - The Spark-Milvus Connector is an integration that allows Apache Spark jobs to read from and write to Milvus, enabling scalable ETL and analytics workflows for vector data. (Read more) Integration Apache Spark vector data
Vector Transport Service (VTS) - Vector Transport Service (VTS) is a tool for transporting vector data efficiently between Milvus clusters or environments, supporting large-scale data migration and synchronization. Vector Transmission Services (VTS) are tools for transferring data between Milvus and various data sources (like Zilliz clusters, Elasticsearch, Postgres/PgVector, or other Milvus instances), facilitating vector data migration and integration. (Read more) vector data migration Integration Milvus
VTS (Vector Transfer Service) - VTS is a data migration and connector service for Milvus that simplifies moving and synchronizing vector data between Milvus instances and external systems. (Read more) migration data synchronization Milvus

LLM Frameworks

Dify - Open-source LLM app development platform with an intuitive interface that combines AI workflow, RAG pipeline, agent capabilities, model management, and observability features for rapid prototyping and production deployment. (Read more) Open Source Rag Ai Agents
Embedchain - Open Source RAG Framework designed to be 'Conventional but Configurable', streamlining the creation of RAG applications with efficient data management, embeddings generation, and vector storage. (Read more) Rag Open Source Python
FlashRAG - Python toolkit for efficient RAG research providing 36 pre-processed benchmark datasets and 23 state-of-the-art RAG algorithms in a unified, modular framework for reproduction and development. (Read more) Rag Open Source Python
h2oGPT - Apache 2.0 open-source project for querying and summarizing documents or chatting with local private GPT LLMs. Supports Ollama, Mixtral, llama.cpp with persistent databases (Chroma, Weaviate, FAISS) and accurate embeddings. (Read more) Open Source Privacy Local Llm
LightRAG - Simple and efficient retrieval-augmented generation framework that combines document retrieval with generation, focusing on speed and ease of use. Designed to run on standard CPUs and laptops with minimal resource requirements. (Read more) Rag Lightweight Open Source
LLMWare - Retrieval-augmented generation framework that utilizes small, specialized models instead of large language models, significantly reducing computational and financial costs while offering cost-effective RAG solutions that can run on standard hardware. (Read more) Rag Cost Effective Open Source
NVIDIA NeMo Retriever - Collection of industry-leading Nemotron RAG models delivering 50% better accuracy, 15x faster multimodal PDF extraction, and 35x better storage efficiency for building enterprise-grade retrieval-augmented generation pipelines. (Read more) Rag Multimodal Microservices
Pathway - Python ETL framework for stream processing and real-time analytics with built-in vector search capabilities. Features real-time document synchronization, in-memory vector index, and adaptive RAG technology for always-current AI applications. (Read more) Real Time Streaming Rag
PrivateGPT - Production-ready AI project for private, local document Q&A using RAG. 100% private with no data leaving your environment, supporting offline operation with local LLMs and vector databases. (Read more) Privacy Local Rag
RAGatouille - Python library designed to simplify the integration and training of state-of-the-art late-interaction retrieval methods, particularly ColBERT, within RAG pipelines with a modular and user-friendly interface. (Read more) Rag Colbert Retrieval
RAGFlow - Open-source RAG engine based on deep document understanding with citation-backed responses. RAGFlow extracts tables, images, and structured data from complex documents, providing truthful question-answering with well-founded citations. (Read more) Open Source Rag Document Understanding
Vercel AI SDK - Free open-source TypeScript toolkit for building AI-powered applications with a unified API supporting 15+ providers including OpenAI, Anthropic, Google, and more. Created by the makers of Next.js for seamless AI integration. (Read more) Typescript Api Multi Provider

LLM Frameworks

LlamaIndex - LlamaIndex is a data framework for large language model (LLM) applications, providing tools to ingest, structure, and access private or domain-specific data, often integrating with vector databases for retrieval augmented generation (RAG). (Read more) Llm Rag framework

LLM Tools

ARES - RAG evaluation framework that trains lightweight judges for retrieval and generation scoring, refining evaluation by training specialized LLM judges on synthetic datasets to provide more reliable, confidence-aware judgments. (Read more) Evaluation Rag Open Source
Arize Phoenix - Open-source LLM tracing and evaluation solution built on OpenTelemetry for RAG evaluation. Provides automated instrumentation which records the execution path of LLM requests through multiple steps. (Read more) Observability Evaluation Opentelemetry
AWQ - Activation-aware Weight Quantization method that preserves model accuracy at 4-bit quantization by identifying and skipping important weights. Maintains 99%+ of original performance with moderate inference speed improvements. (Read more) Quantization Optimization Compression
Cohere Rerank - Proprietary neural network reranker accessed via API that processes query and document together as a cross-encoder to precisely judge relevance. Supports over 100 languages with Rerank 3 Nimble variant for faster production performance. (Read more) Reranking Api Multilingual
DeepEval - Simple open-source LLM evaluation framework similar to Pytest for unit testing LLM outputs. Provides 14+ targeted metrics for RAG and fine-tuning scenarios including hallucination, faithfulness, and contextual relevancy. (Read more) Evaluation Testing Rag
Docling - Open-source document parsing framework from IBM with 97.9% accuracy in complex table extraction and excellent text fidelity. Self-hostable solution for converting PDFs, spreadsheets, and scanned images into structured data for RAG pipelines. (Read more) Document Parsing Open Source Rag
Feder - Visualization tool for ANNS (Approximate Nearest Neighbor Search) algorithms enabling users to observe index structures, parameter configurations, and the complete vector similarity search process. (Read more) Visualization Ann Hnsw
FiftyOne - Computer vision interface for vector search with native integrations for Qdrant, Pinecone, LanceDB, and Milvus. Enables natural language search, configurable vector database backends, and visualization of search matches across billions of images. (Read more) Computer Vision Visualization Vector Search
GGUF - GPT-Generated Unified Format for storing quantized model weights, designed for CPU inference and consumer hardware. Enables running LLMs on laptops and edge devices with flexible layer offloading to GPU. (Read more) Quantization Cpu Format
GPTQ - Post-training quantization method for 4-bit weight compression that focuses on GPU inference performance. First quantization method to compress LLMs to 4-bit range while maintaining accuracy, minimizing mean squared error to weights. (Read more) Quantization Compression Optimization
Helicone - Open-source observability layer designed to help developers monitor and understand how their applications interact with large language models. Acts as a lightweight proxy between applications and LLM providers. (Read more) Observability Monitoring Open Source
LangSmith - LangChain's observability platform for monitoring, debugging, and evaluating LLM applications. Automatically traces every LLM call, captures prompts and outputs, tracks costs and latency, and enables systematic evaluation through dataset-based testing. (Read more) Observability Monitoring Langchain
LiteLLM - Open-source proxy and SDK that provides a single unified API to call and manage hundreds of different LLM providers and models with OpenAI-compatible endpoints. Simplifies multi-provider LLM integration. (Read more) Open Source Api Llm
llamafile - Single-file executable that bundles LLM weights and llama.cpp runtime. Distribute and run LLMs locally with no installation, including embedding generation via built-in server. (Read more) Local Llm Single File Embeddings
LlamaParse - High-performance document parsing service by LlamaIndex that consistently processes documents in about 6 seconds regardless of size. Returns rich Markdown and optional HTML tables with wide format support through hosted API. (Read more) Document Parsing Api Rag
Milvus WebUI - Built-in GUI introduced in Milvus v2.5 for system observability, offering real-time monitoring of system health, collection management, and query optimization from a unified dashboard. (Read more) Visualization Monitoring Milvus
Nomic Atlas - AI-ready data visualization platform for massive datasets of embeddings. Atlas enables interactive exploration of millions of vectors in your web browser, with automatic dimensionality reduction and semantic clustering. (Read more) Visualization Embeddings Analytics
Portkey - AI gateway that provides a unified interface to interact with 250+ AI models, offering advanced tools for control, visibility, and security in Generative AI applications. Integrates with vector databases for production-level routing and reliability. (Read more) Ai Gateway Observability Llm
Recursive Character Text Splitter - Document chunking strategy that splits text at hierarchical boundaries like paragraphs, sentences, or headings. Industry-standard approach recommended as starting point with 400-512 tokens and 10-20% overlap for optimal RAG performance. (Read more) Chunking Text Processing Rag
Semantic Chunker - Document chunking strategy that dynamically chooses split points between sentences based on embedding similarity rather than fixed sizes. Maintains semantic coherence by grouping related content together for improved RAG retrieval. (Read more) Chunking Semantic Search Embeddings
TruLens - Open-source solution for evaluating and tracing AI Agents and RAG applications using feedback functions to programmatically evaluate components of execution flow. Features the RAG Triad metrics for comprehensive evaluation. (Read more) Evaluation Rag Observability
Unstructured - Document parsing platform delivering strong content fidelity and precision with low hallucination rates. Achieves 100% accuracy on simple tables and 75% on complex structures with comprehensive enterprise document support. (Read more) Document Parsing Enterprise Rag
VectorDBZ - Enterprise-grade desktop application for managing and analyzing vector databases with interactive visualizations, supporting Qdrant, Weaviate, Milvus, ChromaDB, Pinecone, pgvector, and Elasticsearch. (Read more) Visualization Management Gui
Zep - Context engineering and agent memory platform for AI agents with sub-200ms latency. Zep uses a temporal knowledge graph architecture to deliver relationship-aware context from chat history, business data, documents, and app events. (Read more) Ai Agents Knowledge Graph Memory

Multi-Model & Hybrid Databases

Apache Kvrocks - Distributed key-value NoSQL database with experimental vector similarity search. Redis-compatible with RocksDB storage engine, adding HNSW-based vector indexing for large-scale vector data management. (Read more) Redis Compatible Distributed Vector Search
Deep Lake 4.0 (Activeloop) - Multimodal AI database for vectors, images, texts, videos, and more. Features index-on-the-lake technology for sub-second queries from object storage with 10x cost efficiency and 2x faster performance. (Read more) Multimodal Data Lake Cost Efficient
Memgraph - In-memory graph database with native vector search capabilities powered by USearch. Combines vector embeddings with knowledge graphs for GraphRAG, enabling semantic similarity search alongside graph traversal. (Read more) Graph Database Vector Search In Memory

Open Sources

AnythingLLM - AnythingLLM is an open-source AI application that integrates with vector databases to facilitate storage and retrieval of embeddings, supporting various AI and LLM workflows. (Read more) Open Source Ai Llm vector database
Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data that is commonly used to facilitate efficient integration between vector databases and machine learning frameworks. It provides a standardized format for data exchange that is useful for storing and querying high-dimensional vectors in AI applications. (Read more) Open Source In Memory Data Integration Ai
arroy - Arroy is an open-source library for efficient similarity search and management of vector embeddings, useful in vector database systems. (Read more) Open Source vector embeddings Similarity Search Vector Search
Awesome-Moviate - Awesome-Moviate is a movie search and recommendation engine demo that combines BM25 keyword search, semantic vector search, and hybrid search using Weaviate as the underlying vector database, serving as a practical example of hybrid retrieval for media content. (Read more) Hybrid Search examples Open Source
Bleve - Bleve is an open-source search library with experimental support for vector search, enabling hybrid search and retrieval in applications. (Read more) Open Source search library Hybrid Search Vector Search
Crate - Crate is an open-source distributed SQL database with support for vector data types and vector search, suitable for AI-driven applications. (Read more) Open Source Distributed Sql Vector Search
cuVS - cuVS is an open-source library from RAPIDS for fast, GPU-accelerated vector search, useful for building high-performance vector databases. (Read more) Open Source Gpu Acceleration Vector Search High Performance
DocArray - An open-source library for creating, storing, and searching multimodal data and vector embeddings, supporting AI and ML workflows. (Read more) Open Source Multimodal vector embeddings Ai
Epsilla - Epsilla is an open-source vector database optimized for high-performance similarity search and scalable storage of vector embeddings. (Read more) Open Source vector database Similarity Search Scalable
frugal - A platform focused on transforming AI/ML operations with transparency, control, and cost optimization, including support for vector database tasks. (Read more) Open Source Ai ML vector database
Havenask - Havenask is an open-source distributed search engine with support for vector search, designed for large-scale AI and search applications. (Read more) Open Source Distributed Vector Search Ai
Healthsearch Demo - Healthsearch is an open-source demo application that uses Weaviate as a vector database to retrieve supplement products based on user-written reviews and queries, illustrating real-world semantic product search over vector embeddings. (Read more) Semantic Search examples Open Source
HelixDB - HelixDB is a powerful, open-source graph-vector database built in Rust, designed for intelligent data storage for Retrieval-Augmented Generation (RAG) and AI applications. It combines graph database features with vector search, making it directly relevant to AI and machine learning workflows that require vector data management. (Read more) Open Source Graph Database Vector Search Rag Rust
HVS (Hierarchical Graph Structure) - HVS is a graph-based index structure leveraging Voronoi diagrams for approximate nearest neighbor search in high-dimensional vector spaces. It is directly relevant to vector databases as it provides efficient similarity search capabilities for large-scale vector data. (Read more) Open Source Ann Graph Database Similarity Search
InfluxDB - InfluxDB 3 OSS provides high-performance time series workloads with new support for vector data, making it suitable for AI/ML and vector search applications. Relevant as a vector-capable database. (Read more) Open Source vector data Time Series Vector Search
Jina - Jina is an open-source neural search framework that delivers cloud-native neural and vector search solutions powered by deep learning for AI applications. It is also known as Jina Search, designed for building search systems powered by vector databases, making it highly relevant for applications involving AI, semantic search, and vector data management. (Read more) Open Source Neural Search Vector Search Cloud Native
KGraph - KGraph is an open-source library for fast approximate nearest neighbor search in high-dimensional vector spaces, applicable to vector database solutions. (Read more) Open Source Ann Similarity Search Vector Search
langchain4j - langchain4j is an open-source framework for developing LLM-powered Java applications, with built-in support for integrating vector databases as memory stores. (Read more) Open Source Llm Java vector database
llm-app - llm-app is an open-source project that provides an AI application framework with integrated support for vector databases, enabling the development of LLM-powered solutions. (Read more) Open Source Ai Llm vector database
MeiliSearch - MeiliSearch is an open-source, fast, and relevant search engine that supports vector search capabilities, making it suitable for AI applications requiring vector database functionality. (Read more) Open Source Search Engine Vector Search Ai
mem0 - mem0 is an open-source vector database focused on efficient storage and retrieval of high-dimensional embeddings for large-scale AI applications. (Read more) Open Source vector database High Dimensional Ai
MRPT - MRPT (Multi-Resolution Proximity Trees) is an open-source library for fast approximate nearest neighbor search in high-dimensional vector spaces, applicable to vector database backends. (Read more) Open Source Ann High Dimensional Vector Search
MuopDB - MuopDB is an open-source vector database designed for fast and scalable similarity search in AI applications. (Read more) Open Source vector database Similarity Search Scalable
nanopq - nanopq is a lightweight product quantization library for efficient vector compression and similarity search, which is an important feature for vector databases that need to store and query large-scale vector data efficiently. (Read more) Open Source Quantization vector compression Similarity Search
NGT - NGT (Neighborhood Graph and Tree) is an open-source vector search engine designed for fast and scalable approximate nearest neighbor search. (Read more) Open Source Vector Search Ann Scalable
OasysDB - OasysDB is an open-source vector database focused on efficient similarity search and management of high-dimensional data. (Read more) Open Source vector database Similarity Search High Dimensional
puck - Puck is an open-source vector search engine designed for fast similarity search and retrieval of embedding vectors. (Read more) Open Source Vector Search Similarity Search Embedding
RAFT - RAFT is a suite of GPU-accelerated libraries for data science, including support for vector search and similarity operations, often used in vector database scenarios. (Read more) Open Source Gpu Acceleration Vector Search data science
reor - reor is an open-source vector database solution focused on fast and scalable storage of high-dimensional vectors for AI and ML applications. (Read more) Open Source vector database Scalable Ai
sqlite-vec - sqlite-vec is an open-source extension for SQLite that adds vector data types and similarity search, enabling lightweight vector database capabilities. (Read more) Open Source Sqlite vector data Similarity Search
Valkey - Valkey is an open-source in-memory key-value data store that supports vector search operations, making it useful for AI and machine learning vector database workloads. It is also a specialized open-source vector database designed for efficient management and retrieval of high-dimensional vector data, offering advanced APIs and optimized storage for AI workloads. (Read more) Open Source Vector Search In Memory Ai

Relational Databases

CockroachDB - CockroachDB is a cloud-native, distributed SQL database that now supports vector data, combining traditional SQL queries with efficient vector search capabilities, ensuring data resilience, availability, scalability, and strong consistency. (Read more) Sql vector data Distributed
PostgreSQL - A powerful, open-source relational database that can be extended with modules like pgvector to support efficient storage and similarity search of vector embeddings, effectively functioning as a vector database. (Read more) Open Source relational database Pgvector

Research Papers & Surveys

A Brief Survey of Vector Databases - This survey paper provides an overview of the landscape, technologies, and applications of vector databases, making it a valuable resource for understanding the field. vector databases Survey Applications technologies
A Comprehensive Survey on Vector Database - A comprehensive academic survey that explores the architecture, storage, retrieval techniques, and challenges associated with vector databases. It categorizes algorithmic approaches to approximate nearest neighbor search (ANNS) and discusses how vector databases can be integrated with large language models, offering valuable insights and foundational knowledge for understanding and building vector database systems. (Read more) vector databases Survey ANNS Architecture
ACL 2023 Tutorial: Retrieval-Based Language Models and Applications - This ACL 2023 tutorial reviews retrieval-based language models, which often rely on vector databases and vector search systems to retrieve relevant context. The tutorial covers methods and applications central to the use of vector databases in modern NLP systems. (Read more) Tutorials Retrieval vector databases Applications
ACORN - ACORN is a performant and predicate-agnostic search system for vector embeddings and structured data, enhancing the capability of vector databases to handle complex queries over high-dimensional data efficiently. (Read more) vector embeddings search system predicate-agnostic Research
Adanns - Adanns is a framework for adaptive semantic search, focusing on efficient and scalable similarity search in high-dimensional vector spaces. Its relevance to 'Awesome Vector Databases' lies in its support for advanced vector search techniques suitable for AI and machine learning applications. (Read more) Semantic Search Similarity Search Ai Machine Learning Research
AiSAQ - AiSAQ is an all-in-storage approximate nearest neighbor search system that uses product quantization to enable DRAM-free vector similarity search, serving as a specialized vector search/indexing approach for large-scale information retrieval. (Read more) Ann Similarity Search vector indexing
BANG - BANG is a billion-scale approximate nearest neighbor search system optimized for single GPU execution, enabling high-performance vector search in vector database environments at massive scale. (Read more) Ann Gpu Acceleration High Performance Vector Search Research
Cagra - Cagra provides highly parallel graph construction and approximate nearest neighbor search for GPUs, supporting large-scale vector database operations and efficient similarity search. (Read more) graph construction Ann Gpu Acceleration Similarity Search Research
DET-LSH - DET-LSH is a locality-sensitive hashing scheme that introduces a dynamic encoding tree structure to accelerate approximate nearest neighbor (ANN) search in high-dimensional spaces. While it is a research algorithm rather than a production database, it directly targets the core operation behind vector databases—efficient ANN search over vector embeddings—and is relevant for designing or optimizing vector indexing components within vector database systems. (Read more) Ann Hashing High Dimensional
Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs - This paper introduces the HNSW algorithm, which is widely adopted in vector databases and search engines for its efficient and robust performance on high-dimensional data. HNSW is foundational in powering modern vector search systems. (Read more) Hnsw Ann Vector Search Research
Efficient Locality Sensitive Hashing - This work by Jingfan Meng is a comprehensive research thesis on efficient locality-sensitive hashing (LSH), covering algorithmic solutions, core primitives, and applications for approximate nearest neighbor search. It is relevant to vector databases because LSH-based indexing is a foundational technique for scalable similarity search over high-dimensional vectors, informing the design of vector indexes, retrieval engines, and similarity search modules in modern vector database systems. (Read more) Ann Similarity Search Hashing
Graph-based Methods - A category of vector database solutions and algorithms leveraging graph-based approaches for efficient similarity search and vector indexing, which are core to many vector database implementations in AI applications. (Read more) Graph Database Similarity Search vector indexing Ai
GTS - GTS is a GPU-based tree index for fast similarity search over high-dimensional vector data, providing an efficient ANN index structure that can be integrated into or used to build high-performance vector database systems. (Read more) Similarity Search Ann Gpu Acceleration
LANNS: a web-scale approximate nearest neighbor lookup system - A scalable system for approximate nearest neighbor search at web-scale, relevant for implementing and understanding vector database infrastructure for high-dimensional data. (Read more) Ann Scalability Vector Search Research
Li, Wen, et al. "Approximate nearest neighbor search on high dimensional data—experiments, analyses, and improvement." - An influential paper analyzing and improving approximate nearest neighbor search methods for high-dimensional data, highly relevant for developing and understanding vector databases. Ann High Dimensional Vector Search Research
Maze - Maze is a web-scale video deduplication system that relies on large-scale approximate nearest neighbor vector search over video embeddings to detect and remove duplicate or near-duplicate videos efficiently. While not a general-purpose vector database, it represents a specialized, production-scale application of vector search infrastructure for multimedia content management. (Read more) Ann Applications Multimodal
OneSparse: A Unified System for Multi-index Vector Search - A unified system designed for efficient multi-index vector search, directly addressing large-scale vector database performance and scalability challenges. (Read more) Vector Search Performance Scalability Research
SOAR - SOAR is a set of improved algorithms on top of ScaNN that accelerate vector search by introducing controlled redundancy and multi-cluster assignment, enabling faster approximate nearest neighbor retrieval with smaller indexes in large‑scale vector databases and search systems. (Read more) Ann Vector Search Optimization
Starling - Starling is an I/O-efficient, disk-resident graph index framework tailored for high-dimensional vector similarity search on large data segments, supporting the scalable storage and retrieval needs of vector databases. (Read more) graph index Similarity Search Scalable Research
Towards Reliable Vector Database Management Systems: A Software Testing Roadmap for 2030 - An academic paper providing a comprehensive overview of the architecture, empirical defects, and future research roadmap for Vector Database Management Systems (VDBMS). This resource is directly relevant for understanding the current state and challenges in building and testing reliable vector databases. (Read more) vector databases Testing roadmap reliability
VDBMS Architecture Overview - An overview of the architectural components common to Vector Database Management Systems (VDBMS), which are designed to efficiently store, index, and query high-dimensional vector embeddings. This provides foundational knowledge for anyone interested in the internal workings of vector databases. (Read more) Research Architecture vector databases High Dimensional
VDBMS Testing Research Roadmap Paper - A research paper that proposes the first structured roadmap for testing Vector Database Management Systems (VDBMS), analyzing bugs, vulnerabilities, and test challenges unique to vector databases. It provides insights and future directions for improving the reliability and robustness of vector databases. (Read more) Research Testing vector databases roadmap
VDBMS Testing Roadmap - A comprehensive research roadmap addressing the unique challenges of testing vector database management systems (VDBMS), including approaches for test input generation, oracle definition, and test evaluation tailored to vector databases. The work highlights the complexities of high-dimensional vector data, approximate search semantics, and integration with AI/LLM pipelines, making it a valuable resource for advancing reliability and trustworthiness in vector databases. (Read more) vector databases Testing roadmap Ai
Vector Database Group @ NTU - A research group focused on advancing the theory and practice of vector databases, providing resources, publications, and tools related to vector database technology. (Read more) Research vector databases resources Ai
Vector database management systems: Fundamental concepts, use-cases, and current challenges - A comprehensive research paper outlining the fundamental concepts, practical use-cases, and current challenges in the field of vector database management systems. vector databases Use Cases challenges Survey

Research Papers & Surveys

BatANN - Distributed disk-based approximate nearest neighbor system achieving near-linear throughput scaling. Delivers 6.21-6.49x throughput improvement over scatter-gather baseline with sub-6ms latency on 10 servers. (Read more) Ann Distributed Research
FreshDiskANN - Fast and accurate graph-based ANN index for streaming similarity search, enabling real-time updates on billion-point indexes using a single machine with real-time freshness. (Read more) Ann Graph Based Dynamic Updates
LoRANN - Low-Rank Matrix Factorization algorithm for Approximate Nearest Neighbor Search, offering competitive performance with faster query times than leading libraries at various recall levels. (Read more) Ann Algorithm Optimization
MCGI - Manifold-Consistent Graph Indexing for billion-scale disk-resident vector search. Leverages Local Intrinsic Dimensionality to achieve 5.8x throughput improvement over DiskANN on high-dimensional datasets. (Read more) Ann Research Disk Based
PECANN - Parallel Efficient Clustering with graph-based Approximate Nearest Neighbor search, providing efficient clustering algorithms optimized for high-dimensional vector spaces. (Read more) Ann Clustering Parallel
SLIM (Sparsified Late Interaction Multi-Vector Retrieval) - Efficient multi-vector retrieval system using sparsified late interaction with inverted indexes. Achieves 40% less storage and 83% lower latency than ColBERT-v2 while maintaining competitive accuracy. (Read more) Retrieval Research Sparse
SPFresh - Incremental in-place update system for billion-scale vector search from Microsoft Research. Maintains 2.41x lower P99.9 latency than baselines while supporting efficient vector updates with minimal resource overhead. (Read more) Ann Research Dynamic Updates

SDKs & Libraries

chromem-go - Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence, designed for simplicity and performance in Go applications. (Read more) Golang Embedded In Memory
Dense Passage Retrieval (DPR) - Set of tools and models from Meta AI Research for open domain Q&A using dense representations, outperforming BM25 by 9%-19% in passage retrieval accuracy with a dual-encoder BERT framework. (Read more) Retrieval Open Source Nlp
FlagEmbedding - Open-source retrieval and RAG framework from BAAI featuring the BGE embedding model series. BGE-M3 supports multi-functionality (dense, sparse, multi-vector), multi-linguality (100+ languages), and multi-granularity (up to 8192 tokens). (Read more) Open Source Embeddings Multilingual
FLANN - Fast Library for Approximate Nearest Neighbors containing a collection of algorithms optimized for nearest neighbor search in high dimensional spaces with automatic algorithm and parameter selection. (Read more) Ann Open Source Cpp
FlashRank - Ultra-lite and super-fast Python reranking library based on SoTA cross-encoders and LLMs, running on CPU with the tiniest reranking model in the world at ~4MB with no PyTorch dependency. (Read more) Reranking Lightweight Open Source
Graphiti - Open-source framework for building temporally-aware knowledge graphs that power AI agent memory. Graphiti tracks when facts were true and maintains historical context, combining semantic search with graph traversal. (Read more) Open Source Knowledge Graph Temporal
hnswlib-node - Node.js bindings for HNSWlib implementing approximate nearest-neighbor search. Provides fast HNSW-based vector similarity search for JavaScript/TypeScript applications with file persistence support. (Read more) Nodejs Javascript Hnsw
Ollama Embeddings - Local embedding generation through Ollama supporting models like nomic-embed-text and mxbai-embed-large. Enables completely offline embeddings with no subscription fees or API costs, ideal for privacy-focused RAG applications. (Read more) Embeddings Local Privacy
RAPIDS cuVS - GPU-accelerated vector search library from NVIDIA providing approximate nearest neighbors and clustering algorithms with up to 12x faster index builds and 4.7x lower search latency through GPU parallelization. (Read more) Gpu Acceleration Nvidia Performance

Security & Governance

Cloaked AI - Application-layer encryption solution for securing vector embeddings and enabling searchable/queryable encryption in vector databases, protecting AI data without compromising search functionality. (Read more) Encryption Security Privacy

Security & Governance

Privacera AI Governance (PAIG) - Privacera AI Governance (PAIG) is a solution designed to secure and govern AI data, including safeguarding vector databases and embeddings, ensuring data privacy and compliance for AI applications. (Read more) data governance Security compliance

🍺 Contribute

Please give us ⭐ on Github, it helps!

⭐ Star History

Legal

All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship.

This directory may include content generated by artificial intelligence (AI). While efforts have been made to ensure the accuracy and reliability of the information, we make no representations or warranties of any kind, express or implied, about the completeness, accuracy, reliability, suitability, or availability of the information contained herein. Users are advised to independently verify the information before making decisions based on it.

We disclaim any responsibility for errors, omissions, or inaccuracies in the content, whether generated by humans, AI, or any other means. By using this directory, you agree to use it at your own risk and acknowledge that the information provided may not always be current or accurate.

If you believe that your intellectual property rights or other legal rights have been infringed, please contact us immediately at legal@ever.co and we will take appropriate action.

License

Shield:

This work is licensed under a

Creative Commons Attribution-ShareAlike 4.0 International License.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
details		details
LICENSE.md		LICENSE.md
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Awesome Vector Databases

🔥 Acknowledgements

📑 Table of Contents

Machine Learning Models

Concepts & Definitions

Vector Database Engines

Managed Vector Databases

Curated Resource Lists

LLM Tools

Multi Model & Hybrid Databases

SDKs & Libraries

Vector Database Engines

Cloud Services

Cloud Services

Curated Resource Lists

Managed Vector Databases

Sdks & Libraries

Vector Database Extensions

Benchmarks & Evaluation

Data Integration & Migration

Llm Frameworks

Llm Tools

Machine Learning Models

Multi Model & Hybrid Databases

Vector Database Extensions

Benchmarks & Evaluation

Commerce

Concepts & Definitions

Data Integration & Migration

LLM Frameworks

LLM Frameworks

LLM Tools

Multi-Model & Hybrid Databases

Open Sources

Relational Databases

Research Papers & Surveys

Research Papers & Surveys

SDKs & Libraries

Security & Governance

Security & Governance

🍺 Contribute

⭐ Star History

Legal

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages