A curated list of vector database solutions, libraries, and resources for AI applications.
This directory was built and is maintained using the Ever Works Directory Builder platform.
The public-facing website is based on the open-source Directory Website Template.
- Machine Learning Models (35)
- Concepts & Definitions (74)
- Vector Database Engines (62)
- Managed Vector Databases (14)
- Curated Resource Lists (55)
- LLM Tools (5)
- Multi Model & Hybrid Databases (9)
- SDKs & Libraries (71)
- Vector Database Engines (10)
- Cloud Services (5)
- Cloud Services (7)
- Curated Resource Lists (4)
- Managed Vector Databases (11)
- Sdks & Libraries (11)
- Vector Database Extensions (8)
- Benchmarks & Evaluation (9)
- Data Integration & Migration (4)
- Llm Frameworks (3)
- Llm Tools (6)
- Machine Learning Models (2)
- Multi Model & Hybrid Databases (6)
- Vector Database Extensions (8)
- Benchmarks & Evaluation (9)
- Commerce (7)
- Concepts & Definitions (12)
- Data Integration & Migration (12)
- LLM Frameworks (12)
- LLM Frameworks (1)
- LLM Tools (24)
- Multi-Model & Hybrid Databases (3)
- Open Sources (31)
- Relational Databases (2)
- Research Papers & Surveys (25)
- Research Papers & Surveys (7)
- SDKs & Libraries (9)
- Security & Governance (1)
- Security & Governance (1)
- BGE-VL - State-of-the-art multimodal embedding model from BAAI supporting text-to-image, image-to-text, and compositional visual search. Trained on the MegaPairs dataset with over 26 million retrieval triplets. (Read more)
MultimodalOpen SourceVisual Search - ColBERT - Late interaction architecture for efficient and effective passage search. Encodes queries and documents independently using BERT, then performs token-level similarity via maxsim operator for strong generalization. (Read more)
RetrievalOpen SourceNlp - ColBERTv2 - Advanced multi-vector retrieval model creating token-level embeddings with late interaction mechanism, featuring denoised supervision and improved memory efficiency over original ColBERT. (Read more)
Late InteractionEmbeddingsRetrieval - Jina Embeddings v4 - Universal multimodal embedding model from Jina AI supporting text and images through unified pathway. Built on Qwen2.5-VL-3B-Instruct, outperforms proprietary models on visually rich document retrieval. This is a commercial API with free tier, though OSS weights available. (Read more)
CommercialMultimodalOpen Source - Nomic Embed Text - First fully reproducible open-source text embedding model with 8,192 context length. v2 introduces Mixture-of-Experts architecture for multilingual embeddings. Outperforms OpenAI models on benchmarks. This is an OSS model under Apache 2.0 license. (Read more)
Open SourceEmbeddingMultilingual - NV-Embed - NVIDIA's generalist embedding model achieving record 69.32 score on MTEB benchmark. Fine-tuned from Llama architecture with improved techniques for training LLMs as embedding models. (Read more)
EmbeddingsNvidiaLlm - pinecone-sparse-english-v0 - Learned sparse embedding model built on DeepImpact architecture, outperforming BM25 by up to 44% on TREC benchmarks for high-precision keyword search and hybrid retrieval. (Read more)
SparseEmbeddingsHybrid Search - Qwen3 Embedding - Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions. (Read more)
MultilingualOpen SourceEmbeddings - voyage-3-large - State-of-the-art general-purpose and multilingual embedding model from Voyage AI that ranks first across eight domains spanning 100 datasets, outperforming OpenAI and Cohere models by significant margins. (Read more)
EmbeddingsMultilingualApi - BGE Reranker Base - Open-source cross-encoder reranking model from BAAI that enhances RAG retrieval quality by examining query-document pairs individually. Self-hostable with Apache 2.0 licensing for cost-effective production deployments. (Read more)
RerankingOpen SourceRag - BGE-reranker-v2-m3 - Open-source multilingual reranking model from BAAI supporting 100+ languages with Apache 2.0 licensing, matching Cohere's latency on GPU with zero ongoing costs for production deployments. (Read more)
RerankingMultilingualOpen Source - Cohere Embed v3 - Commercial text embedding model from Cohere with multilingual support and 1,024-dimensional vectors. Optimized for semantic search and retrieval tasks. This is a commercial API service with pay-per-use pricing. (Read more)
CommercialEmbeddingApi - Cohere Embed v4 - Multilingual, multimodal enterprise embedding model supporting over 100 programming languages and primary business languages with advanced quantization for cost optimization. (Read more)
EmbeddingsMultilingualMultimodal - ColPali - Vision Language Model trained to produce high-quality multi-vector embeddings from document page images for efficient retrieval, eliminating need for OCR pipelines with ColBERT-style late interaction. (Read more)
MultimodalDocument RetrievalVision - E5 Embeddings - Open-source text embedding models from Microsoft supporting 100+ languages. Features small, base, and large variants with weakly-supervised contrastive pre-training. This is an OSS model family released by Microsoft Research. (Read more)
Open SourceMicrosoftMultilingual - E5-Mistral-7B-Instruct - Open-source embeddings model from Microsoft initialized from Mistral-7B-v0.1, achieving state-of-the-art BEIR score of 56.9 for English text embedding and retrieval tasks with 4096-dimensional vectors. (Read more)
EmbeddingsOpen SourceInstruction Based - Gemini Embedding 2 - Google's first natively multimodal embedding model that maps text, images, video, audio and documents into a single embedding space. Supports over 100 languages with flexible output dimensions using Matryoshka Representation Learning. (Read more)
MultimodalEmbeddingsGoogle - GTE Embeddings - General Text Embeddings from Alibaba DAMO Academy trained on large-scale relevance pairs. Available in three sizes (large, base, small) with GTE-v1.5 supporting 8192 context length. (Read more)
EmbeddingsOpen SourceMultilingual - INSTRUCTOR - Instruction-finetuned text embedding model that generates task-specific embeddings based on natural language instructions. One embedder for any task, achieving state-of-the-art results across 70 diverse datasets without additional fine-tuning. (Read more)
Instruction BasedEmbeddingsTask Specific - Jina ColBERT v2 - Groundbreaking multilingual information retrieval model supporting 89 languages with token-level embeddings and late interaction. Features Matryoshka embeddings for flexible efficiency-precision tradeoffs and 8192 token input context. (Read more)
EmbeddingMultilingualColbert - jina-embeddings-v5 - Jina AI's latest embedding model achieving the highest multilingual performance among models under 1B parameters with 71.7 average MTEB score and 67.7 MMTEB score. (Read more)
EmbeddingsMultilingualOpen Source - Llama-Embed-Nemotron-8B - Universal text embedding model from NVIDIA achieving state-of-the-art performance on MMTEB leaderboard, optimized for retrieval, reranking, semantic similarity, and classification with 4,096-dimensional embeddings. (Read more)
EmbeddingsMultilingualNvidia - Mixedbread AI - AI startup providing state-of-the-art embedding and reranking models through accessible APIs, offering both open-source and proprietary models optimized for various use cases. (Read more)
EmbeddingsRe RankingApi - ModernBERT Embed - Open-source embedding model from Nomic AI based on ModernBERT-base with 149M parameters. Supports 8192 token sequences and Matryoshka Representation Learning for 3x memory reduction. (Read more)
Open SourceEmbeddingsNlp - mxbai-embed-large - State-of-the-art large embedding model from Mixedbread AI, ranked first among similar-sized models, supporting Matryoshka Representation Learning and binary quantization with 700M+ training pairs. (Read more)
EmbeddingsOpen SourceMatryoshka - Nomic Embed Text v2 - Open-source multilingual embedding model using Mixture-of-Experts architecture, achieving excellent semantic performance with efficient inference and full offline support. (Read more)
EmbeddingsMultilingualOpen Source - nomic-embed-text-v2-moe - Multilingual MoE text embedding model excelling at multilingual retrieval with SoTA performance compared to ~300M parameter models, supporting ~100 languages with Matryoshka Embeddings trained on 1.6B pairs. (Read more)
EmbeddingsMultilingualLocal - Reranking Models - Cross-encoder models that rerank initial retrieval results for improved relevance. More accurate than bi-encoders but slower, typically applied to top-k candidates. (Read more)
RerankingCross EncoderRag - SPLADE - Learned sparse embedding model using BERT for term weighting and expansion. Outperforms BM25 in information retrieval by identifying semantic connections between words while maintaining interpretability. (Read more)
Sparse EmbeddingsLearnedInterpretable - text-embedding-3-large - OpenAI's flagship text embedding model with up to 3,072 dimensions, offering best-in-class performance and accuracy for English tasks with adjustable output sizes to optimize storage costs. (Read more)
OpenaiEmbeddingsApi - UForm - Pocket-sized multimodal AI for content understanding across multilingual texts, images, and video. Up to 5x faster than OpenAI CLIP with quantization-aware embeddings and support for 20+ languages. (Read more)
MultimodalEmbeddingsMultilingual - Voyage AI Embeddings - Commercial embedding models built for enterprise-grade semantic search and RAG applications. Features voyage-3 and voyage-3-large models with multimodal support. This is a commercial API service with usage-based pricing. (Read more)
CommercialEmbeddingMultimodal - Voyage Multimodal 3.5 - Next-generation multimodal embedding model built for retrieval over text, images, and videos, supporting Matryoshka embeddings with 4.56% higher accuracy than Cohere Embed v4 on visual document retrieval. (Read more)
MultimodalEmbeddingsVideo - voyage-4 - Latest Voyage AI embedding model family featuring shared embedding space with MoE architecture, supporting flexible output dimensions and advanced quantization options for cost optimization. (Read more)
EmbeddingsMultilingualQuantization - voyage-multimodal-3 - Voyage AI's first all-in-one multimodal embedding model supporting interleaved text and content-rich images including screenshots, PDFs, slide decks, tables, and figures. (Read more)
MultimodalEmbeddingsVisual Search
- Cascading Retrieval - Advanced retrieval approach combining dense vectors, sparse vectors, and reranking in a multi-stage pipeline, achieving up to 48% better performance than single-method retrieval. (Read more)
Hybrid SearchRagRetrieval - HNSW-IF - Hybrid billion-scale vector search method combining HNSW with inverted file indexes, enabling cost-efficient search by keeping centroids in memory while storing vectors on disk. (Read more)
HnswDisk BasedScalability - Matryoshka Embeddings - Representation learning approach encoding information at multiple granularities, allowing embeddings to be truncated while maintaining performance. Enables 14x smaller sizes and 5x faster search. (Read more)
EmbeddingsOptimizationResearch - RecursiveCharacterTextSplitter - LangChain's hierarchical text chunking strategy achieving 85-90% accuracy by recursively splitting using progressively finer separators to preserve semantic boundaries. (Read more)
ChunkingText ProcessingRag - Vector Index Comparison Guide (Flat, HNSW, IVF) - Comprehensive comparison of vector indexing strategies including Flat, HNSW, and IVF approaches. Covers performance characteristics, memory requirements, and use case recommendations for 2026. (Read more)
IndexingComparisonBest Practices - ACORN Algorithm - Performant and predicate-agnostic search algorithm for vector embeddings with structured data. Uses two-hop graph expansion to maintain high recall under selective filters in Weaviate. (Read more)
AnnGraph BasedFiltering - Approximate Nearest Neighbors (ANN) - Family of algorithms trading perfect accuracy for speed in high-dimensional similarity search. Enables sub-linear query time with 90%+ recall on billion-scale datasets. (Read more)
AlgorithmAnnSearch - Ball-Tree - Tree-based spatial data structure organizing vectors using spherical regions instead of axis-aligned splits, making it better suited for high-dimensional data compared to KD-trees. (Read more)
Tree BasedIndexingHigh Dimensional - Binary Quantization - Extreme vector compression technique converting each dimension to a single bit (0 or 1), achieving 32x memory reduction and enabling ultra-fast Hamming distance calculations with acceptable accuracy trade-offs. (Read more)
QuantizationCompressionOptimization - BM25 - Best Matching 25 ranking function for information retrieval that ranks documents based on query term frequency with length normalization. Core component of hybrid search RAG systems combining keyword and semantic search. (Read more)
Information RetrievalRankingKeyword Search - BM25 (Okapi BM25) - Probabilistic ranking function for estimating document relevance to search queries. Industry standard for keyword search, combining term frequency, rarity, and length normalization into a single scoring model. (Read more)
RankingInformation RetrievalKeyword Search - BM42 - Experimental sparse embedding approach combining exact keyword search with transformer intelligence, integrating sparse and dense vector searches for improved RAG results, developed by Qdrant. (Read more)
SparseHybrid SearchExperimental - Chunk Overlap Strategy - Text chunking technique using 10-20% overlap between consecutive chunks to preserve context continuity and prevent information loss at chunk boundaries for improved retrieval. (Read more)
ChunkingRagText Processing - Chunking Strategies for RAG - Methods for splitting documents into optimal pieces for vector embedding and retrieval. Includes fixed-size, recursive, semantic, and agentic chunking approaches. (Read more)
RagDocument ProcessingChunking - Context Precision - RAG evaluation metric assessing retriever's ability to rank relevant chunks higher than irrelevant ones, measuring context relevance and ranking quality for optimal retrieval. (Read more)
RagEvaluationMetrics - Context Recall - RAG evaluation metric measuring whether retrieved context contains all information required to produce ideal output, assessing completeness and sufficiency of retrieval. (Read more)
RagEvaluationRetrieval - Context Window - Maximum number of tokens an embedding model or LLM can process in a single input. Critical parameter for vector databases affecting chunk sizes, with modern models supporting 512 to 32,000+ tokens for long-document understanding. (Read more)
LlmEmbeddingsArchitecture - Contextual Retrieval - Anthropic's RAG technique that prepends chunk-specific explanatory context before embedding, reducing failed retrievals by 49% (67% with reranking). Uses Contextual Embeddings and Contextual BM25. (Read more)
RagRetrievalContext - Cosine Similarity - Fundamental similarity metric for vector search measuring the cosine of the angle between vectors. Range from -1 to 1, with 1 indicating identical direction regardless of magnitude. (Read more)
SimilarityDistance MetricVector Search - Cross-Encoder - Neural reranking architecture that examines full query-document pairs simultaneously for deeper semantic understanding, achieving higher accuracy than bi-encoders at the cost of computational efficiency. (Read more)
RerankingNeural NetworksNlp - Dot Product - Vector similarity metric measuring both directional similarity and magnitude of vectors. Used by many LLMs for training and equivalent to cosine similarity for normalized data. Reports both angle and magnitude information. (Read more)
SimilarityDistance MetricLlm - Dot Product (Inner Product) - Similarity metric computing sum of element-wise products between vectors. Efficient for normalized vectors, equivalent to cosine similarity when vectors are unit length. (Read more)
SimilarityDistance MetricVector Search - Dot Product Similarity - Vector similarity metric combining both angle and magnitude information for comprehensive similarity measurement, equivalent to cosine similarity when vectors are normalized. (Read more)
Similarity SearchMetricsAlgorithm - Embedding Dimensionality - The size of vector embeddings, typically ranging from 384 to 4096 dimensions. Higher dimensions capture more information but increase storage, compute, and latency costs. (Read more)
EmbeddingsOptimizationDimensions - Embedding Fine-Tuning - Process of adapting pre-trained embedding models to specific domains or tasks for improved performance. Techniques include supervised fine-tuning, contrastive learning, and domain adaptation to optimize embeddings for particular use cases. (Read more)
EmbeddingsFine TuningMachine Learning - Embedding Models Overview - Neural networks that convert text, images, or other data into dense vector representations. Enable semantic understanding by mapping similar concepts to nearby points in vector space. (Read more)
EmbeddingsModelsNeural Networks - Euclidean Distance - Straight-line distance metric between vectors in multidimensional space, sensitive to both magnitude and direction, ideal when embedding magnitude carries important information. (Read more)
Similarity SearchMetricsAlgorithm - Euclidean Distance (L2 Distance) - Distance metric measuring straight-line distance between vectors in multi-dimensional space. Lower values indicate higher similarity, with 0 meaning identical vectors. (Read more)
Distance MetricSimilarityVector Search - Faithfulness - RAG evaluation metric measuring whether generated answers accurately align with retrieved context without hallucination, ensuring factual grounding of LLM responses. (Read more)
RagEvaluationLlm - Filtered Vector Search - Combining vector similarity search with metadata filtering. Enables queries like find similar documents published after 2023 in category Technology. (Read more)
FilteringMetadataHybrid Search - Filtered Vector Search Guide - Complete guide to metadata filtering in vector search covering pre-filtering, post-filtering, and hybrid approaches. Addresses the Achilles heel of vector search with modern solutions. (Read more)
FilteringMetadataBest Practices - Hamming Distance - Distance metric for binary vectors counting the number of positions at which corresponding bits differ, computed efficiently using XOR and popcount operations for ultra-fast similarity search. (Read more)
Distance MetricBinarySimilarity - HCNNG - Hierarchical Clustering-based Nearest Neighbor Graph using MST to connect dataset points through multiple hierarchical clusters. Performs efficient guided search instead of traditional greedy routing. (Read more)
AnnGraph BasedClustering - HNSW (Hierarchical Navigable Small World) - Graph-based approximate nearest neighbor algorithm with logarithmic search complexity. Industry standard for high-dimensional vector search with excellent recall-speed tradeoff. (Read more)
AlgorithmGraph BasedAnn - Hybrid Search - Search approach combining keyword-based (BM25) and semantic (vector) search for best of both worlds. Uses fusion techniques like RRF to merge results. (Read more)
SearchHybridFusion - Hybrid Search Best Practices - Comprehensive guide to combining BM25 keyword search with vector semantic search using reciprocal rank fusion and reranking. Essential pattern for production RAG systems in 2026. (Read more)
Hybrid SearchRagBest Practices - IVF - Inverted File Index vector search algorithm that partitions high-dimensional vectors into clusters using k-means, enabling efficient nearest neighbor search by restricting searches to relevant clusters and dramatically reducing search space. (Read more)
AlgorithmIndexingAnn - IVF (Inverted File Index) - Clustering-based approximate nearest neighbor algorithm that partitions vector space into Voronoi cells. Fast search through coarse-to-fine strategy, often combined with Product Quantization (IVF-PQ). (Read more)
AlgorithmClusteringAnn - IVF-FLAT - Inverted File index with FLAT (uncompressed) vectors, partitioning the vector space into clusters with centroids, offering a balance between search speed and accuracy for approximate nearest neighbor search. (Read more)
IndexingIvfClustering - IVF-FLAT Index - Inverted File Index with flat vectors using K-means clustering to partition high-dimensional space into regions, enhancing search efficiency by narrowing search area through neighbor partitions. (Read more)
IndexingAlgorithmAnn - IVF-PQ (Inverted File with Product Quantization) - Vector indexing method combining inverted file index with product quantization for memory-efficient search. Reduces storage from 128x4 bytes to 32x1 bytes (1/16th) while maintaining search quality. (Read more)
QuantizationIndexingCompression - KD-Tree - Tree-based data structure for organizing vectors through recursive axis-aligned partitioning, enabling logarithmic time complexity searches for balanced data but struggling with high-dimensional spaces. (Read more)
Tree BasedIndexingData Structure - Late Interaction - Retrieval paradigm where query and document tokens are encoded separately and interactions computed at search time, combining efficiency of bi-encoders with expressiveness of cross-encoders. (Read more)
RetrievalColbertNeural Search - Locality Sensitive Hashing (LSH) - Algorithmic technique for approximate nearest neighbor search in high-dimensional spaces using hash functions to map similar items to the same buckets with high probability. (Read more)
HashingAnnAlgorithm - Manhattan Distance - Vector distance metric calculating the sum of absolute differences between vector components. Measures grid-like distance and is robust to outliers, with faster calculation as data dimensionality increases. (Read more)
SimilarityDistance MetricHigh Dimensional - Matryoshka Representation Learning - Training technique enabling flexible embedding dimensions by learning representations where truncated vectors maintain good performance, achieving 75% cost savings when using smaller dimensions. (Read more)
EmbeddingsOptimizationMachine Learning - MaxSim - Maximum Similarity late interaction function introduced by ColBERT for ranking. Calculates cosine similarity between query and document token embeddings, keeping maximum score per query token for highly effective long-document retrieval. (Read more)
ColbertRankingLate Interaction - MaxSim Operator - Scoring function used in late interaction models like ColBERT that computes query-document relevance by finding maximum similarity between each query token and document tokens, then summing. (Read more)
Late InteractionColbertRanking - MSTG (Multi-Stage Tree Graph) - Hierarchical vector index developed by MyScale overcoming IVF limitations through multi-layered design, creating multiple layers unlike IVF's single layer of cluster vectors for improved search performance. (Read more)
IndexingTree BasedHierarchical - Multimodal Embeddings - Vector representations mapping different data types (text, images, audio, video) into a shared embedding space. Enables cross-modal search and understanding. (Read more)
MultimodalEmbeddingsCross Modal - NSW (Navigable Small World) - Graph-based algorithm for approximate nearest neighbor search where vertices represent vectors and edges are constructed heuristically. Foundation for HNSW with (poly/)logarithmic search complexity using greedy routing. (Read more)
AnnGraph BasedAlgorithm - Product Quantization (PQ) - Vector compression technique that splits high-dimensional vectors into subvectors and quantizes each independently, achieving significant memory reduction while enabling approximate similarity search. (Read more)
QuantizationCompressionOptimization - Product Quantization Compression - Lossy vector compression dividing vectors into subvectors for independent quantization. Achieves 8-64x storage reduction while enabling fast approximate distance computation via lookup tables. (Read more)
CompressionQuantizationPq - RAG (Retrieval-Augmented Generation) - AI technique combining information retrieval with LLM generation. Retrieves relevant context from knowledge base before generating responses, reducing hallucinations and enabling grounded answers. (Read more)
RagLlmRetrieval - Reciprocal Rank Fusion - Method for combining ranked lists from multiple retrieval systems in hybrid search. Standard technique in RAG pipelines for fusing BM25 and dense vector results before reranking, creating diverse high-confidence candidate sets. (Read more)
Hybrid SearchRankingFusion - Reciprocal Rank Fusion (RRF) - Hybrid search algorithm combining results from multiple ranking systems by computing reciprocal ranks, commonly used to merge dense vector search with sparse keyword search for improved retrieval. (Read more)
Hybrid SearchRankingFusion - Reranking - Two-stage retrieval pattern where initial candidates from vector/keyword search are re-scored using more sophisticated models. Combines fast initial retrieval with accurate final ranking using cross-encoders or ColBERT for 15-40% accuracy improvements. (Read more)
RagRankingRetrieval - Scalar Quantization - Vector compression technique reducing precision of each vector component from 32-bit floats to 8-bit integers, achieving 4x memory reduction with minimal accuracy loss for vector search. (Read more)
QuantizationCompressionOptimization - Semantic Caching - AI caching pattern that stores vector embeddings of LLM queries and responses, serving cached results when new queries are semantically similar. Cuts LLM costs by 50%+ with millisecond response times versus seconds for fresh calls. (Read more)
CachingOptimizationLlm - Semantic Chunking - Advanced text splitting technique using embeddings to divide documents based on semantic content instead of arbitrary positions, preserving cohesive ideas within chunks for improved RAG performance. (Read more)
ChunkingRagText Processing - Semantic Search - Search technique understanding meaning and context rather than exact keyword matching. Uses vector embeddings to find semantically similar content even with different wording. (Read more)
SearchEmbeddingsSemantics - TreeAH - Vector index type based on Google's ScaNN algorithm combining tree-like structure with Asymmetric Hashing quantization, optimized for batch queries with 10x faster index generation and smaller memory footprint. (Read more)
IndexingQuantizationGoogle - Vamana - Graph-based indexing algorithm powering Microsoft's DiskANN. Uses flat graph structure with minimized search diameter for efficient disk-based nearest neighbor search with 40x GPU speedup available via NVIDIA cuVS. (Read more)
AnnGraph BasedAlgorithm - Vector Database Backup and Recovery Guide - Best practices for backup and disaster recovery in vector databases. Covers full/incremental backups, replication strategies, and cloud-native approaches for safeguarding high-dimensional embeddings. (Read more)
BackupDisaster RecoveryBest Practices - Vector Database Cost Optimization Guide - Comprehensive strategies for reducing vector database costs including storage management, compute optimization, and monitoring. Covers cloud pricing trends and hidden costs in 2026. (Read more)
Cost OptimizationCloudBest Practices - Vector Database Monitoring - Observability practices for vector databases including query latency, recall metrics, storage utilization, and index health monitoring. (Read more)
MonitoringObservabilityOperations - Vector Database Performance Tuning Guide - Comprehensive guide covering index optimization, quantization, caching, and parameter tuning for vector databases. Includes techniques for balancing performance, cost, and accuracy at scale. (Read more)
PerformanceOptimizationBest Practices - Vector Database Sharding - Distributing vector data across multiple nodes for horizontal scaling. Enables handling billions of vectors by partitioning data and parallelizing queries. (Read more)
ShardingScalabilityDistributed - Vector Database Sharding Strategies - Comprehensive guide to sharding approaches for distributed vector databases including range-based, hash-based, geographic, and vector-aware clustering methods for horizontal scaling. (Read more)
ScalabilityDistributedSharding - Vector Database Use Cases - Applications of vector databases across industries including semantic search, RAG systems, recommendations, anomaly detection, and multimodal search. (Read more)
Use CasesApplicationsAi - Vector Dimensionality - Number of components in an embedding vector, typically ranging from 128 to 4096 dimensions. Higher dimensions can capture more information but increase storage, computation, and costs. Critical design parameter for vector databases. (Read more)
EmbeddingsOptimizationArchitecture - Vector Index Types - Overview of indexing structures for approximate nearest neighbor search including HNSW (graph-based), IVF (clustering), LSH (hashing), and tree-based approaches. (Read more)
IndexingAlgorithmsAnn - Vector Quantization Techniques - Methods for compressing vector embeddings to reduce storage and memory costs. Includes scalar quantization, product quantization, and binary quantization with varying compression-accuracy tradeoffs. (Read more)
CompressionOptimizationCost Reduction - Vector Similarity Search - Finding nearest vectors in high-dimensional space based on distance or similarity metrics. Core operation of vector databases enabling semantic search, recommendations, and RAG. (Read more)
SimilaritySearchVectors
- Data Cloud Vector Database - Built into the Salesforce platform, Data Cloud Vector Database ingests various large datasets from customer interactions, classifies and organizes unstructured data, and merges it with structured data to enrich customer profiles and store as metadata in Data Cloud. It enhances generative AI by providing more relevant, accurate, and up-to-date responses through improved data retrieval and semantic search capabilities. (Read more)
EnterpriseCloud Nativevector database - Instaclustr - Instaclustr offers comprehensive managed services for vector databases, handling deployment, configuration, ongoing maintenance, tuning, optimization, scalability, security, and data protection. This allows organizations to offload the complexities of managing their vector database infrastructure and focus on their core business objectives. (Read more)
Managed ServiceCloudEnterprise - Qdrant Vector Database - Qdrant is an open‑source vector database designed for high‑performance similarity search and AI applications such as RAG, recommendation systems, advanced semantic search, anomaly detection, and AI agents. It provides scalable storage and retrieval of vector embeddings with features like filtering, hybrid search, and production‑grade APIs for integrating with machine learning workloads. (Read more)
vector databaseOpen SourceHybrid Search - Qwak - A platform designed to simplify the building, management, and deployment of Large Language Model (LLM) applications, enabling rapid operationalization of context-aware LLMs and offering integration with its Vector Store. (Read more)
MLOpsLlmplatform - vector engine for OpenSearch Serverless - An on-demand serverless configuration for OpenSearch Service that simplifies the operational complexities of managing OpenSearch domains, integrated with Knowledge Bases for Amazon Bedrock to support generative AI applications. (Read more)
Cloud NativeServerlessOpenSearch - Aerospike - A multi-model AI database designed for high-throughput vector processing at scale, supporting real-time AI use cases with a patented Hybrid Memory Architecture and efficient infrastructure usage, capable of handling large volumes of data and concurrent users. (Read more)
multi-modelReal TimeScalable - AllegroGraph - A database that incorporates neuro-symbolic AI and offers a managed service (AllegroGraph Cloud) for neuro-symbolic AI knowledge graphs, indicating its relevance to advanced AI applications, likely including vector capabilities. (Read more)
Graph DatabaseAiKnowledge Graph - Amazon Web Services Vector Search - AWS has introduced vector search in several of its managed database services, including OpenSearch, Bedrock, MemoryDB, Neptune, and Amazon Q, making it a comprehensive platform for vector search solutions. (Read more)
Cloud NativeVector SearchManaged ServiceEnterprise - Apache Cassandra - Apache Cassandra is a distributed NoSQL database that is adding native support for high-dimensional vector storage and approximate nearest neighbor search, making it a scalable choice for AI and vector search workloads. (Read more)
NosqlDistributedVector SearchScalable - AstraDB - AstraDB (also known as Astra DB by DataStax) is a cloud-native vector database built on Apache Cassandra, supporting real-time AI applications with scalable vector search. It is designed for large-scale deployments and features a user-friendly Data API, robust vector capabilities, and automation for AI-powered applications. (Read more)
Cloud NativeVector SearchScalableAi - Blaze - An emerging solution diversifying the options available to data engineers in the vector database landscape. (Read more)
vector databaseemergingdata engineering - ChromaDB - ChromaDB (also known as Chroma or chroma-core) is an open-source vector database focused on LLM applications, emphasizing simplicity and in-memory HNSW-based dense vector search. It is suited for prototyping, metadata filtering, and offers a user-friendly interface for building and testing vector search applications, though it currently lacks hybrid and distributed features. (Read more)
Open SourceIn MemoryVector SearchLlm - citrus - A distributed vector database designed for scalable and efficient vector similarity search. It is purpose-built for handling large-scale vector data and search workloads. (Read more)
Open SourceDistributedVector SearchScalable - ClickHouse - ClickHouse is an open-source column-oriented database that supports vectorized computation and now offers vector search features. Its architecture enables efficient real-time analytics and vector operations, making it a relevant choice for vector database use cases. (Read more)
Open SourceAnalyticsVector SearchReal Time - Cottontail DB - Cottontail DB is an open-source vector database for storing and searching high-dimensional data, with features geared towards research and production environments. (Read more)
Open Sourcevector databasesHigh DimensionalVector Search - DataFusion - A general-purpose analytical engine with built-in vector processing capabilities, excelling at traditional analytical workloads and efficient handling of vector operations. It is an example of a vector engine. (Read more)
analytical enginevector processingOpen Source - Datastax - Datastax offers a vector search solution integrated with its database platform, enabling approximate similarity search and hybrid queries for enterprise use cases. (Read more)
EnterpriseVector SearchHybrid SearchSimilarity Search - Deep Lake - Deep Lake is a vector database designed as a data lake for AI, capable of storing and managing vector embeddings, text, images, and videos. It utilizes a tensor format for efficient querying and integration with AI algorithms, making it suitable for similarity search and machine learning workflows. It is open-source and tailored for handling unstructured and multimodal data, with seamless integration with frameworks like PyTorch and TensorFlow. (Read more)
Open SourceVector SearchAiMultimodal - Elasticsearch - Elasticsearch is a distributed search engine supporting various data types, including vectors, and provides scalable vector search capabilities, making it a popular choice for modern AI-powered applications. It can be extended with the k-NN plugin to provide scalable vector search using HNSW and Lucene, enabling hybrid semantic and keyword search capabilities. (Read more)
Open SourceVector SearchHybrid SearchScalable - Google Cloud Vertex AI Vector Search - Google Cloud Platform offers vector search as part of its Vertex AI suite, enabling scalable and integrated vector search capabilities for AI-driven applications. (Read more)
Cloud NativeVector SearchAiScalable - Google Vertex AI - Google Vertex AI offers managed vector search capabilities as part of its AI platform, supporting hybrid and semantic search for text, image, and other embeddings. (Read more)
Managed ServiceVector SearchHybrid SearchSemantic SearchCloud Native - HAKES - HAKES is a system designed for efficient data search using embedding vectors at scale, making it a relevant solution for vector database applications. (Read more)
Vector SearchScalableEmbeddings - Infinity - Infinity is an AI-native database built for LLM applications, offering fast hybrid search of dense vectors, sparse vectors, tensors, and full-text data. (Read more)
AiLlmHybrid SearchVector Search - JaguarDB - JaguarDB is a database solution, identified as a vector database in the context of the provided research. (Read more)
vector databaseCommercialHigh Performance - KDB - KDB is a high-performance vector database supporting billion-scale vector search, with features aimed at enterprises needing large-scale vector storage and retrieval. (Read more)
EnterpriseScalableVector SearchHigh Performance - KDB.AI - KDB.AI is a proprietary vector database and search engine designed for real-time AI applications. It offers advanced vector search, integrates with popular ML tools, and supports temporal and semantic context for embeddings. KDB.AI Server is a high-performance vector database and search engine from KX, designed for real-time analytics and AI applications requiring rapid similarity search. (Read more)
proprietaryReal TimeAiVector Search - LanceDB - LanceDB is a columnar vector database optimized for real-time AI use cases and analytics workloads, providing efficient vector storage and fast similarity search. (Read more)
Vector SearchReal TimeAnalyticsAi - Manu - A cloud-native vector database management system designed for efficient storage and retrieval of vector embeddings. Directly relevant as a vector database platform. (Read more)
vector databasesCloud NativeVector SearchScalable - Marqo - Marqo is an open-source neural search engine that leverages vector representations to enable semantic search over textual data. It abstracts vector database complexity and provides a high-level interface for building advanced search applications. (Read more)
Open SourceSemantic SearchVector SearchAi - Microsoft Azure AI Search - Azure AI Search provides vector search capabilities as a managed service, supporting approximate KNN, hybrid search, and integration with other Azure AI tools. (Read more)
Managed ServiceVector SearchHybrid SearchCloud Native - Microsoft Azure Vector Database - Microsoft Azure offers vector search support across multiple database services, enabling developers to leverage vector search in cloud-native and enterprise scenarios. (Read more)
Cloud NativeVector SearchEnterpriseScalable - Milvus - Milvus is a mature, open-source vector database maintained by Zilliz, supporting large-scale similarity search with multiple indexing strategies and GPU acceleration. It includes variants such as Milvus Lite (lightweight version), Milvus Standalone (single-machine deployment), and Milvus Distributed (Kubernetes-based deployment for large scale). (Read more)
Open SourceVector SearchScalableGpu Acceleration - Milvus Distributed - Milvus Distributed is a horizontally scalable, distributed deployment of the Milvus vector database designed for enterprise workloads, offering high reliability and the ability to handle billions of vectors with a comprehensive management toolkit. (Read more)
DistributedScalablevector database - Milvus Standalone - Milvus Standalone is a single-machine deployment option of the Milvus vector database that provides a complete, production-ready vector search engine suitable for datasets up to millions of vectors. (Read more)
vector databasesingle-nodeSimilarity Search - MongoDB - MongoDB is a general-purpose database that now includes vector search capabilities, enabling light vector workloads alongside traditional database functionality. MongoDB Atlas, the managed cloud offering, includes vector search built on Lucene, supporting ANN queries and hybrid search. MongoDB Atlas Search integrates powerful vector search capabilities directly within MongoDB. (Read more)
Vector SearchHybrid SearchNosqlManaged Service - MongoDB Atlas Vector Search - A vector search capability integrated within MongoDB Atlas, enabling vector-based retrieval and similarity search over unstructured data. Relevant for users seeking vector search in a popular database platform. MongoDB Vector Search is an integrated feature in MongoDB Atlas that enables efficient vector-based search within a comprehensive document database, supporting up to 2,048 dimensions and hybrid search capabilities. (Read more)
Cloud NativeVector Searchdocument databaseHybrid Search - MyScale - A relational database engine extended with native vector search capabilities, allowing for scalable and efficient similarity search in combination with SQL queries. (Read more)
Vector SearchSqlScalableHybrid Search - Neo4j - Neo4j is a graph database that has added vector search capabilities, providing unique and effective approaches for retrieval augmented generation (RAG) and other AI applications. (Read more)
Graph DatabaseVector SearchRagAi - NucliaDB - NucliaDB is a commercial vector database that enables semantic and vector search across unstructured data, supporting advanced AI and ML-powered applications. (Read more)
CommercialVector SearchSemantic SearchAi - ObjectBox - A high-performance embedded database for edge devices and mobile, offering vector search capabilities for AI applications. (Read more)
EmbeddedEdgeMobile - OpenSearch - OpenSearch is a fully open-source, community-driven search and analytics suite that supports vector search, providing a transparent and flexible alternative for organizations seeking advanced search features. (Read more)
Open SourceVector SearchAnalyticsScalable - Oracle Database Vector Search - Oracle's core database now includes vector search capabilities, enabling enterprises to perform scalable vector queries natively as part of their data management workflows. Oracle includes vector search capabilities in its database platform, supporting approximate KNN and hybrid search for enterprise-scale use cases. (Read more)
EnterpriseVector SearchHybrid SearchKNN - orama - Orama is a lightweight search engine that supports vector and hybrid search functionalities, suitable for browser, server, or edge environments. (Read more)
Open SourceVector SearchHybrid SearchLightweight - Photon Engine - A general-purpose analytical engine with built-in vector processing capabilities, excelling at traditional analytical workloads and efficient handling of vector operations. It is an example of a vector engine. (Read more)
analytical enginevector processingPerformance - Qdrant - Qdrant is a dedicated vector database and similarity search engine supporting advanced filtering and efficient retrieval, suitable for faceted search and retrieval-augmented generation. It offers self-hosted and cloud deployment options, making it highly relevant for vector search applications. (Read more)
Open SourceVector SearchSimilarity SearchRag - Quokka - An emerging solution diversifying the options available to data engineers in the vector database landscape. (Read more)
vector databaseemergingdata engineering - Qwak Vector Store - Qwak provides a vector store solution engineered for optimized storage and querying of vector embeddings, offering efficient search capabilities, high performance, scalability, and data retrieval by identifying similarities among data points. (Read more)
vector storeScalableEmbeddings - Redis - Redis, while primarily an in-memory data store, offers vector search capabilities through its RediSearch and RedisAI modules, enabling vector similarity searches and deep learning model management for existing Redis users. With the RediSearch module, Redis extends its functionality to support native vector search, indexing, and hybrid queries, making it suitable for real-time AI and semantic search applications. (Read more)
Vector SearchIn MemoryHybrid SearchReal Time - seekdb - seekdb is OceanBase’s experimental vector database component for high-performance nearest neighbor search over embedding vectors. (Read more)
Annvector databaseHigh Performance - Solr - Solr is a mature open-source search engine that has incorporated vector search capabilities, making it relevant for enterprises looking to implement vector-based search alongside traditional keyword search. (Read more)
Open SourceVector SearchHybrid SearchEnterprise - tinyvector - tinyvector is a minimal vector database / ANN engine focused on simplicity and compact implementation for educational and small-scale similarity search uses. (Read more)
AnnSimilarity SearchLightweight - Transwarp Hippo - Transwarp Hippo is an enterprise-grade, cloud-native distributed vector database designed for scalable vector operations, including similarity search and clustering, targeting massive datasets and real-time recommendation systems. (Read more)
EnterpriseCloud NativeDistributedVector Search - Trieve - Trieve provides an all-in-one infrastructure for vector search, recommendations, retrieval-augmented generation (RAG), and analytics, accessible via API for seamless integration. (Read more)
Open SourceVector SearchRagAnalytics - Typesense - Typesense is an open-source search engine that supports hybrid search, including vector search capabilities, providing an alternative to proprietary vector search solutions. (Read more)
Open SourceHybrid SearchVector Searchfull-text search - Vald - Vald is an open-source, highly scalable distributed vector search engine known for its asynchronous auto-indexing and ability to efficiently handle large-scale vector data in real time, making it suitable for demanding vector search applications. (Read more)
Open SourceDistributedScalableReal Time - vearch - Vearch is a distributed vector search engine designed for AI-native applications, enabling scalable and efficient similarity search across large datasets. (Read more)
Open SourceDistributedVector SearchAi - Vector Databases - A critical emerging technology focused on processing, storing, and retrieving vast amounts of high-dimensional vector data rapidly and efficiently. Unlike traditional databases, they offer unique advantages for use cases such as image and video recognition, natural language processing (NLP), and Retrieval-Augmented Generation (RAG). (Read more)
vector databasesAiRag - Vector.ai - Vector.ai offers commercial vector database solutions for efficient high-dimensional similarity search and machine learning applications. (Read more)
CommercialVector SearchMachine LearningSimilarity Search - Vespa.ai - Vespa.ai is a scalable open-source platform for real-time big data serving and vector search. It supports vector similarity search and is used for applications like retrieval augmented generation and e-commerce search, making it highly relevant for vector database and vector search use cases. (Read more)
Open SourceVector SearchReal TimeScalable - Vexvault - Vexvault is an open-source vector database designed for efficient storage, management, and similarity search of high-dimensional vector data. (Read more)
Open Sourcevector databasesSimilarity SearchHigh Dimensional - Weaviate - Weaviate is an open-source, cloud-native vector database that supports fast semantic search, modular extensions, and graph-like querying, making it an ideal solution for building scalable, modern vector search applications. (Read more)
Open SourceCloud NativeSemantic SearchScalable - Zilliz Cloud - Zilliz Cloud is a fully managed vector database service powered by Milvus, offering hassle-free deployment, scalability, and high performance for vector search applications. (Read more)
Cloud NativeManaged ServiceVector SearchMilvus
- AlloyDB - Google Cloud's fully managed, PostgreSQL-compatible database service that offers vector capabilities, leveraging the power of PostgreSQL and pgvector for AI applications. (Read more)
Managed ServicePostgresqlCloud - Azure Database for PostgreSQL - Microsoft Azure's managed service for PostgreSQL, which supports the pgvector extension, enabling robust vector database capabilities in the cloud for AI and machine learning workloads. (Read more)
Managed ServiceCloud NativePostgresql - DataRobot Vector Databases - The DataRobot vector databases feature provides FAISS-based internal vector databases and connections to external vector databases such as Pinecone, Elasticsearch, and Milvus. It supports creating and configuring vector databases, adding internal and external data sources, versioning internal and connected databases, and registering and deploying vector databases within the DataRobot AI platform to power retrieval-augmented generation and other AI use cases. (Read more)
vector databasesRagManaged Service - Pinecone - Pinecone is a fully managed vector database designed for high‑performance semantic search and AI applications. It provides scalable, low-latency storage and retrieval of vector embeddings, allowing developers to build semantic search, recommendation, and RAG (Retrieval-Augmented Generation) systems without managing infrastructure. (Read more)
Managed Servicevector databaseSemantic Search - Amazon DocumentDB (with MongoDB compatibility) - An AWS document database service compatible with MongoDB, identified as a great choice for vector database needs. (Read more)
Managed Servicedocument databaseMongoDB - Amazon RDS for PostgreSQL - A managed relational database service from AWS that can host PostgreSQL, including specific community versions, and is a suitable choice for deploying the pgvector extension for vector storage. (Read more)
Managed ServiceCloud NativePostgresql - Aurora PostgreSQL-Compatible - An AWS database service compatible with PostgreSQL, identified as a great choice for vector database needs. (Read more)
Managed ServiceCloud NativePostgresql - Azure Cosmos DB - A vector database solution provided by Microsoft Azure. (Read more)
Managed ServiceCloud NativeAzure - Cloudflare Vectorize - Cloudflare Vectorize is a managed vector database/indexing service integrated with Cloudflare Workers AI. It stores and searches high-dimensional vector embeddings (such as text embeddings) using configurable dimensions and distance metrics like cosine and euclidean, automatically handling index optimization and regeneration when new data is inserted. (Read more)
Managed Servicevector databaseCloud Native - DataRobot Vector Database - DataRobot Vector Database is a managed vector store capability within the DataRobot AI Platform that allows users to create, register, deploy, and update vector databases for AI workloads, including RAG and semantic search. It integrates with NVIDIA NIM embeddings and supports both built-in and bring-your-own embeddings for building production-grade vector search solutions. (Read more)
Managed ServiceRagSemantic Search - DataRobot Vector Databases (GenAI) - A premium vector database capability within the DataRobot Generative AI platform that stores chunked unstructured text and their embeddings for retrieval-augmented generation (RAG). Users can create vector database objects, connect supported data sources from the DataRobot Data Registry, configure embeddings and chunking, and attach these vector databases to LLM blueprints in the playground to ground model responses in proprietary data. (Read more)
Ragvector storeEnterprise - Instaclustr for Managed Apache Cassandra 5.0 - A managed service offering Apache Cassandra 5.0, which can be utilized as a vector database for AI applications. (Read more)
Managed ServiceCassandraNosql - Instaclustr for PostgreSQL - A managed service for PostgreSQL that includes support for pgvector, enabling PostgreSQL to function as a vector database for AI workloads. (Read more)
Managed ServicePostgresqlAi - Weaviate Cloud - Weaviate Cloud is the fully managed cloud deployment of the Weaviate vector database, providing a hosted environment for building and operating AI applications with scalable vector search, without managing infrastructure. (Read more)
Managed ServiceCloud NativeVector Search
- MongoDB Vector Search - MongoDB Vector Search turns MongoDB into a full-featured vector database, enabling approximate and exact nearest neighbor search over vector embeddings stored alongside operational data. It supports semantic similarity search, retrieval-augmented generation (RAG) for AI applications, and lets you combine vector search with full‑text search and structured filters in the same query. Available on supported MongoDB Atlas clusters, it integrates with popular AI frameworks and services for building intelligent, agentic systems. (Read more)
- Survey of Vector Database Management Systems - A comprehensive 2023 survey that systematically analyzes the design, architecture, indexing techniques, and system implementations of modern vector database management systems, serving as a foundational reference for understanding the vector database ecosystem used in AI applications. (Read more)
- Vector DB Feature Matrix - A collaboratively maintained Google Sheets matrix comparing features, capabilities, and characteristics of many vector databases and approximate nearest neighbor libraries, useful for selecting solutions for AI and similarity search applications. (Read more)
- Algolia Vector Search - Algolia’s vector search capability that augments its search-as-a-service platform with semantic and similarity search using embeddings. (Read more)
- Alibaba Cloud OpenSearch Vector Search - Alibaba Cloud’s OpenSearch service with vector search support for semantic retrieval and intelligent search applications. (Read more)
- Awesome papers and technical blogs on vector DB - A curated collection of papers and technical blogs focused on vector databases, semantic-based vector search, and approximate nearest neighbor search (ANN Search). These resources are essential for understanding and building large-scale information retrieval systems and vector databases. (Read more)
vector databasesResearchblogsAnnSemantic Search - Awesome Vector Databases - A curated list of vector database solutions, libraries, and resources tailored for AI applications. Categorizes items by license and type, providing a valuable directory for those seeking vector database technologies. (Read more)
awesome listresourcesvector databasesOpen Source - awesome-vector-database - A curated awesome list compiling resources, tools, vector databases, and research relevant to vector search and storage. Serves as a meta-resource for exploring the vector database ecosystem. (Read more)
vector databasesresourcestoolsawesome list - awesome-vector-databases-data - A data repository that powers the 'Awesome Vector Databases' curated list, collecting structured information about vector database solutions, libraries, and resources for AI applications. Directly supports the discovery and categorization of vector database tools. (Read more)
resourcesawesome listvector databasesOpen Source - awesome-vector-search - A curated collection of libraries, services, and research papers focused on vector search, including vector database technologies and related resources. (Read more)
Vector Searchlibrariesresourcespapers - Chroma - Chroma is an open-source AI-native vector database that provides semantic, full-text, and regex search as a memory layer for LLM and RAG applications. (Read more)
- Databricks Vector Search - Databricks Vector Search is a managed vector search capability in Databricks that lets you create and maintain vector search indexes over Delta tables. It supports multiple modes for providing vector embeddings, including Databricks-computed embeddings (Delta Sync Index with managed embeddings), self-managed precomputed embeddings (Delta Sync Index with self-managed embeddings), and Direct Vector Access Index where clients directly manage vector updates via REST APIs. It is designed for AI and RAG-style applications built on top of the Databricks Lakehouse, enabling similarity search with metadata filters and tight integration with Unity Catalog and Delta Lake. (Read more)
- Efficient Multi-vector Dense Retrieval with Bit Vectors (emvb) - emvb is an open-source implementation of the "Efficient Multi-vector Dense Retrieval with Bit Vectors" method, providing a specialized vector-search index for multi-vector dense retrieval using compact bit-vector representations to accelerate ANN search and reduce memory usage in vector database and retrieval systems. (Read more)
- Foundations of Vector Retrieval - A comprehensive survey/tutorial paper that formalizes the principles, models, and system designs for vector retrieval, offering theoretical and practical foundations for modern vector databases and vector search engines. (Read more)
- GaussDB-Vector: A Large-Scale Persistent Real-Time Vector Database for LLM Applications - GaussDB-Vector is a large-scale, persistent, real-time vector database system designed specifically for LLM and AI applications. It provides native vector storage and similarity search capabilities, supporting low-latency, high-throughput vector operations and integration with large language model workloads. (Read more)
- Hashing - A set of libraries and methods focused on hashing for similarity search in vector databases, directly impacting the performance of large-scale vector search systems. (Read more)
HashingSimilarity SearchresourcesVector Search - Image Retrieval in the Wild - A CVPR 2020 tutorial on large-scale image retrieval in unconstrained environments, including methods and system considerations for vector-based image search relevant to vector database and ANN applications. (Read more)
TutorialsMultimodalVector Search - Implement two-tower retrieval for large-scale candidate generation - A Google Cloud reference architecture demonstrating an end-to-end two-tower retrieval system for large-scale candidate generation that uses Vertex AI and vector similarity search concepts to learn and serve semantic similarity between entities. (Read more)
RagSemantic Searcharchitectures - IntelLabs's Vector Search Datasets - A collection of datasets curated by Intel Labs specifically for evaluating and benchmarking vector search algorithms and databases. (Read more)
DatasetsVector SearchBenchmarkEvaluation - Introduction to Information Retrieval - Foundational IR textbook that includes content on vector‑space models and retrieval, providing essential background for understanding vector search and hybrid retrieval in modern vector databases. (Read more)
resourcesSearchLearning - Kinomoto.Mag AI - Kinomoto.Mag AI is a blog focused on AI tools, news, and tutorials, including curated lists of vector databases for AI applications. It serves as a resource hub for those interested in the latest innovations in vector databases and AI technologies. (Read more)
blogAiresourcesvector databases - KShivendu/awesome-vector-search - A curated list of awesome projects and research related to vector search, including dedicated vector databases, vector search libraries, performance benchmarks, and cost analysis resources. (Read more)
awesome listVector SearchresourcesOpen Source - LibHunt Vector Database Projects - A curated collection of open-source vector database projects, providing a centralized list for exploring and comparing solutions designed for vector search and AI applications. (Read more)
Open Sourcevector databasesresourcesAi - Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search - Research paper proposing lossless compression techniques for vector identifiers in approximate nearest neighbor (ANN) search systems, aiming to reduce memory footprint and improve efficiency in large-scale vector databases and similarity search engines. (Read more)
- Mastering Multimodal RAG - A course focused on mastering multimodal Retrieval Augmented Generation (RAG) and embeddings, which are fundamental components often stored and managed by vector databases. (Read more)
RagMultimodalEmbeddingsTutorials - Mosaic AI Vector Search - Mosaic AI Vector Search is Databricks’ managed vector database and similarity search service for AI applications, providing high‑capacity, high‑performance vector indexing and querying with configurable endpoint types, including standard and storage‑optimized endpoints that scale to over one billion 768‑dimensional vectors. (Read more)
- Multidimensional data / Vectors - A collection of resources, libraries, and databases focused on handling and searching multidimensional vector data, directly relevant for storing and querying vector embeddings in AI-powered applications. (Read more)
resourcesvector datavector embeddingsawesome list - MyScale Vector Database Benchmark - Benchmark framework and results from MyScale for comparing vector database and ANN index performance using large‑scale datasets and common query workloads relevant to AI applications. (Read more)
- Neural Search in Action - A CVPR 2023 tutorial that demonstrates neural search systems in practice, including vector representations, similarity search, and scalable retrieval architectures closely related to vector databases. (Read more)
TutorialsNeural SearchVector Search - OpenAI Cookbook - A collection of examples and guides from OpenAI, including best practices for working with embeddings, which are fundamental to vector search and vector database applications. (Read more)
OpenaiEmbeddingsresources - Oracle AI Vector Search - Oracle AI Vector Search is Oracle’s integrated vector search capability within Oracle AI Database 26ai, enabling storage and querying of vector embeddings alongside traditional business data. It introduces a native VECTOR data type and supports high‑dimensional semantic similarity search for AI workloads such as chatbots, recommendation systems, anomaly detection, and multimedia search, while allowing embeddings to be used directly with Oracle machine learning algorithms. (Read more)
- Passing the Baton: High Throughput Distributed Disk-Based Vector Search with BatANN - BatANN is a distributed, disk-based vector search system designed for high-throughput approximate nearest neighbor queries at scale, providing an architecture and methods applicable to large-scale vector databases that need efficient storage beyond memory. (Read more)
- PDX: A Data Layout for Vector Similarity Search - PDX is a proposed data layout optimized for vector similarity search, focusing on memory and access efficiency for high-dimensional embeddings, making it relevant for the internal storage design of vector databases and ANN indexes. (Read more)
- Quantization - Resources and tools on quantization techniques for vectors, which are essential for optimizing storage and retrieval in vector databases. (Read more)
Quantizationresourcesvector dataOptimization - Systems - A focused category on complete vector database systems, their architectures, and implementations, directly relevant to anyone seeking production-ready vector database solutions. (Read more)
vector databasesystemsresourcesawesome list - Tree-based Methods - A curated list of tree-based approaches and systems for vector indexing and search, foundational for certain types of vector databases. (Read more)
Tree Basedvector indexingresourcesVector Search - Typesense Cloud - Fully managed cloud service for the open-source Typesense search engine, including support for vector search and hybrid search use cases. (Read more)
Managed ServiceVector SearchHybrid Search - Understanding and Applying Text Embeddings (Vertex AI Short Course) - Short course by DeepLearning.AI and Google Cloud that teaches how to generate and use text embeddings with the Vertex AI Embeddings API for semantic search, classification, and question-answering systems, providing foundational knowledge for working with vector databases and retrieval. (Read more)
- Vector Database Cloud - Vector Database Cloud is a managed cloud platform and ecosystem for building, deploying, and operating applications that use vector databases such as Qdrant and Milvus. It provides APIs, dashboards, and tooling tailored for AI and embedding-based workloads, enabling use cases like content recommendation and real-time fraud detection. (Read more)
- Vector Search - Vector Search is Google Cloud Vertex AI’s managed vector search engine built on the ScaNN algorithm. It provides scalable, high‑performance vector similarity search for semantic search, recommendations, and generative AI applications, offering enterprise‑grade availability and the same underlying technology used in Google products like Search, YouTube, and Google Play. (Read more)
- Vector Search and Embeddings (Google Cloud Skills Boost Course) - Google Cloud Skills Boost course that covers the fundamentals of vector search and text embeddings and shows how to build a vector search application on Vertex AI, including conceptual lessons, demos, and a practice lab. (Read more)
- vector-io - Comprehensive vector data tooling library focused on working with vector embeddings and ANN data, useful for building, evaluating, and managing datasets and pipelines for vector databases and similarity search systems. (Read more)
- vector-search-papers - A curated GitHub repository of research papers and technical blogs focused on vector search, approximate nearest neighbor search (ANN Search), and vector databases. This resource serves as a comprehensive directory for foundational and cutting-edge research, making it highly relevant for anyone building or exploring vector database technologies. (Read more)
Vector SearchResearchpapersAnnvector databases - VectorDB.Works - A web-based directory of vector database solutions, libraries, and resources for AI applications, serving as an accessible resource for exploring and comparing vector databases. (Read more)
resourcesvector databasesdirectoryAi - VectorHub - VectorHub is a resource and learning platform for developers and ML architects interested in integrating vector retrieval and search capabilities into their machine learning stacks, directly supporting vector database adoption and usage. (Read more)
resourcesVector SearchLearningOpen Source - Vertex AI Embeddings - Google Cloud’s managed embeddings service that generates text and multimodal vector representations for search, retrieval, and other AI applications. Frequently used alongside vector databases or vector search services to populate and update vector indexes. (Read more)
- Vertex AI Feature Store - A managed feature store on Google Cloud that serves real-time feature data, often used alongside vector search to enrich or filter results returned from vector indexes in production recommendation and search systems. (Read more)
- Vertex AI Pipelines - A serverless ML orchestration service on Google Cloud used to build automated pipelines that can generate embeddings and create or update vector search indexes, supporting MLOps workflows for vector database–backed search and recommendation systems. (Read more)
- Vertex AI Search ranking API - A Google Cloud API that reranks documents based on semantic relevance using pretrained language models. It complements vector search by improving result ordering for content retrieved from vector databases or vector indexes. (Read more)
- VLDB - New Trends in High-D Vector Similarity Search (Tutorial) - A VLDB conference tutorial focused on new trends and techniques for high-dimensional vector similarity search, covering core algorithms and system designs that underpin modern vector databases and large-scale ANN search. (Read more)
- WARP: An Efficient Engine for Multi-Vector Retrieval - WARP is a research engine for efficient multi-vector retrieval, designed to improve performance of systems that store and search multiple embeddings per document—such as modern vector databases for RAG and semantic search workloads. (Read more)
- Weaviate Recipes (Python) - Weaviate Python Recipes is a collection of Jupyter notebook examples showing how to use Weaviate as a vector database from Python, including ingestion, vector search, hybrid search, and integrations for AI and RAG workloads. (Read more)
- Weaviate Recipes (TypeScript) - Weaviate TypeScript Recipes is a curated set of TypeScript code examples demonstrating how to interact with the Weaviate vector database, covering vector ingestion, querying, and AI-focused search patterns for JavaScript/TypeScript environments. (Read more)
- weaviate-examples - Examples and resources for Weaviate, a popular open-source vector database optimized for storing and searching vector embeddings at scale. (Read more)
Weaviateexamplesresourcesvector embeddings - XiaomingX/awesome-vector-database - A curated directory of resources, tools, tutorials, and libraries dedicated to vector databases, focusing on efficient data retrieval, similarity search, and machine learning applications. (Read more)
vector databasesresourcesTutorialsSimilarity Search
- Cohere's re-ranker - A re-ranking tool provided by Cohere, which can be integrated into LLM applications via frameworks like LangChain to improve the relevance and order of retrieved documents from search systems, including those utilizing vector databases. (Read more)
Re RankingLlmSearch - HuggingFace Text Embedding Server - A server that provides text embeddings, serving as a backend for embedding functions used with vector databases. (Read more)
EmbeddingsHugging FaceApi - Ollama - A tool that allows users to run large language models locally, providing an easy way to set up and interact with various models, including integrations for generating and managing embeddings with vector databases. (Read more)
LlmLocalTool - Elysia - Elysia is an open-source, decision-tree-based agentic system built on top of Weaviate that orchestrates tools and vector-search workflows, demonstrating how to build complex AI agents that leverage a vector database as a core component. (Read more)
RagtoolsVector Search - Verba - Verba is a community-driven, open-source Retrieval-Augmented Generation (RAG) application that provides an end-to-end, user-friendly interface for building RAG workflows on top of a vector database, showcasing practical semantic search and retrieval patterns with Weaviate. (Read more)
RagSemantic SearchOpen Source
- Apache Cassandra Vector Search - Distributed NoSQL database with vector search capabilities via Storage-Attached Indexes (SAI) in Cassandra 5.0+. Uses Lucene HNSW for approximate nearest neighbor search. This is an OSS database under Apache 2.0 license. (Read more)
Open SourceDistributedNosql - Elasticsearch Vector Search - Search and analytics engine with k-nearest neighbor (kNN) search for semantic similarity. Features approximate and exact kNN, HNSW indexing, and advanced quantization. This is commercial with OSS version available. (Read more)
CommercialOpen SourceSearch Engine - Rockset - Real-time analytics database with vector search capabilities, built on RocksDB with converged indexing. Acquired by OpenAI in 2024 to power retrieval infrastructure. This was a commercial service. (Read more)
CommercialReal TimeAnalytics - Activeloop Deep Lake - Multi-modal vector database with tensor storage for vectors, images, texts, videos, and more. Features columnar storage format, time travel, ACID transactions, and terabyte-scale visualization for AI data management. (Read more)
MultimodalTensorData Lake - Couchbase Vector Search - NoSQL database with vector search capabilities through Search Vector Indexes. Couchbase 8.0 introduces Hyperscale Vector Index for billion+ scale searches. This is a commercial database with free community edition. (Read more)
CommercialNosqlHybrid Search - CozoDB - General-purpose, transactional, relational-graph-vector database that uses Datalog for queries. Embeddable but capable of handling large amounts of data and concurrency with HNSW indices for high-performance vector similarity searches. (Read more)
Graph DatabaseVector SearchDatalog - NebulaGraph - Open-source distributed graph database designed for super large-scale graphs with billions of vertices and trillions of edges. Outperforms Neo4j on larger datasets while providing graph database capabilities for AI applications. (Read more)
Graph DatabaseDistributedScalable - SingleStore - Distributed SQL database with built-in vector capabilities. Features SingleStore-V integrated vector system with credit-based pricing at $3.96 per compute credit. This is a commercial database. (Read more)
CommercialSqlDistributed - StarRocks - Open-source high-performance analytical database with vector search capabilities. Features IVFPQ and HNSW indexing for approximate nearest neighbor search in v3.4+. This is an OSS database under Apache 2.0, a Linux Foundation project. (Read more)
Open SourceAnalyticsHybrid Search
- AutoTokenizer (Hugging Face Transformers) - A utility class from the Hugging Face Transformers library that automatically loads the correct tokenizer for a given pre-trained model. It is crucial for consistent text preprocessing and tokenization, a vital step before generating embeddings for vector database storage. (Read more)
NlptokenizationHugging Face - Sentence-Transformers - A Python library for creating sentence, text, and image embeddings, enabling the conversion of text into high-dimensional numerical vectors that capture semantic meaning. It is essential for tasks like semantic search and Retrieval Augmented Generation (RAG), which often leverage vector databases. (Read more)
PythonEmbeddingsSemantic Search - SentenceTransformer - A Python library for generating high-quality sentence, text, and image embeddings. It simplifies the process of converting text into dense vector representations, which are fundamental for similarity search and storage in vector databases. (Read more)
PythonEmbeddingsNlp - AHPQ.jl - AHPQ.jl is a Julia library providing training and inference for anisotropic hierarchical product quantization, compatible with ScaNN-style vector quantization and useful for building high-performance vector search pipelines. (Read more)
product quantizationJuliaVector Search - Amazon OpenSearch k-NN - Amazon OpenSearch's k-NN plugin enables scalable, efficient vector search using ANN algorithms (IVF, HNSW) directly within a managed OpenSearch cluster. It is directly relevant for building, querying, and scaling vector databases on AWS. (Read more)
Vector SearchAnnManaged ServiceOpenSearch - Annoy - An open-source library for approximate nearest neighbor search in high-dimensional spaces, often used as a backend for vector databases and search engines. (Read more)
Open SourceAnnHigh DimensionalVector Search - Deep Searcher - Deep Searcher is a local open-source deep research solution that integrates Milvus and LangChain to provide advanced vector search and retrieval capabilities using open-source models. (Read more)
Open SourceMilvusLangchainVector Search - DiskANN - DiskANN is a graph-based approximate nearest neighbor search (ANNS) system optimized for fast and accurate billion-point nearest neighbor search on a single node, leveraging SSD storage. It is highly relevant for large-scale vector database applications requiring efficient vector search at scale. (Read more)
AnnHigh PerformanceScalableVector Search - EFANNA - EFANNA is an extremely fast approximate nearest neighbor search algorithm based on kNN graphs and randomized KD-trees. The provided implementation offers a high-performance ANN index suitable as a building block in custom vector search and retrieval infrastructure. (Read more)
AnnHigh Performancevector indexing - FAISS - FAISS (Facebook AI Similarity Search) is a popular open-source library for efficient similarity search and clustering of dense vectors. Developed by Facebook/Meta, it supports billions of vectors and is widely used to power vector search engines and databases, especially where raw speed and scalability are needed. (Read more)
Open SourceAnnSimilarity SearchScalable - FastText - FastText is an open-source library by Facebook for efficient learning of word representations and text classification. It generates high-dimensional vector embeddings used in vector databases for tasks like semantic search and document clustering. (Read more)
Open Sourcevector embeddingsSemantic SearchMachine Learning - Gensim - Gensim is a Python library for topic modeling and vector space modeling, providing tools to generate high-dimensional vector embeddings from text data. These embeddings can be stored and efficiently searched in vector databases, making Gensim directly relevant to vector search use cases. (Read more)
Pythonvector embeddingsOpen Sourcetopic modeling - GloVe - GloVe is a widely used method for generating word embeddings using co-occurrence statistics from text corpora. These embeddings are commonly used as input to vector databases for semantic search and other vector-based information retrieval tasks. (Read more)
vector embeddingsMachine LearningOpen SourceSemantic Search - HNSW (Go) - A Go implementation of the HNSW approximate nearest neighbor search algorithm, enabling developers to embed efficient vector similarity search directly into Go services and custom vector database solutions. (Read more)
AnnGoVector Search - HNSW (Rust) - A Rust implementation of the HNSW (Hierarchical Navigable Small World) approximate nearest neighbor search algorithm, useful for building high-performance, memory-safe vector search components in Rust-based AI and retrieval systems. (Read more)
AnnRustVector Search - hora - Hora is an efficient, open-source library for approximate nearest neighbor search, written in Rust. It offers high-performance vector search capabilities for AI and machine learning applications. (Read more)
Open SourceAnnRustHigh Performance - Hugging Face Sentence Transformers Embedding Function for ChromaDB Java Client - An embedding function implementation within the ChromaDB Java client (tech.amikos.chromadb.embeddings.hf.HuggingFaceEmbeddingFunction) that utilizes Hugging Face's cloud-based inference API to generate vector embeddings for documents. (Read more)
EmbeddingsJavaHugging Face - Hugging Face Tokenizers - A library from Hugging Face providing fast and customizable tokenization, a fundamental step for preparing text data for embedding models used with vector databases. (Read more)
NlptokenizationHugging Face - IDEA - IDEA is an inverted, deduplication-aware index structure designed to improve storage efficiency and query performance for similarity search workloads. It is implemented as research code and targets high-dimensional vector and content-addressable data, making it relevant to large-scale vector database and ANN indexing systems. (Read more)
Similarity SearchIndexingHigh Dimensional - iRangeGraph - iRangeGraph is an ANN indexing approach and accompanying implementation for range-filtering nearest neighbor search. It provides a specialized graph-based index that supports vector similarity search under range constraints, making it directly useful as a component or reference implementation for advanced vector database indexing and retrieval. (Read more)
Anngraph indexSimilarity Search - JinaEmbeddingFunction - A wrapper embedding function for Jina Embedding models, used to generate vector embeddings. (Read more)
EmbeddingsJinaApi - jvector - jvector is a high-performance Java-based library and engine for vector search and approximate nearest neighbor indexing. (Read more)
AnnVector SearchHigh Performance - LangChain - LangChain is an open-source framework that integrates with various vector databases, including Pinecone, Weaviate, and Chroma, to facilitate retrieval-augmented generation (RAG) and advanced AI workflows. (Read more)
Open SourceRagAiIntegration - Langflow - Langflow is a platform that simplifies building AI agents by connecting models, vector stores, memory, and other AI building blocks. It is relevant to vector databases as it supports integration with vector stores for AI-powered agents. (Read more)
Aivector storesIntegrationOpen Source - LibVQ - LibVQ is an open-source toolkit for optimizing vector quantization and efficient neural retrieval, offering training and indexing components that can serve as the core of high-performance approximate nearest neighbor search and vector database systems. (Read more)
vector quantizationNeural SearchAnn - Milvus CLI - Milvus CLI is a command-line interface for managing and interacting with Milvus vector databases, allowing users to perform database operations and manage collections efficiently. (Read more)
MilvusCLIManagementvector databases - Milvus Lite - Milvus Lite is a lightweight, pip-installable variant of the Milvus vector database that runs as a library in notebooks or laptops, ideal for learning, experimentation, and rapid prototyping of AI and vector search applications. (Read more)
Vector SearchLightweightPython - NearestNeighbors.jl - NearestNeighbors.jl is a Julia package implementing various nearest neighbor search algorithms and index structures for high-dimensional vector data. (Read more)
AnnJuliaVector Search - Neighbor - Ruby gem for approximate nearest neighbor search that can integrate with pgvector and other backends to power vector similarity search in Ruby applications. (Read more)
AnnRubySimilarity Search - NMSLIB - NMSLIB is an efficient similarity search library and toolkit for high-dimensional vector spaces, supporting a variety of indexing algorithms for vector database use cases. (Read more)
Open SourceAnnSimilarity SearchHigh Dimensional - NSG - NSG is an approximate nearest neighbor search algorithm based on a sparse navigable graph structure designed for high-dimensional vector similarity search. The reference implementation provides a graph-based ANN index that can be integrated into custom vector retrieval systems. (Read more)
Anngraph indexSimilarity Search - NVIDIA CAGRA - NVIDIA CAGRA is a GPU-accelerated graph-based library for approximate nearest neighbor searches, optimized for high-performance vector search leveraging modern GPU parallelism. It is suitable for scenarios requiring rapid, large-scale vector retrieval. (Read more)
Gpu AccelerationAnnHigh PerformanceVector Search - OpenAIEmbeddingFunction - An embedding function that utilizes the OpenAI API to compute vector embeddings, commonly used with vector databases. (Read more)
EmbeddingsOpenaiApi - ParlayANN - ParlayANN is a scalable and deterministic parallel graph-based approximate nearest neighbor (ANN) search library. It provides parallel algorithms and implementations for high-dimensional vector similarity search, suitable as a core search component in large-scale vector database and retrieval systems. (Read more)
AnnParallelScalable - PGVector - PostgreSQL supports vector indexing and similarity search via the PGVector extension, allowing relational databases to manage and retrieve vector embeddings efficiently. (Read more)
Open SourceVector SearchPostgresqlSimilarity Search - pgvector-cobol - COBOL bindings and examples for pgvector, letting legacy COBOL systems interact with PostgreSQL as a vector database. (Read more)
SDKPgvectorvector store - pgvector-crystal - Crystal language client for pgvector, providing idiomatic Crystal access to vector operations in PostgreSQL. (Read more)
SDKPgvectorvector store - pgvector-dotnet - .NET (C#, F#, Visual Basic) library for pgvector that exposes vector storage and similarity queries on PostgreSQL to .NET applications. (Read more)
SDKPgvectorvector store - pgvector-elixir - Elixir wrapper and examples for pgvector, integrating PostgreSQL-based vector search into Elixir ecosystems like Phoenix. (Read more)
SDKPgvectorvector store - pgvector-erlang - Erlang client and examples for pgvector, providing tools to run vector operations against PostgreSQL from Erlang systems. (Read more)
SDKPgvectorvector store - pgvector-gleam - Gleam language client and examples for pgvector, allowing Gleam applications to perform vector similarity search using PostgreSQL. (Read more)
SDKPgvectorvector store - pgvector-haskell - Haskell bindings and examples for pgvector, enabling Haskell applications to treat PostgreSQL as a vector database. (Read more)
SDKPgvectorvector store - pgvector-lisp - Lisp bindings and examples for pgvector, allowing Common Lisp projects to leverage PostgreSQL as a vector store. (Read more)
SDKPgvectorvector store - pgvector-node - JavaScript/TypeScript (Node.js) client for pgvector, enabling server-side JS apps to run vector queries on PostgreSQL. (Read more)
SDKPgvectorvector store - pgvector-ocaml - OCaml client and examples for pgvector that provide access to vector indexing and nearest-neighbor search in PostgreSQL from OCaml code. (Read more)
SDKPgvectorvector store - pgvector-pascal - Pascal bindings and examples for pgvector, supporting PostgreSQL-powered vector search from Pascal applications. (Read more)
SDKPgvectorvector store - pgvector-perl - Perl client and examples for pgvector, exposing vector data types and similarity queries in PostgreSQL to Perl scripts and apps. (Read more)
SDKPgvectorvector store - pgvector-prolog - Prolog client and examples for pgvector, enabling logic programs to interact with vector search capabilities in PostgreSQL. (Read more)
SDKPgvectorvector store - pgvector-python - Python library and examples for pgvector, integrating Python AI/ML pipelines with PostgreSQL vector storage and similarity queries. (Read more)
SDKPgvectorvector store - pgvector-ruby - Ruby client and examples for pgvector, integrating Ruby applications (including Rails) with PostgreSQL vector operations for AI use cases. (Read more)
SDKPgvectorvector store - pgvector-rust - Rust client and examples for pgvector, offering idiomatic Rust APIs for embedding storage and similarity queries in PostgreSQL. (Read more)
SDKPgvectorvector store - pgvector-swift - Swift bindings and examples for pgvector, allowing Swift and server-side Swift apps to use PostgreSQL as a vector database. (Read more)
SDKPgvectorvector store - PilotANN - PilotANN is a memory-bounded GPU-accelerated framework for large-scale vector search, designed to improve performance and efficiency of approximate nearest neighbor (ANN) search workloads, making it relevant as a high-performance engine/component in vector database and vector search systems. (Read more)
Gpu AccelerationAnnHigh Performance - Product-Quantization - Product-Quantization is a GitHub repository implementing the inverted multi-index structure for product-quantization-based approximate nearest neighbor search, providing building blocks for scalable vector search engines. (Read more)
product quantizationAnnvector indexing - pymilvus - pymilvus is the official Python SDK for Milvus, allowing developers to interact programmatically with the Milvus vector database. It provides utilities for transforming unstructured data into vector embeddings and supports advanced features such as reranking for optimized search results. The pymilvus[model] variant includes utilities for generating vector embeddings from text using built-in models.
PythonMilvusvector embeddingsSDK - Qinco - Qinco is an open-source implementation from Facebook Research for Residual Quantization with Implicit Neural Codebooks. It provides quantization and indexing methods for compact vector representations to accelerate similarity and nearest neighbor search, making it relevant as a low-level vector indexing and compression component for vector databases and large-scale AI retrieval systems. (Read more)
vector compressionSimilarity SearchOpen Source - RaBitQ - RaBitQ is an open-source library implementing the "Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search" method, providing vector quantization and compression techniques designed to improve efficiency and accuracy of ANN search engines and vector databases operating in high-dimensional spaces. (Read more)
Annvector compressionHigh Dimensional - Reconfigurable Inverted Index - Reconfigurable Inverted Index (Rii) is a research project and open-source library for approximate nearest neighbor and similarity search over high-dimensional vectors. It focuses on flexible, reconfigurable inverted index structures that support efficient vector search, making it directly relevant as a vector-search engine component for AI and multimedia retrieval applications. (Read more)
Annvector indexingSimilarity Search - RediSearch - RediSearch is a Redis module that provides high-performance vector search and similarity search capabilities on top of Redis, enabling advanced search and retrieval features for AI and data applications. (Read more)
Vector SearchRedisOpen SourceSimilarity Search - RETA-LLM - RETA-LLM is a toolkit designed for retrieval-augmented large language models. It is directly relevant to vector databases as it involves retrieval-based methods that typically leverage vector search and vector databases to enhance language model capabilities through external knowledge retrieval. (Read more)
RagLlmRetrievalVector Search - RTNN - RTNN is a research prototype system and codebase that accelerates high-dimensional nearest neighbor search using hardware ray tracing units on modern GPUs. It targets vector similarity search workloads common in AI applications, exploring ray-tracing hardware as an alternative acceleration path to traditional CPU- or CUDA-based ANN indexes. (Read more)
Gpu AccelerationAnnSimilarity Search - ScaNN - A library by Google Research for efficient vector similarity search, suitable for large-scale nearest neighbor applications in AI. (Read more)
Open SourceAnnVector SearchAi - SimSIMD - Open‑source library providing fast SIMD‑accelerated implementations of similarity and distance computations (e.g., vector inner products and distances), serving as an efficient alternative to scipy.spatial.distance and numpy.inner for vector search and vector database workloads. (Read more)
Similarity SearchOptimizationvector processing - spaCy - spaCy is an industrial-strength NLP library in Python that provides advanced tools for generating word, sentence, and document embeddings. These embeddings are commonly stored and searched in vector databases for NLP and semantic search applications. (Read more)
Pythonvector embeddingsNlpOpen Source - SPTAG - SPTAG is a distributed approximate nearest neighbor (ANN) library for building and searching large-scale vector indexes, supporting efficient and scalable vector search scenarios. (Read more)
Open SourceAnnDistributedScalable - SymphonyQG - SymphonyQG is a research codebase and method that integrates vector quantization with graph-based indexing to build efficient approximate nearest neighbor (ANN) indexes for high-dimensional vector search. It targets vector database and similarity search scenarios where combining compact codes with navigable graphs can improve recall–latency tradeoffs and memory footprint. (Read more)
Annvector quantizationgraph index - Tantivy - Tantivy is a full-text search engine library inspired by Apache Lucene, offering fast and scalable similarity search capabilities. While primarily focused on text, it supports efficient vector-based similarity searches, making it useful for vector search tasks. (Read more)
Open Sourcefull-text searchVector SearchScalable - usearch - usearch is a fast, open-source search and clustering engine designed for efficient vector search across multiple programming languages. (Read more)
Open SourceVector SearchClusteringmulti-language - Voyager - Voyager is a Spotify open-source vector search library and service for efficient nearest neighbor search on large-scale vector datasets. (Read more)
AnnVector SearchOpen Source - vsag - vsag is an Alibaba open-source library implementing efficient vector search algorithms, including approximate nearest neighbor search for high-dimensional vectors. (Read more)
AnnHigh DimensionalVector Search - Word2vec - Word2vec is a popular machine learning technique for generating vector embeddings based on the distributional properties of words in large corpora. It is directly relevant to vector databases as it produces the high-dimensional vector representations stored and indexed by these databases for vector search and similarity tasks. (Read more)
vector embeddingsMachine LearningOpen SourcePython
- Deep Lake 4.0 - AI data lake with revolutionary index-on-the-lake technology enabling sub-second queries from S3. Features 10x cost efficiency vs in-memory DBs and 2x faster than alternatives. This is a commercial platform with OSS components. (Read more)
CommercialData LakeMultimodal - Vespa - Open-source AI search platform combining vector search, keyword retrieval, structured filtering, and ML ranking. Powers applications at Spotify, Yahoo, and Wix with sub-100ms response times. This is an OSS platform under Apache 2.0 with managed cloud option. (Read more)
Open SourceEnterpriseHybrid Search - YugabyteDB with pgvector - PostgreSQL-compatible distributed database with pgvector support and USearch integration, proven to handle billions of vectors with 96.56% recall and sub-second query latency. (Read more)
PostgresqlDistributedOpen Source - Actian VectorAI DB - Edge-native vector database designed for deployment at remote locations and edge devices with no cloud dependency. Supports real-time decision making with sub-15ms query latency and operates independently during network outages. (Read more)
EdgeOn PremisesOffline - Couchbase Lite Vector - Developer-friendly, full-featured embedded NoSQL database with vector search for offline-first GenAI apps running on mobile, IoT devices, and web browsers with no internet dependencies. (Read more)
EmbeddedOfflineMobile - ObjectBox Vector - On-device vector database with out-of-the-box data sync, designed for resource-efficiency on mobile, IoT, and embedded devices, enabling offline-first AI applications without internet dependency. (Read more)
EdgeEmbeddedOffline - Qdrant Edge - Lightweight embedded vector search engine designed for real-time vector search on edge devices like robots, kiosks, and mobile phones with limited computational resources and offline capabilities. (Read more)
EdgeEmbeddedOffline - ScyllaDB Vector Search - High-performance NoSQL database with vector search capabilities built on USearch library and shard-per-core architecture, storing vector embeddings alongside structured data in unified tables. (Read more)
NosqlDistributedHigh Performance - Turso - SQLite-based database with native vector search capabilities built directly into the database without extensions. Based on libSQL fork of SQLite with support for DiskANN algorithm for approximate nearest neighbor search. This is a commercial solution with free tier available. (Read more)
CommercialSqliteEdge - Zvec - Lightweight embedded vector database for RAG systems useful in edge environments, running directly on devices with local vector search and no network latency or cloud dependencies. (Read more)
EmbeddedEdgeLightweight
- Azure Cosmos DB NoSQL Vector Search - Microsoft's globally distributed multi-model database with native vector search using DiskANN algorithm. Features <20ms query latency and 43x lower cost vs Pinecone. This is a commercial managed service. (Read more)
CommercialAzureNosql - Vertex AI Vector Search - Google Cloud's vector search engine (formerly Matching Engine) built on ScaNN algorithm. Version 2.0 features unified data model with hybrid search and auto-generated embeddings. This is a commercial managed service. (Read more)
CommercialGoogle Cloudmanaged service - AlloyDB for PostgreSQL with Vector Search - Google Cloud's PostgreSQL-compatible database with state-of-the-art HNSW index build performance, achieving 9x speedup over pgvector on CPU for vector workloads. (Read more)
PostgresqlGoogle CloudManaged Service - AWS OpenSearch k-NN - Managed OpenSearch service with k-nearest neighbor search capabilities. Uses HNSW, Faiss, and Lucene libraries for approximate nearest neighbor searches. This is a commercial managed service. (Read more)
CommercialAwsmanaged service - Turbopuffer - Serverless vector and full-text search database built on object storage with sub-10ms p50 latency. 10x cheaper than alternatives while hosting 2.5T+ documents and serving 10k+ queries per second. (Read more)
ServerlessObject StorageCost Effective
- Amazon Aurora Machine Learning - A feature of Amazon Aurora that enables making calls to ML models like Amazon Bedrock or Amazon SageMaker through SQL functions, allowing direct generation of embeddings within the database and abstracting the vectorization process. (Read more)
Machine LearningEmbeddingsAws - Amazon Aurora Serverless v2 - An on-demand, auto-scaling configuration for Amazon Aurora DB instances that automatically adjusts compute and memory capacity based on load, integrated with Knowledge Bases for Amazon Bedrock to simplify vectorization and database capacity management. (Read more)
Cloud NativeServerlessAws - Instaclustr Vector Database Management - A managed service and tooling offering from Instaclustr that helps teams operate and optimize vector databases for GenAI and Retrieval-Augmented Generation (RAG) workloads, providing expertise and infrastructure management for production deployments. (Read more)
Managed ServiceRagvector databases - MotherDuck - A cloud data warehouse that can be leveraged to store vector embeddings as List data types, enabling semantic search capabilities through SQL-based similarity functions within an existing data pipeline. (Read more)
Clouddata warehousingvector embeddings - Nextbrick Managed Vector Database Service - A fully managed vector database infrastructure and operations service provided by Nextbrick. It focuses on deployment, configuration, tuning, scaling, security, and maintenance of vector databases for AI and similarity search workloads. The service handles sharding, replication, query optimization, backups, and disaster recovery so organizations can offload operational management and focus on building AI applications. (Read more)
Managed Servicevector databaseservices - Qdrant Cloud Inference - Qdrant Cloud Inference is a managed inference service integrated with the Qdrant vector database, allowing users to generate embeddings and work with vector search pipelines directly in the cloud environment. (Read more)
Managed ServiceEmbeddingsVector Search - Snowflake - A cloud data platform that offers capabilities for storing and querying various data types, including vector embeddings, often used in conjunction with its data warehousing features. (Read more)
Clouddata warehousingvector embeddings
- Building Applications with Vector Databases - DeepLearning.AI course teaching six practical vector database applications using Pinecone, including RAG for LLMs, recommender systems, and hybrid search combining images and text. (Read more)
LearningTutorialsRag - Vector Database Market Trends 2026 - Comprehensive overview of vector database evolution in 2026, including the shift to vectors as data types, PostgreSQL dominance, 400% adoption surge, and $10.6B projected market by 2032. (Read more)
MarketTrendsSurvey - LangChain & Vector Databases in Production - Free comprehensive course from Activeloop with 60+ lessons and 10+ practical projects, teaching production-ready LLM applications with vector databases, trusted by 10,000+ engineers. (Read more)
LearningLangchainRag - Vector Database Fundamentals (Coursera) - IBM's comprehensive specialization providing job-ready vector database skills in one month, covering foundational knowledge for LLM-powered AI similarity searches, available for free enrollment. (Read more)
LearningTutorialsCertification
- Qdrant Cloud - Managed vector database service with 1GB free forever cluster (no credit card required). Fully managed with multi-cloud support across AWS, GCP, and Azure. This is a commercial managed service. (Read more)
Commercialmanaged serviceMulti Cloud - Qdrant Hybrid Cloud - Industry-first managed vector database deployable in any environment - cloud, on-premise, or edge. Kubernetes-native with complete data sovereignty while maintaining managed service convenience. (Read more)
Hybrid CloudKubernetesEnterprise - BagelDB - Collaborative vector database platform described as 'GitHub for AI data'. Features distributed storage, HNSW indexing, and supports private, collaborative, and public vector datasets. This is a commercial platform with open collaboration features. (Read more)
CommercialCollaborativeDistributed - DataStax Astra DB - Serverless vector database built on Apache Cassandra that empowers developers to build AI applications with real-time data handling. Features 20% higher relevance and 74x faster responses with advanced vector and knowledge graph capabilities. (Read more)
CassandraServerlessEnterprise - LanceDB Cloud - Fully managed serverless vector database service with automatic scaling and infrastructure management. Seamless transition from LanceDB OSS with the same SDK, starting at $16.03/month with $100 free credits. (Read more)
ServerlessManagedCloud Native - Momento Vector Index - Serverless vector indexing service designed for real-time storage and retrieval of vector data. Developer-friendly with just 5 API calls to create complete indexes, featuring transparent pricing. This is a commercial managed service. (Read more)
CommercialServerlessReal Time - Neon - Serverless Postgres with native pgvector support for vector embeddings and similarity search. Features instant provisioning, autoscaling, and scale-to-zero with separated compute and storage. This is a commercial managed service with free tier. (Read more)
CommercialServerlessPostgresql - Nuclia - AI Search and RAG-as-a-Service platform with semantic search capabilities. Features NucliaDB open-source database. Acquired by Progress in 2025, now part of Progress Agentic RAG. This is a commercial service with OSS core (NucliaDB). (Read more)
CommercialOpen SourceRag - Supabase Vector - Open-source toolkit for developing AI applications using Postgres and pgvector. Provides managed PostgreSQL with built-in vector support, Python client (vecs), and AI features. This is a commercial managed service with OSS components. (Read more)
CommercialOpen SourcePostgresql - Upstash Vector - Serverless vector database with pay-per-use pricing and scale-to-zero capability. Fully managed service that scales to billions of vectors with simple per-request pricing. This is a commercial managed service. (Read more)
CommercialServerlessmanaged service - Vespa Cloud - Unified search and AI engine with seamless scaling, intelligent retrieval, and precision ranking. Goes beyond simple vector search with tensor support, multi-phase ranking, and hybrid retrieval blending semantic, textual, and structured signals at scale. (Read more)
Hybrid SearchRankingScalable
- HNSWlib - Header-only C++/Python library for fast approximate nearest neighbor search implementing the HNSW algorithm. Used by Spotify and others, offers 10x speed increase over Annoy. This is an OSS library. (Read more)
Open SourceHnswCpp - NVIDIA cuVS - GPU-accelerated vector search and clustering library from NVIDIA RAPIDS. Provides 8-12x faster index building and queries with multiple language support (C, C++, Python, Rust). This is an OSS library. (Read more)
Open SourceGpu AccelerationNvidia - ELPIS - Graph-based similarity search algorithm achieving 0.99 recall, building indexes 3-8x faster than competitors with 40% less memory. Answers 1-NN queries up to 10x faster than serial scan. (Read more)
AnnGraph BasedResearch - GLASS - Leading graph-based ANN library optimized for approximate nearest neighbor search, offering competitive performance especially at lower recall levels across diverse datasets. (Read more)
AnnGraph BasedCpp - hnswlib-rs - Pure-Rust implementation of HNSW algorithm for approximate nearest neighbor search. Decouples graph from vector storage for flexible deployment. Supports dense floating point and quantized int8 vectors. This is an OSS library. (Read more)
Open SourceRustHnsw - OdinANN - Billion-scale graph-based ANNS index with direct insertion capabilities. Achieves <1ms search latency with >10x less memory than in-memory indexes through GC-free design and update combining. (Read more)
AnnDisk BasedHigh Performance - PageANN - Disk-based approximate nearest neighbor search framework with page-aligned graph structure. Achieves 1.85x-10.83x higher throughput than state-of-the-art methods through optimized SSD utilization. (Read more)
AnnDisk BasedOpen Source - PipeANN - Low-latency, billion-scale updatable graph-based vector store on SSD. Achieves <1ms search latency with 10x less memory than in-memory indexes through alignment of best-first search with SSD characteristics. (Read more)
AnnDisk BasedOpen Source - PyNNDescent - Python implementation of Nearest Neighbor Descent for k-neighbor-graph construction and ANN search. Targets 80%-100% accuracy with fast performance and supports wide variety of distance metrics. This is an OSS library. (Read more)
Open SourcePythonAnn - SPANN - Highly-efficient billion-scale approximate nearest neighbor search system from Microsoft Research. Uses a memory-disk hybrid architecture storing centroid points in memory and large posting lists on disk. (Read more)
AnnHybrid SearchMicrosoft - VectorDB - Lightweight Python package for storing and retrieving text using chunking, embeddings, and vector search. Powers AI features in Kagi Search with low latency and small memory footprint. This is an OSS library. (Read more)
Open SourcePythonLightweight
- pgvecto.rs - PostgreSQL extension for scalable, low-latency vector search written in Rust. Features 20x faster HNSW than pgvector, with support for FP16, INT8, and binary vectors. This is an OSS extension. (Read more)
Open SourcePostgresqlRust - VectorChord - PostgreSQL extension for scalable, high-performance vector search, successor to pgvecto.rs. Features RaBitQ quantization enabling 6x cost savings vs Pinecone. Fully compatible with pgvector. This is an OSS extension. (Read more)
Open SourcePostgresqlQuantization - Neo4j Vector Index - Vector search capabilities in Neo4j graph database using HNSW indexing. Enables combining knowledge graphs with semantic similarity search for hybrid retrieval that leverages both graph relationships and vector embeddings. (Read more)
Graph DatabaseHnswKnowledge Graph - pgai - Open-source PostgreSQL extension and Python library that automates embedding generation and synchronization for RAG and semantic search applications. Features pgai Vectorizer for declarative embedding pipelines. This is an OSS solution. (Read more)
Open SourcePostgresqlEmbedding - pgvectorscale - PostgreSQL extension that builds on pgvector for higher-performance embedding search with DiskANN indexing. Achieves 28x lower latency and 16x higher throughput than Pinecone at 75% less cost on 50M embeddings. (Read more)
PostgresqlOpen SourcePerformance - PlanetScale Vectors - Vector search and storage for MySQL, now generally available. PlanetScale Vectors brings native vector capabilities to MySQL, allowing you to store and query vector embeddings alongside relational data without requiring a separate vector database. (Read more)
MysqlCloud NativeVector Search - Redis Vector Search - Native vector database capabilities in Redis combining ultra-low latency in-memory operations with vector similarity search. Redis 8.0 introduced vector sets as native data type for semantic search, RAG pipelines, and recommendations. (Read more)
RedisIn MemoryReal Time - Timescale Vector - PostgreSQL-based vector search solution built on Timescale Cloud with pgai extensions including pgvector, pgvectorscale, and pgai. Features StreamingDiskANN index for high-performance embedding search at scale. (Read more)
PostgresqlPgvectorTime Series
- MTEB Leaderboard - Massive Text Embedding Benchmark leaderboard covering 58 datasets across 112 languages and 8 embedding tasks. Industry-standard benchmark for comparing text embedding models. (Read more)
BenchmarkEmbeddingsEvaluation - Big-ANN Benchmarks - Billion-scale approximate nearest neighbor search benchmark competition. Features datasets like SIFT1B, Deep1B with standardized evaluation metrics for comparing vector search algorithms at scale. (Read more)
BenchmarkAnnCompetition - BigANN Benchmarks - Main competition for large-scale vector database algorithms held at NeurIPS conferences. Framework for evaluating approximate nearest neighbor search algorithms on billion-scale datasets with standardized metrics and datasets. (Read more)
BenchmarkCompetitionAnn - Deep1B Dataset - Billion-scale benchmark dataset containing 96-dimensional deep learning image embeddings. Provides real-world proxy for testing distributed systems and GPU-accelerated vector search at scale. (Read more)
BenchmarkDatasetsDeep Learning - MMTEB - Massive Multilingual Text Embedding Benchmark covering over 500 quality-controlled evaluation tasks across 250+ languages, representing the largest multilingual collection of embedding model evaluation tasks. (Read more)
BenchmarkMultilingualEvaluation - MTEB (Massive Text Embedding Benchmark) - Comprehensive benchmark evaluating universal text embedding models across 1000+ languages and 58 datasets over 8 tasks. The industry standard for comparing embedding model performance. (Read more)
BenchmarkEvaluationEmbeddings - SIFT1B Dataset - Billion-scale benchmark dataset containing 128-dimensional SIFT descriptors of one billion images. Widely used standard for evaluating approximate nearest neighbor search algorithms at scale. (Read more)
BenchmarkDatasetsAnn - VectorDBBench - Open-source vector database benchmarking tool testing databases across production-critical scenarios including static collection, filtering, and streaming cases with modern embedding model datasets. (Read more)
BenchmarkOpen SourcePerformance - ViDoRe - Visual Document Retrieval Benchmark defining standard evaluation protocols for vision-centric document and video retrieval with 26,000 pages and 3,099 queries across 6 languages from 12,000 man-hours of annotations. (Read more)
BenchmarkMultimodalRag
- Unstructured.io - Deep document parsing platform with strong OCR capabilities excelling at extracting structured data from complex layouts including multi-column PDFs, scanned documents, and forms. (Read more)
Data IntegrationOcrDocument Parsing - Kanister for Vector Database Backup - Open-source CNCF Sandbox project enabling efficient and secure backup and restore strategies for vector databases on Kubernetes with cloud-native integration. (Read more)
BackupKubernetesDisaster Recovery - LlamaHub - Open-source repository with 160+ community-created data loaders, readers, tools, and connectors for LlamaIndex applications, covering formats from PDFs to Notion databases. (Read more)
Data IntegrationLoadersOpen Source - VectorFlow - Open-source high-throughput vector embedding pipeline for ingesting raw data, transforming into vectors, and loading into vector databases. Technology-agnostic with automatic retry and fault tolerance. (Read more)
EtlPipelineOpen Source
- Haystack - Mature, modular open-source Python framework for building production-grade RAG pipelines, AI agents, and semantic search systems, trusted by The European Commission and The Economist. (Read more)
RagPythonEnterprise - DSPy - Programming framework for RAG and AI applications with cutting-edge optimization capabilities, featuring the lowest framework overhead and automatic improvement based on example data. (Read more)
RagPythonOptimization - txtai - All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows. Combines vector indexes (sparse/dense), graph networks and relational databases. This is an OSS framework. (Read more)
Open SourcePythonRag
- RAGAS - Research-backed RAG evaluation framework providing metrics for context precision, recall, faithfulness, and response relevancy to objectively measure LLM application performance. (Read more)
RagEvaluationMetrics - Datadog Vector Database Monitoring - Comprehensive observability solution for vector databases through Zilliz Cloud integration, providing metrics for QPS, latency, slow queries, and failure rates alongside full stack monitoring. (Read more)
ObservabilityMonitoringIntegration - Langfuse - Open-source LLM engineering platform providing observability, metrics, evaluations, and prompt management. Integrates with OpenTelemetry, LangChain, OpenAI SDK, and vector databases for RAG pipeline monitoring. (Read more)
ObservabilityOpen SourcePrompt Management - Langtrace - Open-source LLM observability tool built on OpenTelemetry standards. Automatically captures traces from LLM APIs, vector databases, and frameworks with support for over 30 popular providers. (Read more)
ObservabilityOpen SourceOpentelemetry - Monte Carlo Vector Database Observability - Data observability platform specifically supporting vector databases including Pinecone, providing comprehensive monitoring across the five pillars of data observability. (Read more)
ObservabilityData QualityMonitoring - VectorAdmin - Universal vector database management UI and tool suite supporting multiple platforms including Pinecone, Chroma, Qdrant, and Weaviate for centralized administration. (Read more)
GuiManagementTool
- all-MiniLM-L6-v2 - A compact and efficient pre-trained sentence embedding model, widely used for generating vector representations of text. It's a popular choice for applications requiring fast and accurate semantic search, often integrated with vector databases. (Read more)
EmbeddingsNlpAi - OpenAI’s text-embedding-ada-002 - A pre-trained model used for extracting embeddings from content like PDFs, videos, and transcripts, which are then stored in vector databases for faster search. (Read more)
EmbeddingsAiOpenai
- DuckDB - An in-memory, open-source, and free analytical database that speaks SQL, heavily based on vectorization. It can store and process vector embeddings using Array and List data types to enable vector search, bridging the gap between data engineering and AI workflows with fast response times. (Read more)
In MemoryOpen SourceAnalyticsSql - Azure Cosmos DB Vector Indexing - Native vector indexing capability in Azure Cosmos DB that supports flat, quantizedFlat, and diskANN index types for efficient vector similarity search using the VectorDistance function. It enables low-latency, high-throughput, and cost-efficient vector search directly in Cosmos DB collections, with options for brute-force exact search (flat), compressed brute-force search (quantizedFlat), and approximate nearest neighbor search (diskANN). (Read more)
Vector SearchDiskannCloud Native - Couchbase - A database platform that includes vector support, aiming to enhance developer productivity with AI tools like Capella IQ. (Read more)
Nosqlvector dataAi - OpenSearch Vector Search - OpenSearch Vector Search is the vector similarity search and AI search capability within the OpenSearch engine, supporting vector indices, ingestion of embedding data, and search methods including raw vector search, semantic search, hybrid search, multimodal search, and neural sparse search. It enables building RAG and conversational search applications using either user-provided embeddings or embeddings generated automatically by OpenSearch. (Read more)
Vector SearchHybrid SearchSemantic Search - SingleStoreDB (formerly MemSQL) - SingleStoreDB is an enterprise database that has supported vectors since 2017, in addition to exact keyword match, and recently announced support for additional vector indexes. (Read more)
EnterpriseSqlvector indexes - SurrealDB - A multi-model database that supports various data types and query languages, including capabilities for handling vector data. (Read more)
multi-modelvector dataNosql
- k-NN plugin - An OpenSearch plugin that expands its capabilities with the custom
knn_vectordata type, enabling storage of embeddings and providing methods for k-NN similarity searches, including Approximate k-NN, Script Score k-NN, and Painless extensions. (Read more)OpenSearchk-NNVector Search - faiss-quickeradc - faiss-quickeradc is an extension of FAISS that implements the Quicker ADC approach to accelerate product-quantization-based approximate nearest neighbor search using SIMD, improving performance in vector database retrieval. (Read more)
Annproduct quantizationOptimization - HeatWave - A feature for MySQL that integrates vector store capabilities, allowing users to store and process vector embeddings for AI applications. (Read more)
Mysqlvector storeextension - Lantern - Lantern is a PostgreSQL extension that enables efficient vector search capabilities, allowing users to perform similarity searches directly within their PostgreSQL databases. (Read more)
PostgresqlVector Searchextension - MariaDB Vector - MariaDB Vector is an extension or feature of MariaDB, providing capabilities for handling and querying vector data within the MariaDB ecosystem. (Read more)
relational databaseVector Searchextension - Neo4j Vector Search - An enhancement to the Neo4j graph database providing vector search capabilities through dedicated indexes. (Read more)
Graph DatabaseVector Searchextension - OpenSearch Neural Search / Hybrid Search - Neural and hybrid search capability in OpenSearch that combines lexical queries with vector-based neural search using a pipeline of normalization and score combination techniques. It enables semantic (vector) search and hybrid search over indices such as
neural_search_pqa, suitable for AI and vector database-style retrieval use cases. (Read more)Hybrid SearchSemantic SearchVector Search - SuperDuperDB - Open-source AI-native database layer that adds vector search, model integration, and AI workflows on top of existing databases like MongoDB and Postgres. (Read more)
Vector SearchMongoDBPostgresql
- ANN-Benchmarks - ANN-Benchmarks is a benchmarking platform specifically for evaluating the performance of approximate nearest neighbor (ANN) search algorithms, which are foundational to vector database evaluation and comparison. (Read more)
BenchmarkAnnEvaluationPerformance - BEIR - BEIR (Benchmarking IR) is a benchmark suite for evaluating information retrieval and vector search systems across multiple tasks and datasets. Useful for comparing vector database performance. (Read more)
BenchmarkEvaluationVector SearchDatasets - Billion-scale ANNS Benchmarks - A benchmarking resource for evaluating approximate nearest neighbor search (ANNS) methods on billion-scale datasets, highly relevant for assessing the scalability of vector databases. (Read more)
BenchmarkANNSScalabilityPerformance - Milvus Sizing Tool - Milvus Sizing Tool helps users estimate the hardware and resource requirements needed to deploy Milvus based on their anticipated data scale and workload. (Read more)
MilvussizingPerformanceresource estimation - MyScale's Vector Database Benchmark - Benchmark results and tools by MyScale aimed at measuring the performance of vector databases in various search and retrieval tasks. (Read more)
Benchmarkvector databasesPerformanceRetrieval - Qdrant's Vector Database Benchmarks - A set of benchmarks provided by Qdrant for evaluating vector databases, focusing on speed, scalability, and accuracy of vector search operations. (Read more)
Benchmarkvector databasesPerformanceScalability - SISAP Indexing Challenge - An annual competition focused on similarity search and indexing algorithms, including approximate nearest neighbor methods and high-dimensional vector indexing, providing benchmarks and results relevant to vector database research. (Read more)
BenchmarkSimilarity SearchEvaluation - WEAVESS - WEAVESS is an open-source benchmarking and evaluation framework for graph-based approximate nearest neighbor (ANN) search methods, providing code and experiments for large-scale vector similarity search. It is useful for researchers and practitioners comparing vector indexing algorithms for vector databases and AI search applications. (Read more)
AnnBenchmarkSimilarity Search - Zeng, Xianzhi, et al. "CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion." - A 2024 paper introducing CANDY, a benchmark for continuous ANN search with a focus on dynamic data ingestion, crucial for next-generation vector databases. (Read more)
BenchmarkAnndynamic dataVector Search
- Denser Retriever - Denser Retriever is a vector-based retrieval system designed for efficient similarity search and information access in AI and ML workloads. (Read more)
Vector SearchSimilarity SearchAiCommercial - LiquidMetal AI - LiquidMetal AI is a platform providing intelligent storage with built-in AI capabilities, including vector database features for building advanced AI applications. (Read more)
Aivector databasesCommercialintelligent storage - Meilisearch Vector Search - Meilisearch offers vector search capabilities as part of its search engine, enabling hybrid and semantic search for AI applications. (Read more)
Vector SearchSemantic SearchHybrid SearchCommercialAi - Qdrant Enterprise Solutions - Qdrant Enterprise Solutions provide enterprise‑grade deployments and support for the Qdrant vector database, including advanced security, high availability, SLAs, and integration services for large‑scale AI search and recommendation use cases. (Read more)
Enterprisevector databaseservices - QdrantCloud - QdrantCloud is the managed cloud version of Qdrant, a vector database tailored for AI-powered similarity search and matching. (Read more)
Managed Servicevector databaseSimilarity SearchAi - Vectara - Vectara is a commercial vector database and search platform that enables semantic and hybrid AI-powered search using vector embeddings. (Read more)
CommercialVector SearchSemantic SearchAi - vector-admin - A universal tool suite for managing vector databases such as Pinecone, Chroma, Qdrant, and Weaviate. Facilitates straightforward management and integration of multiple vector database systems. (Read more)
Managementtoolsvector databasesIntegration
- Deep Learning for Search - Applied book on using deep learning for search, including dense vector representations, semantic search, and neural ranking, all directly relevant to building applications on top of vector databases. (Read more)
Semantic SearchMachine Learningresources - Foundations of Multidimensional and Metric Data Structures - Technical book covering theory and practice of multidimensional and metric data structures for similarity search, forming a theoretical basis for index structures used in vector databases. (Read more)
Similarity Searchmetric spaceData Structure - K-means Tree - K-means Tree is a clustering-based data structure that organizes high-dimensional vectors for fast similarity search and retrieval. It is used as an indexing method in some vector databases to optimize performance for vector search operations. (Read more)
ClusteringData StructureSimilarity SearchHigh Dimensional - Locality-Sensitive Hashing - Locality-Sensitive Hashing (LSH) is an algorithmic technique for approximate nearest neighbor search in high-dimensional vector spaces, commonly used in vector databases to speed up similarity search while reducing memory footprint. (Read more)
AnnSimilarity SearchHigh DimensionalOptimization - M-tree - M-tree is a dynamic index structure for organizing and searching large data sets in metric spaces, enabling efficient nearest neighbor queries and dynamic updates, which are important features for vector databases handling high-dimensional vectors. (Read more)
Data Structuremetric spacenearest neighborDynamic Updates - Machine Learning Crash Course: Embeddings - Module of Google’s Machine Learning Crash Course that explains word and text embeddings, how they are obtained, and the difference between static and contextual embeddings, giving essential background for using vector representations in vector databases and similarity search systems. (Read more)
EmbeddingMachine LearningLearning - Online Product Quantization (O-PQ) - Online Product Quantization (O-PQ) is a variant of product quantization designed to support dynamic or streaming data. It enables adaptive updating of quantization codebooks and codes in real-time, making it suitable for vector databases that handle evolving datasets.
Anndynamic dataVector SearchReal Time - Optimized Product Quantization (OPQ) - Optimized Product Quantization (OPQ) enhances Product Quantization by optimizing space decomposition and codebooks, leading to lower quantization distortion and higher accuracy in vector search. OPQ is widely used in advanced vector databases for improving recall and search quality.
AnnOptimizationVector Searchaccuracy - PQ (Product Quantization) - Product Quantization is a compression and indexing technique for vector search that splits vectors into subspaces and quantizes each part separately, allowing vector databases to store large-scale embeddings compactly while supporting efficient ANN search. (Read more)
QuantizationAnnvector compression - R-tree - R-tree is a tree data structure widely used for indexing multi-dimensional information such as vectors, supporting efficient spatial queries like nearest neighbor and range queries, which are essential in vector databases. (Read more)
Data Structurespatial indexingVector Searchnearest neighbor - Spectral Hashing - Spectral Hashing is a method for approximate nearest neighbor search that uses spectral graph theory to generate compact binary codes, often applied in vector databases to enhance retrieval efficiency on large-scale, high-dimensional data.
AnnSimilarity SearchCompressionOptimization - Vector Database - A vector database is a specialized database designed to store, index, and retrieve unstructured data represented as high-dimensional vectors, enabling efficient semantic search, similarity search, and powering applications such as LLM long-term memory, semantic search, and recommendation systems. (Read more)
vector databasesdefinitionSemantic SearchSimilarity Search
- Airbyte Milvus Connector - The Airbyte Milvus connector lets users sync data from various Airbyte-supported sources into Milvus as a destination, enabling low-code vector data ingestion pipelines. (Read more)
Integrationmigrationvector data - Attu - Attu is a graphical user interface (GUI) tool for managing and administering Milvus vector databases. It simplifies tasks such as data exploration, schema management, and monitoring, making Milvus more accessible for a wide range of users. (Read more)
GuiManagementMilvusOpen Source - Birdwatcher - Birdwatcher is a system debugging tool designed for the Milvus vector database. It provides advanced diagnostics to help developers and operators understand and troubleshoot Milvus deployments, ensuring robust vector search operations. (Read more)
debuggingMilvusManagementOpen Source - Kafka Connect Milvus Connector - The Kafka Connect Milvus Connector is a plugin for Kafka Connect that streams data into and out of Milvus, supporting real-time vector data ingestion pipelines. (Read more)
IntegrationReal Timevector data - Milvus Backup Tool - Milvus Backup Tool provides backup and restore functionalities for Milvus vector databases, ensuring data safety and disaster recovery capabilities. Also referred to as Milvus Backup. (Read more)
MilvusBackuprestoreDisaster Recovery - Milvus CDC - Milvus CDC (Change Data Capture) is a component of the Milvus ecosystem that enables data synchronization between Milvus and other systems. It is useful for maintaining up-to-date vector data pipelines and supporting real-time vector search applications. (Read more)
Milvusdata synchronizationReal Timevector databases - Milvus Connectors - Milvus Connectors, such as the Spark-Milvus Connector, enable seamless integration of Milvus vector databases with third-party tools like Apache Spark for machine learning and data processing workflows. (Read more)
MilvusIntegrationMachine LearningApache Spark - Milvus Destination for Fivetran - The Milvus destination in Fivetran enables automated ELT pipelines that load data into Milvus as a vector database, supporting AI and similarity search workloads. (Read more)
IntegrationEtlvector data - MindsDB Milvus Integration - MindsDB provides an integration with Milvus, enabling users to connect and manage vector data using SQL-like queries. This integration brings federated AI query capabilities across structured and unstructured data with Milvus as the vector database backend. (Read more)
MilvusIntegrationAiSql - Spark-Milvus Connector - The Spark-Milvus Connector is an integration that allows Apache Spark jobs to read from and write to Milvus, enabling scalable ETL and analytics workflows for vector data. (Read more)
IntegrationApache Sparkvector data - Vector Transport Service (VTS) - Vector Transport Service (VTS) is a tool for transporting vector data efficiently between Milvus clusters or environments, supporting large-scale data migration and synchronization. Vector Transmission Services (VTS) are tools for transferring data between Milvus and various data sources (like Zilliz clusters, Elasticsearch, Postgres/PgVector, or other Milvus instances), facilitating vector data migration and integration. (Read more)
vector datamigrationIntegrationMilvus - VTS (Vector Transfer Service) - VTS is a data migration and connector service for Milvus that simplifies moving and synchronizing vector data between Milvus instances and external systems. (Read more)
migrationdata synchronizationMilvus
- Dify - Open-source LLM app development platform with an intuitive interface that combines AI workflow, RAG pipeline, agent capabilities, model management, and observability features for rapid prototyping and production deployment. (Read more)
Open SourceRagAi Agents - Embedchain - Open Source RAG Framework designed to be 'Conventional but Configurable', streamlining the creation of RAG applications with efficient data management, embeddings generation, and vector storage. (Read more)
RagOpen SourcePython - FlashRAG - Python toolkit for efficient RAG research providing 36 pre-processed benchmark datasets and 23 state-of-the-art RAG algorithms in a unified, modular framework for reproduction and development. (Read more)
RagOpen SourcePython - h2oGPT - Apache 2.0 open-source project for querying and summarizing documents or chatting with local private GPT LLMs. Supports Ollama, Mixtral, llama.cpp with persistent databases (Chroma, Weaviate, FAISS) and accurate embeddings. (Read more)
Open SourcePrivacyLocal Llm - LightRAG - Simple and efficient retrieval-augmented generation framework that combines document retrieval with generation, focusing on speed and ease of use. Designed to run on standard CPUs and laptops with minimal resource requirements. (Read more)
RagLightweightOpen Source - LLMWare - Retrieval-augmented generation framework that utilizes small, specialized models instead of large language models, significantly reducing computational and financial costs while offering cost-effective RAG solutions that can run on standard hardware. (Read more)
RagCost EffectiveOpen Source - NVIDIA NeMo Retriever - Collection of industry-leading Nemotron RAG models delivering 50% better accuracy, 15x faster multimodal PDF extraction, and 35x better storage efficiency for building enterprise-grade retrieval-augmented generation pipelines. (Read more)
RagMultimodalMicroservices - Pathway - Python ETL framework for stream processing and real-time analytics with built-in vector search capabilities. Features real-time document synchronization, in-memory vector index, and adaptive RAG technology for always-current AI applications. (Read more)
Real TimeStreamingRag - PrivateGPT - Production-ready AI project for private, local document Q&A using RAG. 100% private with no data leaving your environment, supporting offline operation with local LLMs and vector databases. (Read more)
PrivacyLocalRag - RAGatouille - Python library designed to simplify the integration and training of state-of-the-art late-interaction retrieval methods, particularly ColBERT, within RAG pipelines with a modular and user-friendly interface. (Read more)
RagColbertRetrieval - RAGFlow - Open-source RAG engine based on deep document understanding with citation-backed responses. RAGFlow extracts tables, images, and structured data from complex documents, providing truthful question-answering with well-founded citations. (Read more)
Open SourceRagDocument Understanding - Vercel AI SDK - Free open-source TypeScript toolkit for building AI-powered applications with a unified API supporting 15+ providers including OpenAI, Anthropic, Google, and more. Created by the makers of Next.js for seamless AI integration. (Read more)
TypescriptApiMulti Provider
- LlamaIndex - LlamaIndex is a data framework for large language model (LLM) applications, providing tools to ingest, structure, and access private or domain-specific data, often integrating with vector databases for retrieval augmented generation (RAG). (Read more)
LlmRagframework
- ARES - RAG evaluation framework that trains lightweight judges for retrieval and generation scoring, refining evaluation by training specialized LLM judges on synthetic datasets to provide more reliable, confidence-aware judgments. (Read more)
EvaluationRagOpen Source - Arize Phoenix - Open-source LLM tracing and evaluation solution built on OpenTelemetry for RAG evaluation. Provides automated instrumentation which records the execution path of LLM requests through multiple steps. (Read more)
ObservabilityEvaluationOpentelemetry - AWQ - Activation-aware Weight Quantization method that preserves model accuracy at 4-bit quantization by identifying and skipping important weights. Maintains 99%+ of original performance with moderate inference speed improvements. (Read more)
QuantizationOptimizationCompression - Cohere Rerank - Proprietary neural network reranker accessed via API that processes query and document together as a cross-encoder to precisely judge relevance. Supports over 100 languages with Rerank 3 Nimble variant for faster production performance. (Read more)
RerankingApiMultilingual - DeepEval - Simple open-source LLM evaluation framework similar to Pytest for unit testing LLM outputs. Provides 14+ targeted metrics for RAG and fine-tuning scenarios including hallucination, faithfulness, and contextual relevancy. (Read more)
EvaluationTestingRag - Docling - Open-source document parsing framework from IBM with 97.9% accuracy in complex table extraction and excellent text fidelity. Self-hostable solution for converting PDFs, spreadsheets, and scanned images into structured data for RAG pipelines. (Read more)
Document ParsingOpen SourceRag - Feder - Visualization tool for ANNS (Approximate Nearest Neighbor Search) algorithms enabling users to observe index structures, parameter configurations, and the complete vector similarity search process. (Read more)
VisualizationAnnHnsw - FiftyOne - Computer vision interface for vector search with native integrations for Qdrant, Pinecone, LanceDB, and Milvus. Enables natural language search, configurable vector database backends, and visualization of search matches across billions of images. (Read more)
Computer VisionVisualizationVector Search - GGUF - GPT-Generated Unified Format for storing quantized model weights, designed for CPU inference and consumer hardware. Enables running LLMs on laptops and edge devices with flexible layer offloading to GPU. (Read more)
QuantizationCpuFormat - GPTQ - Post-training quantization method for 4-bit weight compression that focuses on GPU inference performance. First quantization method to compress LLMs to 4-bit range while maintaining accuracy, minimizing mean squared error to weights. (Read more)
QuantizationCompressionOptimization - Helicone - Open-source observability layer designed to help developers monitor and understand how their applications interact with large language models. Acts as a lightweight proxy between applications and LLM providers. (Read more)
ObservabilityMonitoringOpen Source - LangSmith - LangChain's observability platform for monitoring, debugging, and evaluating LLM applications. Automatically traces every LLM call, captures prompts and outputs, tracks costs and latency, and enables systematic evaluation through dataset-based testing. (Read more)
ObservabilityMonitoringLangchain - LiteLLM - Open-source proxy and SDK that provides a single unified API to call and manage hundreds of different LLM providers and models with OpenAI-compatible endpoints. Simplifies multi-provider LLM integration. (Read more)
Open SourceApiLlm - llamafile - Single-file executable that bundles LLM weights and llama.cpp runtime. Distribute and run LLMs locally with no installation, including embedding generation via built-in server. (Read more)
Local LlmSingle FileEmbeddings - LlamaParse - High-performance document parsing service by LlamaIndex that consistently processes documents in about 6 seconds regardless of size. Returns rich Markdown and optional HTML tables with wide format support through hosted API. (Read more)
Document ParsingApiRag - Milvus WebUI - Built-in GUI introduced in Milvus v2.5 for system observability, offering real-time monitoring of system health, collection management, and query optimization from a unified dashboard. (Read more)
VisualizationMonitoringMilvus - Nomic Atlas - AI-ready data visualization platform for massive datasets of embeddings. Atlas enables interactive exploration of millions of vectors in your web browser, with automatic dimensionality reduction and semantic clustering. (Read more)
VisualizationEmbeddingsAnalytics - Portkey - AI gateway that provides a unified interface to interact with 250+ AI models, offering advanced tools for control, visibility, and security in Generative AI applications. Integrates with vector databases for production-level routing and reliability. (Read more)
Ai GatewayObservabilityLlm - Recursive Character Text Splitter - Document chunking strategy that splits text at hierarchical boundaries like paragraphs, sentences, or headings. Industry-standard approach recommended as starting point with 400-512 tokens and 10-20% overlap for optimal RAG performance. (Read more)
ChunkingText ProcessingRag - Semantic Chunker - Document chunking strategy that dynamically chooses split points between sentences based on embedding similarity rather than fixed sizes. Maintains semantic coherence by grouping related content together for improved RAG retrieval. (Read more)
ChunkingSemantic SearchEmbeddings - TruLens - Open-source solution for evaluating and tracing AI Agents and RAG applications using feedback functions to programmatically evaluate components of execution flow. Features the RAG Triad metrics for comprehensive evaluation. (Read more)
EvaluationRagObservability - Unstructured - Document parsing platform delivering strong content fidelity and precision with low hallucination rates. Achieves 100% accuracy on simple tables and 75% on complex structures with comprehensive enterprise document support. (Read more)
Document ParsingEnterpriseRag - VectorDBZ - Enterprise-grade desktop application for managing and analyzing vector databases with interactive visualizations, supporting Qdrant, Weaviate, Milvus, ChromaDB, Pinecone, pgvector, and Elasticsearch. (Read more)
VisualizationManagementGui - Zep - Context engineering and agent memory platform for AI agents with sub-200ms latency. Zep uses a temporal knowledge graph architecture to deliver relationship-aware context from chat history, business data, documents, and app events. (Read more)
Ai AgentsKnowledge GraphMemory
- Apache Kvrocks - Distributed key-value NoSQL database with experimental vector similarity search. Redis-compatible with RocksDB storage engine, adding HNSW-based vector indexing for large-scale vector data management. (Read more)
Redis CompatibleDistributedVector Search - Deep Lake 4.0 (Activeloop) - Multimodal AI database for vectors, images, texts, videos, and more. Features index-on-the-lake technology for sub-second queries from object storage with 10x cost efficiency and 2x faster performance. (Read more)
MultimodalData LakeCost Efficient - Memgraph - In-memory graph database with native vector search capabilities powered by USearch. Combines vector embeddings with knowledge graphs for GraphRAG, enabling semantic similarity search alongside graph traversal. (Read more)
Graph DatabaseVector SearchIn Memory
- AnythingLLM - AnythingLLM is an open-source AI application that integrates with vector databases to facilitate storage and retrieval of embeddings, supporting various AI and LLM workflows. (Read more)
Open SourceAiLlmvector database - Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data that is commonly used to facilitate efficient integration between vector databases and machine learning frameworks. It provides a standardized format for data exchange that is useful for storing and querying high-dimensional vectors in AI applications. (Read more)
Open SourceIn MemoryData IntegrationAi - arroy - Arroy is an open-source library for efficient similarity search and management of vector embeddings, useful in vector database systems. (Read more)
Open Sourcevector embeddingsSimilarity SearchVector Search - Awesome-Moviate - Awesome-Moviate is a movie search and recommendation engine demo that combines BM25 keyword search, semantic vector search, and hybrid search using Weaviate as the underlying vector database, serving as a practical example of hybrid retrieval for media content. (Read more)
Hybrid SearchexamplesOpen Source - Bleve - Bleve is an open-source search library with experimental support for vector search, enabling hybrid search and retrieval in applications. (Read more)
Open Sourcesearch libraryHybrid SearchVector Search - Crate - Crate is an open-source distributed SQL database with support for vector data types and vector search, suitable for AI-driven applications. (Read more)
Open SourceDistributedSqlVector Search - cuVS - cuVS is an open-source library from RAPIDS for fast, GPU-accelerated vector search, useful for building high-performance vector databases. (Read more)
Open SourceGpu AccelerationVector SearchHigh Performance - DocArray - An open-source library for creating, storing, and searching multimodal data and vector embeddings, supporting AI and ML workflows. (Read more)
Open SourceMultimodalvector embeddingsAi - Epsilla - Epsilla is an open-source vector database optimized for high-performance similarity search and scalable storage of vector embeddings. (Read more)
Open Sourcevector databaseSimilarity SearchScalable - frugal - A platform focused on transforming AI/ML operations with transparency, control, and cost optimization, including support for vector database tasks. (Read more)
Open SourceAiMLvector database - Havenask - Havenask is an open-source distributed search engine with support for vector search, designed for large-scale AI and search applications. (Read more)
Open SourceDistributedVector SearchAi - Healthsearch Demo - Healthsearch is an open-source demo application that uses Weaviate as a vector database to retrieve supplement products based on user-written reviews and queries, illustrating real-world semantic product search over vector embeddings. (Read more)
Semantic SearchexamplesOpen Source - HelixDB - HelixDB is a powerful, open-source graph-vector database built in Rust, designed for intelligent data storage for Retrieval-Augmented Generation (RAG) and AI applications. It combines graph database features with vector search, making it directly relevant to AI and machine learning workflows that require vector data management. (Read more)
Open SourceGraph DatabaseVector SearchRagRust - HVS (Hierarchical Graph Structure) - HVS is a graph-based index structure leveraging Voronoi diagrams for approximate nearest neighbor search in high-dimensional vector spaces. It is directly relevant to vector databases as it provides efficient similarity search capabilities for large-scale vector data. (Read more)
Open SourceAnnGraph DatabaseSimilarity Search - InfluxDB - InfluxDB 3 OSS provides high-performance time series workloads with new support for vector data, making it suitable for AI/ML and vector search applications. Relevant as a vector-capable database. (Read more)
Open Sourcevector dataTime SeriesVector Search - Jina - Jina is an open-source neural search framework that delivers cloud-native neural and vector search solutions powered by deep learning for AI applications. It is also known as Jina Search, designed for building search systems powered by vector databases, making it highly relevant for applications involving AI, semantic search, and vector data management. (Read more)
Open SourceNeural SearchVector SearchCloud Native - KGraph - KGraph is an open-source library for fast approximate nearest neighbor search in high-dimensional vector spaces, applicable to vector database solutions. (Read more)
Open SourceAnnSimilarity SearchVector Search - langchain4j - langchain4j is an open-source framework for developing LLM-powered Java applications, with built-in support for integrating vector databases as memory stores. (Read more)
Open SourceLlmJavavector database - llm-app - llm-app is an open-source project that provides an AI application framework with integrated support for vector databases, enabling the development of LLM-powered solutions. (Read more)
Open SourceAiLlmvector database - MeiliSearch - MeiliSearch is an open-source, fast, and relevant search engine that supports vector search capabilities, making it suitable for AI applications requiring vector database functionality. (Read more)
Open SourceSearch EngineVector SearchAi - mem0 - mem0 is an open-source vector database focused on efficient storage and retrieval of high-dimensional embeddings for large-scale AI applications. (Read more)
Open Sourcevector databaseHigh DimensionalAi - MRPT - MRPT (Multi-Resolution Proximity Trees) is an open-source library for fast approximate nearest neighbor search in high-dimensional vector spaces, applicable to vector database backends. (Read more)
Open SourceAnnHigh DimensionalVector Search - MuopDB - MuopDB is an open-source vector database designed for fast and scalable similarity search in AI applications. (Read more)
Open Sourcevector databaseSimilarity SearchScalable - nanopq - nanopq is a lightweight product quantization library for efficient vector compression and similarity search, which is an important feature for vector databases that need to store and query large-scale vector data efficiently. (Read more)
Open SourceQuantizationvector compressionSimilarity Search - NGT - NGT (Neighborhood Graph and Tree) is an open-source vector search engine designed for fast and scalable approximate nearest neighbor search. (Read more)
Open SourceVector SearchAnnScalable - OasysDB - OasysDB is an open-source vector database focused on efficient similarity search and management of high-dimensional data. (Read more)
Open Sourcevector databaseSimilarity SearchHigh Dimensional - puck - Puck is an open-source vector search engine designed for fast similarity search and retrieval of embedding vectors. (Read more)
Open SourceVector SearchSimilarity SearchEmbedding - RAFT - RAFT is a suite of GPU-accelerated libraries for data science, including support for vector search and similarity operations, often used in vector database scenarios. (Read more)
Open SourceGpu AccelerationVector Searchdata science - reor - reor is an open-source vector database solution focused on fast and scalable storage of high-dimensional vectors for AI and ML applications. (Read more)
Open Sourcevector databaseScalableAi - sqlite-vec - sqlite-vec is an open-source extension for SQLite that adds vector data types and similarity search, enabling lightweight vector database capabilities. (Read more)
Open SourceSqlitevector dataSimilarity Search - Valkey - Valkey is an open-source in-memory key-value data store that supports vector search operations, making it useful for AI and machine learning vector database workloads. It is also a specialized open-source vector database designed for efficient management and retrieval of high-dimensional vector data, offering advanced APIs and optimized storage for AI workloads. (Read more)
Open SourceVector SearchIn MemoryAi
- CockroachDB - CockroachDB is a cloud-native, distributed SQL database that now supports vector data, combining traditional SQL queries with efficient vector search capabilities, ensuring data resilience, availability, scalability, and strong consistency. (Read more)
Sqlvector dataDistributed - PostgreSQL - A powerful, open-source relational database that can be extended with modules like pgvector to support efficient storage and similarity search of vector embeddings, effectively functioning as a vector database. (Read more)
Open Sourcerelational databasePgvector
- A Brief Survey of Vector Databases - This survey paper provides an overview of the landscape, technologies, and applications of vector databases, making it a valuable resource for understanding the field.
vector databasesSurveyApplicationstechnologies - A Comprehensive Survey on Vector Database - A comprehensive academic survey that explores the architecture, storage, retrieval techniques, and challenges associated with vector databases. It categorizes algorithmic approaches to approximate nearest neighbor search (ANNS) and discusses how vector databases can be integrated with large language models, offering valuable insights and foundational knowledge for understanding and building vector database systems. (Read more)
vector databasesSurveyANNSArchitecture - ACL 2023 Tutorial: Retrieval-Based Language Models and Applications - This ACL 2023 tutorial reviews retrieval-based language models, which often rely on vector databases and vector search systems to retrieve relevant context. The tutorial covers methods and applications central to the use of vector databases in modern NLP systems. (Read more)
TutorialsRetrievalvector databasesApplications - ACORN - ACORN is a performant and predicate-agnostic search system for vector embeddings and structured data, enhancing the capability of vector databases to handle complex queries over high-dimensional data efficiently. (Read more)
vector embeddingssearch systempredicate-agnosticResearch - Adanns - Adanns is a framework for adaptive semantic search, focusing on efficient and scalable similarity search in high-dimensional vector spaces. Its relevance to 'Awesome Vector Databases' lies in its support for advanced vector search techniques suitable for AI and machine learning applications. (Read more)
Semantic SearchSimilarity SearchAiMachine LearningResearch - AiSAQ - AiSAQ is an all-in-storage approximate nearest neighbor search system that uses product quantization to enable DRAM-free vector similarity search, serving as a specialized vector search/indexing approach for large-scale information retrieval. (Read more)
AnnSimilarity Searchvector indexing - BANG - BANG is a billion-scale approximate nearest neighbor search system optimized for single GPU execution, enabling high-performance vector search in vector database environments at massive scale. (Read more)
AnnGpu AccelerationHigh PerformanceVector SearchResearch - Cagra - Cagra provides highly parallel graph construction and approximate nearest neighbor search for GPUs, supporting large-scale vector database operations and efficient similarity search. (Read more)
graph constructionAnnGpu AccelerationSimilarity SearchResearch - DET-LSH - DET-LSH is a locality-sensitive hashing scheme that introduces a dynamic encoding tree structure to accelerate approximate nearest neighbor (ANN) search in high-dimensional spaces. While it is a research algorithm rather than a production database, it directly targets the core operation behind vector databases—efficient ANN search over vector embeddings—and is relevant for designing or optimizing vector indexing components within vector database systems. (Read more)
AnnHashingHigh Dimensional - Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs - This paper introduces the HNSW algorithm, which is widely adopted in vector databases and search engines for its efficient and robust performance on high-dimensional data. HNSW is foundational in powering modern vector search systems. (Read more)
HnswAnnVector SearchResearch - Efficient Locality Sensitive Hashing - This work by Jingfan Meng is a comprehensive research thesis on efficient locality-sensitive hashing (LSH), covering algorithmic solutions, core primitives, and applications for approximate nearest neighbor search. It is relevant to vector databases because LSH-based indexing is a foundational technique for scalable similarity search over high-dimensional vectors, informing the design of vector indexes, retrieval engines, and similarity search modules in modern vector database systems. (Read more)
AnnSimilarity SearchHashing - Graph-based Methods - A category of vector database solutions and algorithms leveraging graph-based approaches for efficient similarity search and vector indexing, which are core to many vector database implementations in AI applications. (Read more)
Graph DatabaseSimilarity Searchvector indexingAi - GTS - GTS is a GPU-based tree index for fast similarity search over high-dimensional vector data, providing an efficient ANN index structure that can be integrated into or used to build high-performance vector database systems. (Read more)
Similarity SearchAnnGpu Acceleration - LANNS: a web-scale approximate nearest neighbor lookup system - A scalable system for approximate nearest neighbor search at web-scale, relevant for implementing and understanding vector database infrastructure for high-dimensional data. (Read more)
AnnScalabilityVector SearchResearch - Li, Wen, et al. "Approximate nearest neighbor search on high dimensional data—experiments, analyses, and improvement." - An influential paper analyzing and improving approximate nearest neighbor search methods for high-dimensional data, highly relevant for developing and understanding vector databases.
AnnHigh DimensionalVector SearchResearch - Maze - Maze is a web-scale video deduplication system that relies on large-scale approximate nearest neighbor vector search over video embeddings to detect and remove duplicate or near-duplicate videos efficiently. While not a general-purpose vector database, it represents a specialized, production-scale application of vector search infrastructure for multimedia content management. (Read more)
AnnApplicationsMultimodal - OneSparse: A Unified System for Multi-index Vector Search - A unified system designed for efficient multi-index vector search, directly addressing large-scale vector database performance and scalability challenges. (Read more)
Vector SearchPerformanceScalabilityResearch - SOAR - SOAR is a set of improved algorithms on top of ScaNN that accelerate vector search by introducing controlled redundancy and multi-cluster assignment, enabling faster approximate nearest neighbor retrieval with smaller indexes in large‑scale vector databases and search systems. (Read more)
AnnVector SearchOptimization - Starling - Starling is an I/O-efficient, disk-resident graph index framework tailored for high-dimensional vector similarity search on large data segments, supporting the scalable storage and retrieval needs of vector databases. (Read more)
graph indexSimilarity SearchScalableResearch - Towards Reliable Vector Database Management Systems: A Software Testing Roadmap for 2030 - An academic paper providing a comprehensive overview of the architecture, empirical defects, and future research roadmap for Vector Database Management Systems (VDBMS). This resource is directly relevant for understanding the current state and challenges in building and testing reliable vector databases. (Read more)
vector databasesTestingroadmapreliability - VDBMS Architecture Overview - An overview of the architectural components common to Vector Database Management Systems (VDBMS), which are designed to efficiently store, index, and query high-dimensional vector embeddings. This provides foundational knowledge for anyone interested in the internal workings of vector databases. (Read more)
ResearchArchitecturevector databasesHigh Dimensional - VDBMS Testing Research Roadmap Paper - A research paper that proposes the first structured roadmap for testing Vector Database Management Systems (VDBMS), analyzing bugs, vulnerabilities, and test challenges unique to vector databases. It provides insights and future directions for improving the reliability and robustness of vector databases. (Read more)
ResearchTestingvector databasesroadmap - VDBMS Testing Roadmap - A comprehensive research roadmap addressing the unique challenges of testing vector database management systems (VDBMS), including approaches for test input generation, oracle definition, and test evaluation tailored to vector databases. The work highlights the complexities of high-dimensional vector data, approximate search semantics, and integration with AI/LLM pipelines, making it a valuable resource for advancing reliability and trustworthiness in vector databases. (Read more)
vector databasesTestingroadmapAi - Vector Database Group @ NTU - A research group focused on advancing the theory and practice of vector databases, providing resources, publications, and tools related to vector database technology. (Read more)
Researchvector databasesresourcesAi - Vector database management systems: Fundamental concepts, use-cases, and current challenges - A comprehensive research paper outlining the fundamental concepts, practical use-cases, and current challenges in the field of vector database management systems.
vector databasesUse CaseschallengesSurvey
- BatANN - Distributed disk-based approximate nearest neighbor system achieving near-linear throughput scaling. Delivers 6.21-6.49x throughput improvement over scatter-gather baseline with sub-6ms latency on 10 servers. (Read more)
AnnDistributedResearch - FreshDiskANN - Fast and accurate graph-based ANN index for streaming similarity search, enabling real-time updates on billion-point indexes using a single machine with real-time freshness. (Read more)
AnnGraph BasedDynamic Updates - LoRANN - Low-Rank Matrix Factorization algorithm for Approximate Nearest Neighbor Search, offering competitive performance with faster query times than leading libraries at various recall levels. (Read more)
AnnAlgorithmOptimization - MCGI - Manifold-Consistent Graph Indexing for billion-scale disk-resident vector search. Leverages Local Intrinsic Dimensionality to achieve 5.8x throughput improvement over DiskANN on high-dimensional datasets. (Read more)
AnnResearchDisk Based - PECANN - Parallel Efficient Clustering with graph-based Approximate Nearest Neighbor search, providing efficient clustering algorithms optimized for high-dimensional vector spaces. (Read more)
AnnClusteringParallel - SLIM (Sparsified Late Interaction Multi-Vector Retrieval) - Efficient multi-vector retrieval system using sparsified late interaction with inverted indexes. Achieves 40% less storage and 83% lower latency than ColBERT-v2 while maintaining competitive accuracy. (Read more)
RetrievalResearchSparse - SPFresh - Incremental in-place update system for billion-scale vector search from Microsoft Research. Maintains 2.41x lower P99.9 latency than baselines while supporting efficient vector updates with minimal resource overhead. (Read more)
AnnResearchDynamic Updates
- chromem-go - Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence, designed for simplicity and performance in Go applications. (Read more)
GolangEmbeddedIn Memory - Dense Passage Retrieval (DPR) - Set of tools and models from Meta AI Research for open domain Q&A using dense representations, outperforming BM25 by 9%-19% in passage retrieval accuracy with a dual-encoder BERT framework. (Read more)
RetrievalOpen SourceNlp - FlagEmbedding - Open-source retrieval and RAG framework from BAAI featuring the BGE embedding model series. BGE-M3 supports multi-functionality (dense, sparse, multi-vector), multi-linguality (100+ languages), and multi-granularity (up to 8192 tokens). (Read more)
Open SourceEmbeddingsMultilingual - FLANN - Fast Library for Approximate Nearest Neighbors containing a collection of algorithms optimized for nearest neighbor search in high dimensional spaces with automatic algorithm and parameter selection. (Read more)
AnnOpen SourceCpp - FlashRank - Ultra-lite and super-fast Python reranking library based on SoTA cross-encoders and LLMs, running on CPU with the tiniest reranking model in the world at ~4MB with no PyTorch dependency. (Read more)
RerankingLightweightOpen Source - Graphiti - Open-source framework for building temporally-aware knowledge graphs that power AI agent memory. Graphiti tracks when facts were true and maintains historical context, combining semantic search with graph traversal. (Read more)
Open SourceKnowledge GraphTemporal - hnswlib-node - Node.js bindings for HNSWlib implementing approximate nearest-neighbor search. Provides fast HNSW-based vector similarity search for JavaScript/TypeScript applications with file persistence support. (Read more)
NodejsJavascriptHnsw - Ollama Embeddings - Local embedding generation through Ollama supporting models like nomic-embed-text and mxbai-embed-large. Enables completely offline embeddings with no subscription fees or API costs, ideal for privacy-focused RAG applications. (Read more)
EmbeddingsLocalPrivacy - RAPIDS cuVS - GPU-accelerated vector search library from NVIDIA providing approximate nearest neighbors and clustering algorithms with up to 12x faster index builds and 4.7x lower search latency through GPU parallelization. (Read more)
Gpu AccelerationNvidiaPerformance
- Cloaked AI - Application-layer encryption solution for securing vector embeddings and enabling searchable/queryable encryption in vector databases, protecting AI data without compromising search functionality. (Read more)
EncryptionSecurityPrivacy
- Privacera AI Governance (PAIG) - Privacera AI Governance (PAIG) is a solution designed to secure and govern AI data, including safeguarding vector databases and embeddings, ensuring data privacy and compliance for AI applications. (Read more)
data governanceSecuritycompliance
- Please give us ⭐ on Github, it helps!
All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship.
This directory may include content generated by artificial intelligence (AI). While efforts have been made to ensure the accuracy and reliability of the information, we make no representations or warranties of any kind, express or implied, about the completeness, accuracy, reliability, suitability, or availability of the information contained herein. Users are advised to independently verify the information before making decisions based on it.
We disclaim any responsibility for errors, omissions, or inaccuracies in the content, whether generated by humans, AI, or any other means. By using this directory, you agree to use it at your own risk and acknowledge that the information provided may not always be current or accurate.
If you believe that your intellectual property rights or other legal rights have been infringed, please contact us immediately at legal@ever.co and we will take appropriate action.
This work is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License.
