Chroma
Free tierOpen-source search infrastructure for AI — vector, full-text, regex, and metadata search at scale
Free tier available·All audiences·API available·Open source
Key strengths
Multi-modal search: vector, full-text (BM25/SPLADE), regex, and metadata in one systemBuilt on object storage (S3/GCS) — up to 10x cheaper than memory-based alternativesServerless and zero-ops — auto-scales with no manual tuning requiredApache 2.0 open-source with 27k GitHub stars and 15M+ monthly downloadsEnterprise-grade: SOC 2 Type II, BYOC, multi-region replication, customer-managed encryption keys
Free tier + paid plans
Self-hostable
No ratings yet
- RAG (Retrieval-Augmented Generation) pipelines: Use Chroma as the retrieval backend, combining dense vector KNN with sparse BM25 re-ranking for high-recall, high-precision document retrieval
- Multi-tenant SaaS search: Leverage Chroma's 1M collections-per-database architecture to provide isolated, scalable search indexes per customer
- Hybrid search systems: Combine vector, sparse (SPLADE/BM25), full-text (trigram/regex), and metadata filters in a single query for complex retrieval strategies
- Embedding evaluation & iteration: Use collection forking (copy-on-write) to A/B test embedding models or chunking strategies without data duplication overhead
- MCP-based agent tooling: Integrate Chroma's Package Search MCP to allow agents to query thousands of open-source repositories programmatically
- Low-latency production search: Deploy on Chroma Cloud or BYOC enterprise with p50 20ms warm query latency, auto-scaling, and 0-ops management
