LlamaIndex logo

LlamaIndex

Free tier

Document OCR & AI agents that turn complex documents into structured, LLM-ready outputs in seconds

Free tier available·All audiences·Powered by Multiple (VLM-powered, proprietary agents)·API available·Open source

Key strengths

Industry-leading document parsing across 50+ file types including handwriting, tables, and chartsVLM-powered agentic OCR with auto-correction loops for high accuracy on messy or complex documentsOpen-source LiteParse for fully local, cloud-free document processingEnterprise-grade security with HIPAA, GDPR, and SOC2 compliance plus VPC deploymentEnd-to-end document agent workflows: parse, extract, classify, split, index, and retrieve
Free tier + paid plans
San Francisco, USA
Founded 2022
Self-hostable
No ratings yet

LlamaIndex's core product, LlamaParse, leverages Vision Language Models (VLMs) and task-specific agentic pipelines to deliver layout-aware document parsing. It routes document content — text, tables, charts, handwriting — to specialized expert agents, then applies recursive auto-correction loops to maximize accuracy. The platform exposes APIs for parse, extract (schema-based LLM extraction), split, classify, index, and retrieval operations, making it suitable for building full RAG pipelines and multi-step document agents. An open-source sibling, LiteParse (npm install @llamaindex/liteparse), enables fast local parsing with no cloud dependencies or LLM token usage, outputting bounding-box-annotated structured text from PDFs, Office docs, and images.