LlamaIndex
Free tierDocument OCR & AI agents that turn complex documents into structured, LLM-ready outputs in seconds
Free tier available·All audiences·Powered by Multiple (VLM-powered, proprietary agents)·API available·Open source
Key strengths
Industry-leading document parsing across 50+ file types including handwriting, tables, and chartsVLM-powered agentic OCR with auto-correction loops for high accuracy on messy or complex documentsOpen-source LiteParse for fully local, cloud-free document processingEnterprise-grade security with HIPAA, GDPR, and SOC2 compliance plus VPC deploymentEnd-to-end document agent workflows: parse, extract, classify, split, index, and retrieve
Free tier + paid plans
San Francisco, USA
Founded 2022
Self-hostable
No ratings yet
Developer Setup & API Usage
LiteParse (Open Source — Local)
npm install @llamaindex/liteparse
Run local document parsing with no cloud calls, no LLM tokens, and bounding box output for PDFs, Office docs, and images.
LlamaParse API (Cloud)
- Authenticate with your API key from the LlamaIndex dashboard.
- Submit documents via REST API or SDK to the
/parse,/extract,/classify,/split, or/indexendpoints. - Key parameters:
file— the document to process (50+ supported formats)extraction_schema— JSON schema for structured LLM-powered extraction (no model training required)split_description— natural-language rule for logical document segmentationclassify_rules— natural-language document categorization rules
- Results include chunked, embedding-ready text optimized for RAG retrieval pipelines.
- The indexing pipeline handles enterprise-grade chunking and embedding for best-in-class retrieval accuracy.
- Supports VPC deployment for data residency requirements; HIPAA, GDPR, and SOC2 compliant out of the box.
