LlamaIndex logo

LlamaIndex

Free tier

Document OCR & AI agents that turn complex documents into structured, LLM-ready outputs in seconds

Free tier available·All audiences·Powered by Multiple (VLM-powered, proprietary agents)·API available·Open source

Key strengths

Industry-leading document parsing across 50+ file types including handwriting, tables, and chartsVLM-powered agentic OCR with auto-correction loops for high accuracy on messy or complex documentsOpen-source LiteParse for fully local, cloud-free document processingEnterprise-grade security with HIPAA, GDPR, and SOC2 compliance plus VPC deploymentEnd-to-end document agent workflows: parse, extract, classify, split, index, and retrieve
Free tier + paid plans
San Francisco, USA
Founded 2022
Self-hostable
No ratings yet

Developer Setup & API Usage

LiteParse (Open Source — Local)

npm install @llamaindex/liteparse

Run local document parsing with no cloud calls, no LLM tokens, and bounding box output for PDFs, Office docs, and images.

LlamaParse API (Cloud)

  • Authenticate with your API key from the LlamaIndex dashboard.
  • Submit documents via REST API or SDK to the /parse, /extract, /classify, /split, or /index endpoints.
  • Key parameters:
    • file — the document to process (50+ supported formats)
    • extraction_schema — JSON schema for structured LLM-powered extraction (no model training required)
    • split_description — natural-language rule for logical document segmentation
    • classify_rules — natural-language document categorization rules
  • Results include chunked, embedding-ready text optimized for RAG retrieval pipelines.
  • The indexing pipeline handles enterprise-grade chunking and embedding for best-in-class retrieval accuracy.
  • Supports VPC deployment for data residency requirements; HIPAA, GDPR, and SOC2 compliant out of the box.