Falcon OCR: The 300M-Parameter Model Beating Gemini at Document Processing (2026)
TL;DR
- Falcon OCR is a 300M parameter open-source model from TII (Abu Dhabi), released March 2026
- Scores 80.3% on olmOCR benchmark — edging out Gemini 3 Pro (80.2%) with a fraction of the parameters
- Highest throughput among open-source OCR models — built for speed at scale
- Weaknesses: old scanned documents and very small text
- Free to use via Hugging Face; can run locally or on cloud
A 300 million parameter open-source model just beat Gemini 3 Pro at document processing. Falcon OCR, released by the Technology Innovation Institute (TII) in Abu Dhabi in March 2026, is one of the most surprising benchmark results of the year: a compact, efficient model outscoring commercial giants that cost orders of magnitude more to run.
If you work with invoices, contracts, PDFs, research papers, or any document at scale, Falcon OCR deserves serious attention.
What Is Falcon OCR?
Falcon OCR is a natively multimodal autoregressive Transformer model from TII's Falcon Vision Team. Unlike traditional OCR systems that pipeline separate text-detection and language models, Falcon OCR uses early fusion architecture — it processes images and text together from the first layer, allowing richer contextual understanding of document structure.
Key specs:
- Parameters: 300 million
- Architecture: Early-fusion natively multimodal autoregressive Transformer
- Developer: Technology Innovation Institute (TII), Abu Dhabi
- Release: March 2026
- License: Open source (Hugging Face)
Benchmark Results
| Model | olmOCR Score | OmniDocBench | Params |
|---|---|---|---|
| Falcon OCR (TII) | 80.3% | 88.64 | 300M |
| Gemini 3 Pro | 80.2% | — | Large (closed) |
| PaddleOCR VL 1.5 | 79.3% | — | — |
| DeepSeek OCR v2 | 78.8% | — | — |
| GPT 5.2 | 69.8% | — | Large (closed) |
The headline result: Falcon OCR scores 80.3% on olmOCR while using dramatically fewer parameters than any competing model. On OmniDocBench, which tests full-page document parsing including tables and mixed layouts, it scores 88.64.
Throughput is Falcon OCR's other advantage. TII claims it delivers the highest throughput among open-source OCR models — critical for enterprise document processing pipelines where speed is as important as accuracy.
Where Falcon OCR Excels
- Standard printed documents — invoices, contracts, reports, forms
- Tables and structured data — strong OmniDocBench score reflects this
- High-volume pipelines — highest throughput in the open-source category
- Mixed text/image layouts — early fusion handles these better than pipelined approaches
- Cost-sensitive deployments — 300M params means cheap inference
Where Falcon OCR Struggles
- OldScan (43.5%) — scanned historical documents with degraded quality. For archival work, Mistral OCR 3 or Chandra performs better.
- TinyText (78.5%) — very small printed text (labels, footnotes, fine print). Larger models handle this better.
How to Run Falcon OCR
Falcon OCR is available on Hugging Face under the tiiuae/falcon-ocr repository. You can run it locally with a standard GPU (the 300M parameter size is very manageable) or deploy it via a cloud inference provider.
Basic Python usage:
from transformers import AutoProcessor, AutoModelForVision2Seq
from PIL import Image
processor = AutoProcessor.from_pretrained("tiiuae/falcon-ocr")
model = AutoModelForVision2Seq.from_pretrained("tiiuae/falcon-ocr")
image = Image.open("invoice.png")
inputs = processor(images=image, return_tensors="pt")
outputs = model.generate(**inputs)
text = processor.batch_decode(outputs, skip_special_tokens=True)[0]
print(text)For production deployments, TII recommends batching images to maximize throughput. The model supports batch sizes up to 32 on a single A100 GPU.
Falcon OCR vs Adobe Acrobat AI: When to Use Which
Adobe Acrobat AI and enterprise OCR tools like ABBYY FineReader are still relevant for workflows that need GUI editing, PDF modification, or compliance features. Falcon OCR is the right choice when you need:
- High-volume batch processing (thousands of documents per day)
- Custom integration into your own pipeline
- Cost control — no per-document API pricing
- Data privacy — run on your own infrastructure, no data sent to third parties
For point-and-click document editing, stick with Adobe. For programmatic document extraction at scale, Falcon OCR is now the default open-source choice.
What This Means for the AI Landscape
Falcon OCR is another data point in the "small models catching up" trend. A 300M parameter model matching Gemini 3 Pro on a specialized task signals that domain-specific fine-tuning is increasingly competitive with general-purpose frontier models.
For enterprises, this is good news: you don't need to pay frontier model API costs for every task. Specialized open-source models can handle high-volume document processing cheaper, faster, and with better data privacy than calling a commercial API per document.
Tools like Happycapy let you build document processing workflows without coding — upload batches of PDFs and extract structured data automatically. For developers who want direct model access, Falcon OCR is the current best open-source option.
Frequently Asked Questions
What is Falcon OCR and who made it?
Falcon OCR is a 300 million parameter open-source document processing model from the Technology Innovation Institute (TII) in Abu Dhabi, released March 2026. It uses early fusion architecture to process images and text together.
How does Falcon OCR compare to Gemini on benchmarks?
On olmOCR, Falcon OCR scores 80.3% vs Gemini 3 Pro's 80.2%. It also beats DeepSeek OCR v2 (78.8%) and GPT 5.2 (69.8%) with 300M parameters — a fraction of the size of any competing model.
Is Falcon OCR free to use?
Yes. Falcon OCR is open-source and free. Weights and code are on Hugging Face and can run locally or on cloud infrastructure.
What are the limitations of Falcon OCR?
Weaknesses: old scanned documents (43.5% on OldScan) and very small text (78.5% on TinyText). For archival or fine-print OCR, Mistral OCR 3 or Chandra performs better.
Sources
- TII Falcon Vision Team — Falcon OCR release (March 2026), Hugging Face
- olmOCR Benchmark — olmresearch.org
- OmniDocBench — GitHub/omni-doc-bench
- ScienceDaily: "Compact AI models reach frontier performance" (April 2026)