Launching “Will It Extract?” – Agentic Document Extraction on Messy Docs

LandingAI

Making computer vision easy and accessible for everyone

Published Sep 9, 2025

Welcome to the latest edition of the Landing AI newsletter! 👋

In the world of document understanding, the smell test can be more honest than a suite of metrics. Such a suite of metrics or benchmarks built for someone else’s dataset and success criteria will likely be inconsequential for you. In fact, there isn’t a single, widely adopted benchmark that covers the full spread of IDP work. So, what can you do then?

Eyeball your hardest, most diverse documents instead of trusting task-specific leaderboards like FUNSD (forms), SROIE (receipts), DocVQA (question answering), or DocILE (key-info localization and line items).

💡That’s why we’re kicking off a new series: Will It Extract? Does the name ring a bell?

Article content — The famous Blendtec's Will It Blend series with Tom Dickson

Think of the internet classic “Will It Blend?” from Blendtec—but for documents. The idea is simple. Bring in documents with all those messy layouts, complex structures, odd formats and run them through ADE. And get to know the answer for the billion dollar question - Will it extract? That’s it. No heavy benchmarking, no perfect lab conditions. Just real documents meeting real tech. Let’s dive in.

The complex document of the Day

Quite complex even for a human reader, isn’t it? Not uncommon, either. You see formats like this all the time in insurance.

Let’s see how our favorite LLMs handle this doc:

Grok Auto

Didn't feel like spending 300$ 😝 but I couldn't stop myself from trying a $30 subscription to get Grok 4 Expert model. Now let’s see the output when we make it think hard.

Grok 4

It still misses box 9 🙁

Now, ChatGPT’s turn.

GPT 5

This is hilarious? Makes me believe that VLMs are blind lol. My deep dive with ChatGPT if you are interested to dig into why it stumbles and how it turns around when we feed it the JSON straight from the ADE: https://chatgpt.com/share/68a7348b-da50-8012-99df-ce2054b9d9bf

How about making GPT-5 think harder as well!

GPT-5 Thinking

Again, chat link if you are interested: https://chatgpt.com/share/68a734ce-2b34-8012-b1d1-6aae8e48faee

It took a while, but I’m glad ChatGPT eventually figured it out. It was funny though, to wait and see it "Thinking" so hard:

So how about we use ADE now?

Agentic Document Extraction

ADE parses the same document in just a few seconds (typically under 8).

Let me show you both ways: first using the playground, and then with our Python library via the quick start script:

Code snippet for your reference:

from agentic_doc.parse import parse
import json

result = parse("accident_insurance.png")

# Save markdown (works as-is because it's a str)
print("Extracted Markdown:")
print(result[0].markdown)

# 🔧 Convert chunks to JSON-serializable dicts, then dump
chunks_jsonable = [
    (c.model_dump(mode="json") if hasattr(c, "model_dump")
     else c.dict() if hasattr(c, "dict")
     else c.__dict__)
    for c in result[0].chunks
]

with open("extracted_chunks.json", "w", encoding="utf-8") as f:
    json.dump(chunks_jsonable, f, indent=2, ensure_ascii=False)

Conclusion

In conclusion, I'll repeat the same mantra and will encourage you to throw your most complex documents at ADE.

Eyeball your hardest, most diverse documents instead of trusting task-specific leaderboards.

Next Steps

Join our ADE Discord community and participate in weekly AMAs
Reach out to me directly if you have any question

Read full article here.

Join Our Weekly Webinar Series on ADE! 🚀

We’re launching a weekly webinar series on Agentic Document Extraction (ADE) — live, developer-focused sessions. See how ADE is transforming the way enterprises process complex documents.

What you’ll discover:

What makes ADE different from OCR and LLM-only approaches
How developers can process hundreds to thousands of pages per minute
Real-world case study: ADE in action powering efficiency and productivity
Best practices to embed ADE into your workflows via APIs

No manual templates. No endless fine-tuning. Just agentic, layout-aware, high-accuracy extraction.

Next session:

Date: Wednesday, September 10, 2025
Time: 9:00AM PT

👉 Register here: https://us02web.zoom.us/webinar/register/WN_nuxftvPcQ6-guAKF6ivXhg#/registration

LinkedIn respects your privacy

Launching “Will It Extract?” – Agentic Document Extraction on Messy Docs

LandingAI

Making computer vision easy and accessible for everyone

The complex document of the Day

Grok Auto

Grok 4

GPT 5

GPT-5 Thinking

Agentic Document Extraction

Conclusion

Next Steps

Join Our Weekly Webinar Series on ADE! 🚀

Visual AI Spotlight

41,994 followers

More articles by LandingAI

Explore content categories

The complex document of the Day

Grok Auto

Grok 4

GPT 5

GPT-5 Thinking

Agentic Document Extraction

Conclusion

Next Steps

Join Our Weekly Webinar Series on ADE! 🚀

Visual AI Spotlight

41,994 followers

More articles by LandingAI

Introducing Parse Jobs API for ADE: The Heavy-Duty API for Large Files

ADE DPT-2: A Major Step Forward in Document Intelligence

Eolas Medical Enhances Clinical Knowledge Access with Agentic Document Extraction

Auto-Fill Job Applications with Agentic Document Extraction

From CCTV to Insights — How Snowflake Customers Can Build Visual AI

Performance Benchmark: LandingLens Vision Model Improves Retinopathy Classification

How XBuild’s AI-Powered Visual Inspection Cuts Construction Bid Time in Half

“What’s in Your Fridge?” – Build a Practical Computer Vision Application with VisionAgent

Unlock Text Recognition: A Guide to LandingAI’s OCR Model on Docker

6 Visual AI Use Cases for Utilities from Easy to Advanced

Explore content categories