Dark | Light
[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]

![rohanpaul_ai Avatar](https://lunarcrush.com/gi/w:24/cr:twitter::2588345408.png) Rohan Paul [@rohanpaul_ai](/creator/twitter/rohanpaul_ai) on x 73.6K followers
Created: 2025-07-01 14:37:08 UTC

PDF parsing is still painful because LLMs reorder text in complex layouts, break tables across pages, and fail on graphs or images.

đź’ˇTesting the new open-source OCRFlux model, and here the results are really good for a change.

So OCRFlux is a multimodal, LLM based toolkit for converting PDFs and images into clean, readable, plain Markdown text.

Because the underlying VLM is only 3B param, it runs even on a 3090 GPU. The model is available on @huggingface .

The engine that powers the OCRFlux, teaches the model to rebuild every page and then stitch fragments across pages into one clean Markdown file.

It bundles one vision language model with 3B parameters that was fine-tuned from Qwen 2.5-VL-3B-Instruct for both page parsing and cross-page merging.

OCRFlux reads raw page images and, guided by task prompts, outputs Markdown for each page and merges split elements across pages.

The evaluation shows Edit Distance Similarity (EDS) XXXXX and cross‑page table Tree Edit Distance 0.950, so the parser is both accurate and layout aware.

How it works while parsing each page

- Convert into text with a natural reading order, even in the presence of multi-column layouts, figures, and insets
- Support for complicated tables and equations
- Automatically removes headers and footers

Cross-page table/paragraph merging

- Cross-page table merging
- Cross-page paragraph merging

A compact vision‑language models can beat bigger models once cross‑page context is added.

🧵 1/n Read on 👇

![](https://pbs.twimg.com/amplify_video_thumb/1940052456496316416/img/p8FLdsmqqsn5kp_3.jpg)

XXXXXXX engagements

![Engagements Line Chart](https://lunarcrush.com/gi/w:600/p:tweet::1940057084021940543/c:line.svg)

**Related Topics**
[llm](/topic/llm)

[Post Link](https://x.com/rohanpaul_ai/status/1940057084021940543)

[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]

rohanpaul_ai Avatar Rohan Paul @rohanpaul_ai on x 73.6K followers Created: 2025-07-01 14:37:08 UTC

PDF parsing is still painful because LLMs reorder text in complex layouts, break tables across pages, and fail on graphs or images.

đź’ˇTesting the new open-source OCRFlux model, and here the results are really good for a change.

So OCRFlux is a multimodal, LLM based toolkit for converting PDFs and images into clean, readable, plain Markdown text.

Because the underlying VLM is only 3B param, it runs even on a 3090 GPU. The model is available on @huggingface .

The engine that powers the OCRFlux, teaches the model to rebuild every page and then stitch fragments across pages into one clean Markdown file.

It bundles one vision language model with 3B parameters that was fine-tuned from Qwen 2.5-VL-3B-Instruct for both page parsing and cross-page merging.

OCRFlux reads raw page images and, guided by task prompts, outputs Markdown for each page and merges split elements across pages.

The evaluation shows Edit Distance Similarity (EDS) XXXXX and cross‑page table Tree Edit Distance 0.950, so the parser is both accurate and layout aware.

How it works while parsing each page

  • Convert into text with a natural reading order, even in the presence of multi-column layouts, figures, and insets
  • Support for complicated tables and equations
  • Automatically removes headers and footers

Cross-page table/paragraph merging

  • Cross-page table merging
  • Cross-page paragraph merging

A compact vision‑language models can beat bigger models once cross‑page context is added.

🧵 1/n Read on 👇

XXXXXXX engagements

Engagements Line Chart

Related Topics llm

Post Link

post/tweet::1940057084021940543
/post/tweet::1940057084021940543