
mem: reduce PaddleOCR rec_batch_num from 6 to 1#4295

Merged
KRRT7 merged 4 commits into Unstructured-IO:main from KRRT7:mem/paddle-rec-batch-num
Mar 27, 2026

Conversation


@KRRT7 KRRT7 commented Mar 24, 2026

Reduce PaddleOCR rec_batch_num from the default of 6 to 1. Paddle's native inference engine allocates 500 MiB memory-arena chunks during text recognition, in proportion to the recognition batch size: with rec_batch_num=6, four chunks are allocated; setting it to 1 reduces this to a single chunk.
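The change itself is a one-line knob on the PaddleOCR constructor, which accepts `rec_batch_num` at construction time. A hedged sketch follows; the exact call site inside unstructured's OCRAgentPaddle may differ, and the `lang` key here is illustrative rather than taken from the PR diff:

```python
# Sketch of the setting this PR changes. PaddleOCR accepts rec_batch_num
# as a constructor keyword; "lang" is illustrative, not from the PR diff.
paddle_kwargs = {
    "lang": "en",
    "rec_batch_num": 1,  # was the library default of 6
}

# Constructing the agent requires paddlepaddle + unstructured-paddleocr:
# from paddleocr import PaddleOCR
# ocr = PaddleOCR(**paddle_kwargs)

print(paddle_kwargs["rec_batch_num"])  # 1
```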

Benchmark

| Setting | Peak memory |
| --- | --- |
| rec_batch_num=6 | 7,184 MiB |
| rec_batch_num=1 | 2,684 MiB |
| Delta | -4,500 MiB (-62.6%) |

Measured with memray run on layout-parser-paper-with-table.pdf through partition() with hi_res + PaddleOCR table OCR. On CPU, batch processing doesn't parallelize — it's sequential within predictor.run(). Smaller batches just allocate less workspace memory.
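To make the CPU tradeoff concrete, here is a small standalone sketch (plain Python, no Paddle required). The 500 MiB chunk size comes from the description above, and the 55-region count from the latency benchmark below; the linear workspace model is a simplifying assumption, since the PR actually observes four chunks (not six) at batch size 6:

```python
import math


def recognition_plan(num_regions: int, rec_batch_num: int, chunk_mib: int = 500):
    """Sketch: on CPU, recognition runs batch-by-batch, so total work is
    the same for any batch size; only the per-batch workspace changes."""
    # Number of sequential predictor.run() calls.
    num_batches = math.ceil(num_regions / rec_batch_num)
    # Workspace scales with regions in flight at once, not total batches
    # (simplifying linear assumption; real arena growth is sublinear).
    workspace_mib = rec_batch_num * chunk_mib
    return num_batches, workspace_mib


# 55 text regions, as in the latency benchmark below
print(recognition_plan(55, rec_batch_num=6))  # (10, 3000)
print(recognition_plan(55, rec_batch_num=1))  # (55, 500)
```

Either way the engine walks every region sequentially, which is why shrinking the batch cuts peak memory without a throughput regression.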

Reproduce

Requires unstructured[pdf], paddlepaddle, unstructured-paddleocr, and memray.

```shell
cat > /tmp/bench_paddle.py << 'SCRIPT'
from unstructured.partition.auto import partition
elements = partition(
    filename="example-docs/layout-parser-paper.pdf",
    strategy="hi_res",
    pdf_infer_table_structure=True,
    ocr_agent="unstructured.partition.utils.ocr_models.paddle_ocr.OCRAgentPaddle",
)
print(f"Partitioned: {len(elements)} elements")
SCRIPT

# Baseline (main branch, rec_batch_num=6):
git checkout main
memray run --native --trace-python-allocators -o /tmp/paddle_baseline.bin /tmp/bench_paddle.py
memray stats /tmp/paddle_baseline.bin | grep "Peak memory"

# With this change (rec_batch_num=1):
git checkout mem/paddle-rec-batch-num
memray run --native --trace-python-allocators -o /tmp/paddle_opt.bin /tmp/bench_paddle.py
memray stats /tmp/paddle_opt.bin | grep "Peak memory"
```

KRRT7 added 3 commits March 27, 2026 13:41
Paddle's native inference engine allocates 500 MiB memory arena chunks
during text recognition, proportional to batch size. With the default
rec_batch_num=6, four 500 MiB chunks are allocated simultaneously.

Setting rec_batch_num=1 reduces this to a single chunk, cutting peak
memory on the PaddleOCR code path by ~1,265 MiB (-42.6%).

Latency benchmark (55 text regions, CPU, 5 runs):
- rec_batch_num=6: 39.1s +/- 3.5s
- rec_batch_num=1: 37.0s +/- 2.0s
No throughput regression — on CPU, batch processing is sequential.
@KRRT7 KRRT7 force-pushed the mem/paddle-rec-batch-num branch from a91d36a to 5df21bd Compare March 27, 2026 18:44
@KRRT7 KRRT7 added this pull request to the merge queue Mar 27, 2026
Merged via the queue into Unstructured-IO:main with commit 47f4728 Mar 27, 2026
52 checks passed