Pipeline API¶

Main pipeline class for document processing.

Overview¶

The Pipeline class orchestrates the entire 8-stage document processing workflow.

from pathlib import Path
from pipeline import Pipeline

# Create pipeline
pipeline = Pipeline(
    detector_name="doclayout-yolo",
    sorter_name="mineru-xycut",
    backend="gemini",
    model="gemini-2.5-flash"
)

# Process PDF
result = pipeline.process_single_pdf(Path("document.pdf"))

Key Methods¶

`init`¶

Initialize the pipeline with detector, sorter, and recognizer configurations.

`process_single_pdf`¶

Process a single PDF file and return results.

`process_directory`¶

Batch process all PDFs in a directory.

`process_page`¶

Process a single page (used internally).

Full API Reference

Detailed API reference coming soon. See Advanced Examples for usage patterns.

Pipeline API¶

Overview¶

Key Methods¶

__init__¶

process_single_pdf¶

process_directory¶

process_page¶

`init`¶

`process_single_pdf`¶

`process_directory`¶

`process_page`¶