Differentiate between conversions where document_timeout is reached versus those for which individual pages failed #3205

@simonschoe

Bug

Previously, ConversionStatus.PARTIAL_SUCCESS was only emitted when document_timeout was reached. Since #2939 was merged, ConversionStatus.PARTIAL_SUCCESS is also assigned when individual pages fail.

As a result, a downstream consumer cannot tell from the conversion result alone whether only individual pages failed (which might be okay) or the timeout was reached and the last X pages are missing (which might need to be handled differently).
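
To illustrate the ambiguity, here is a minimal sketch of the kind of guesswork a consumer is left with today. It does not use any Docling API: `MissingPagePattern` and `classify_missing_pages` are hypothetical names, and the sketch assumes the consumer can already obtain the total page count and the set of 1-based page numbers that converted successfully.

```python
from enum import Enum


class MissingPagePattern(Enum):
    """Hypothetical labels for how pages are missing (not part of Docling)."""

    NONE = "none"            # every page converted
    TRAILING = "trailing"    # one contiguous run of missing pages at the end
    SCATTERED = "scattered"  # any other pattern of missing pages


def classify_missing_pages(page_count: int, converted_pages: set[int]) -> MissingPagePattern:
    """Guess why a conversion is partial from which pages are missing.

    page_count: total number of pages in the input, numbered from 1 (assumption).
    converted_pages: page numbers that converted successfully (assumption).
    """
    missing = [p for p in range(1, page_count + 1) if p not in converted_pages]
    if not missing:
        return MissingPagePattern.NONE
    # A single contiguous run that ends at the last page is what a timeout
    # would typically leave behind.
    if missing == list(range(missing[0], page_count + 1)):
        return MissingPagePattern.TRAILING
    return MissingPagePattern.SCATTERED
```

This heuristic cannot be made reliable: a document whose final page failed on its own is indistinguishable from one that timed out just before the last page, which is why a distinct signal in the conversion result would be needed.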

Docling version

docling                                      2.82.0
docling-core                                 2.70.2
docling-ibm-models                           3.13.0
docling-parse                                5.6.1
