feat: Update Tesseract installation to include all language packs #183
athoik wants to merge 1 commit into docling-project:main
Description:
Replaced specific installation of tesseract-langpack-eng with tesseract-langpack-\* to ensure support for all available OCR languages.
This improves flexibility for multilingual OCR processing without requiring manual additions for each language.
Changes:
Updated os-packages.txt: replaced tesseract-langpack-eng with tesseract-langpack-\*
Ensures all Tesseract language packs are installed via wildcard in dnf
Note:
Wildcard is escaped (\*) to prevent shell expansion and allow dnf to interpret it correctly.
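The effect of the escape can be sketched without dnf at all. The snippet below is a minimal illustration, assuming a POSIX shell and a scratch directory that happens to contain a file matching the pattern (the file name `tesseract-langpack-ell.rpm` is hypothetical). It only shows what the shell passes on to a command when the wildcard is typed directly; how the build reads os-packages.txt is not shown in this PR, so this does not claim to reproduce that step.

```shell
#!/bin/sh
# Sketch: what the shell hands to a command for an unescaped vs. escaped glob.
# Work in a throwaway directory so the demo does not touch real files.
demo_dir=$(mktemp -d)
cd "$demo_dir" || exit 1

# Hypothetical stray file whose name matches the package pattern.
touch tesseract-langpack-ell.rpm

# Unescaped: the shell expands the glob against the current directory.
unescaped=$(echo tesseract-langpack-*)

# Escaped: the shell removes the backslash and passes a literal '*'.
escaped=$(echo tesseract-langpack-\*)

echo "unescaped: $unescaped"
echo "escaped:   $escaped"
```

In the unescaped case a package manager would receive the local file name instead of the pattern; in the escaped case it receives `tesseract-langpack-*` and can match it against package names itself.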
Signed-off-by: Athanasios Oikonomou <athoik@gmail.com>
Merge Protections
Your pull request matches the following merge protections and will not be merged until they are valid.
🟢 Enforce conventional commit
Wonderful, this rule succeeded.
Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
@athoik do you know how much size this adds to the container image?
All the OCR languages will add an extra 685 MB. If that's a problem, I can add only the Greek language, which is the one I am interested in for OCR.
@dolfim-ibm could this be specific to the 2 cpu images?