Add support for PaddleOCRv5 models for character recognition. by tjanczak · Pull Request #747 · open-edge-platform/dlstreamer

tjanczak · 2026-04-02T15:28:14Z

Description

Add converters for PaddleOCR v5 recognition models; add support to download models from HuggingFace repository + load vocabulary form HF config file.

Fixes # (issue)

Any Newly Introduced Dependencies

N/A.

How Has This Been Tested?

Validated locally; to be added to CI tests as part of new sample.

Checklist:

I agree to use the MIT license for my code changes.
I have not introduced any 3rd party components incompatible with MIT.
I have not included any company confidential information, trade secret, password or security token.
I have performed a self-review of my code.

src/monolithic/gst/inference_elements/common/post_processor/converters/to_tensor/paddle_ocr.cpp

oonyshch · 2026-04-02T18:53:54Z

src/monolithic/gst/inference_elements/common/post_processor/converters/to_tensor/paddle_ocr.cpp

+        double exp_sum = 0.0;
+        for (size_t v = 0; v < vocab_size; ++v)
+            exp_sum += std::exp(static_cast<double>(row[v] - row_max));
+        log_conf_sum += std::log(1.0 / exp_sum + 1e-10);


I see this line computes log(1/exp_sum + 1e-10) as a log-softmax approximation. the 1e-10 is added after the division to prevent log(0), but this is numerically odd: standard log-softmax would be log_max - log(exp_sum), not this form.
please check for correctness

this is to avoid divide by zero error, changed to explicit check if not zero

oonyshch · 2026-04-02T18:58:30Z

samples/download_public_models.sh

+
+export_ppocr_v5_model() {
+  local MODEL_NAME=$1
+  MODEL_DIR="$MODELS_PATH/public/$MODEL_NAME"


MODEL_DIR should be declared as local
the other variables in this function (MODEL_NAME, DST_FILE1, DST_FILE2) all use local, but MODEL_DIR is in the parent scope

oonyshch · 2026-04-02T18:59:25Z

src/monolithic/gst/inference_elements/common/post_processor/converters/to_tensor/paddle_ocr.cpp

+
+            // Output shape: [batch_size, seq_len, vocab_size]
+            const auto &dims = blob->GetDims();
+            const size_t vocab_size = (dims.size() == 3) ? dims[2] : 0;


these lines could set vocab_size and seq_len to 0 if the tensor rank is wrong, but the subsequent check at line 224 produces a misleading error message "Unexpected vocabulary size".
It would be clearer to add an explicit tensor rank check, that would give a clear error when the model output shape is wrong, rather than defaulting and failing later

oonyshch

So far everything that I've not mentioned in comments LGTM

Add support for PaddleOCRv5 models for character recognition.

5395c3d

tjanczak requested review from BaoHuiling, OskarFiedot, ZiningLi, dmichalo, jmotow, marcin-wadolkowski, mholowni, msmiatac, nszczygl9, oonyshch, pbartosik, qianlongding, tbujewsk, walidbarakat, yangjianfeng1208 and yunowo as code owners April 2, 2026 15:28

jmotow approved these changes Apr 2, 2026

View reviewed changes

oonyshch reviewed Apr 2, 2026

View reviewed changes

src/monolithic/gst/inference_elements/common/post_processor/converters/to_tensor/paddle_ocr.cpp Show resolved Hide resolved

oonyshch reviewed Apr 2, 2026

View reviewed changes

src/monolithic/gst/inference_elements/common/post_processor/converters/to_tensor/paddle_ocr.cpp Show resolved Hide resolved

oonyshch reviewed Apr 2, 2026

View reviewed changes

src/monolithic/gst/inference_elements/common/post_processor/converters/to_tensor/paddle_ocr.cpp Show resolved Hide resolved

oonyshch reviewed Apr 2, 2026

View reviewed changes

Merge branch 'main' into paddle_ocr_v5

f0d4424

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for PaddleOCRv5 models for character recognition.#747

Add support for PaddleOCRv5 models for character recognition.#747
tjanczak wants to merge 2 commits intomainfrom
paddle_ocr_v5

tjanczak commented Apr 2, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

oonyshch Apr 2, 2026

Uh oh!

tjanczak Apr 3, 2026

Uh oh!

oonyshch Apr 2, 2026 •

edited

Loading

Uh oh!

oonyshch Apr 2, 2026 •

edited

Loading

Uh oh!

oonyshch left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

tjanczak commented Apr 2, 2026

Description

Any Newly Introduced Dependencies

How Has This Been Tested?

Checklist:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

oonyshch Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

tjanczak Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

oonyshch Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

oonyshch Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

oonyshch left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

oonyshch Apr 2, 2026 •

edited

Loading

oonyshch Apr 2, 2026 •

edited

Loading