Support fake-quantized linear with fp32 bias #154

Merged
mhs4670go merged 1 commit into Samsung:main from jinevening:fq_linear_fp32_bias on Jun 18, 2025

Conversation

@jinevening (Contributor) commented Jun 17, 2025

This supports fake-quantized linear with fp32 bias.

TICO-DCO-1.0-Signed-off-by: Hyukjin Jeong hj1.jeong@samsung.com


Related to: #149, #165
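For context, a common convention in fake-quantized (QDQ) models is to keep the linear bias in fp32 and fold it into an int32 tensor at conversion time, using bias_scale = input_scale * weight_scale. The sketch below illustrates only that general convention; the function name and per-channel layout are assumptions, not TICO's actual code.

```python
import torch

def quantize_fp32_bias(bias: torch.Tensor,
                       input_scale: float,
                       weight_scale: torch.Tensor) -> torch.Tensor:
    """Illustrative only: quantize an fp32 bias to int32 per output channel
    with bias_scale = input_scale * weight_scale (a common QDQ convention)."""
    assert bias.dtype == torch.float32  # mirrors the fp32-only guard in this PR
    bias_scale = input_scale * weight_scale   # per-output-channel scales
    q = torch.round(bias / bias_scale)        # snap to the integer grid
    info = torch.iinfo(torch.int32)
    return q.clamp(info.min, info.max).to(torch.int32)

bias = torch.tensor([0.5, -1.25, 2.0])
q_bias = quantize_fp32_bias(bias, input_scale=0.02,
                            weight_scale=torch.tensor([0.1, 0.1, 0.05]))
```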

return (torch.randn(3, 3), None)


@test_without_inference
@jinevening (Contributor, Author) replied:

I added this tag. A test with this tag stops before inference.


# TODO Support more ops.

graph.eliminate_dead_code()
@jinevening (Contributor, Author) commented:

TODO We need to eliminate lifted constant tensor placeholder.
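For reference, `torch.fx.Graph.eliminate_dead_code()` removes unused intermediate nodes but leaves placeholders in place, which is presumably why lifted constant tensor placeholders need the separate handling the TODO mentions. A minimal runnable illustration:

```python
import torch
import torch.fx

def f(x):
    unused = x + 1   # dead: the result is never used
    return x * 2

gm = torch.fx.symbolic_trace(f)
before = len(gm.graph.nodes)       # placeholder, add, mul, output
gm.graph.eliminate_dead_code()     # drops the unused add node
gm.recompile()
after = len(gm.graph.nodes)        # placeholder, mul, output
```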

Commit: "Support fake-quantized linear with fp32 bias" (TICO-DCO-1.0-Signed-off-by: Hyukjin Jeong <hj1.jeong@samsung.com>)

@jinevening force-pushed the fq_linear_fp32_bias branch from e87b415 to d9c147c on June 17, 2025 at 07:27
Comment on lines +68 to +69
if bias_val.dtype != torch.float32:
continue
@dayo09 (Contributor) commented Jun 17, 2025:

Just wondering: is this bias quantization not also required for the torch.float64 case?

@jinevening (Contributor, Author) replied:

I haven't seen an fp64 bias so far. Most quantization frameworks use fp32 by default and partially support fp16. We could support another type when the need arises.

@mhs4670go (Contributor) reviewed:

LGTM

@dayo09 (Contributor) reviewed:

LGTM

@mhs4670go mhs4670go merged commit d06871a into Samsung:main Jun 18, 2025
5 checks passed
@jinevening jinevening deleted the fq_linear_fp32_bias branch June 18, 2025 02:43
