Speed up --verify-numerics sample generation for half-precision convs by rkayaith · Pull Request #1314 · iree-org/iree-turbine

rkayaith · 2026-02-27T00:58:39Z

torch.randn on CPU is much slower for half-precision dtypes than on GPU. For a large conv config (convfp16 -n 32 -c 256 -H 100 -W 100 -k 2376 -y 3 -x 3 -p 1 -q 1 -u 1 -v 1 -l 1 -j 1 --in_layout NHWC --fil_layout NHWC --out_layout NHWC -m conv -g 1 -F 4 -t 1), sample generation was taking 18.3s of 25.1s total verification time.

Generate sample data on GPU and transfer to CPU for the reference computation instead of the other way around. Total runtime dropped from 45.9s to 16.2s with --verify-numerics (8.3s without) on a 96-core EPYC 9454.

torch.randn on CPU is much slower for half-precision dtypes than on GPU. Generate sample data on GPU and transfer to CPU for the reference computation instead of the other way around. Tested on convfp16 -n 32 -c 256 -H 100 -W 100 -k 2376 -y 3 -x 3 -F 4 (NHWC, weight backward): total runtime dropped from 45.9s to 16.2s with --verify-numerics (8.3s without). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

rkayaith mentioned this pull request Feb 27, 2026

--verify-numerics is slow for large conv configs nod-ai/amd-shark-ai#2841

Open

rkayaith marked this pull request as ready for review February 27, 2026 01:03

rkayaith requested a review from zjgarvey as a code owner February 27, 2026 01:03

zjgarvey approved these changes Feb 27, 2026

View reviewed changes

rkayaith merged commit b3ddea4 into iree-org:main Feb 27, 2026
8 checks passed

rkayaith deleted the slow-verify-numerics branch February 27, 2026 02:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up --verify-numerics sample generation for half-precision convs#1314

Speed up --verify-numerics sample generation for half-precision convs#1314
rkayaith merged 1 commit intoiree-org:mainfrom
rkayaith:slow-verify-numerics

rkayaith commented Feb 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rkayaith commented Feb 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants