Skip to content

[A64FX]: add tt for a64fx dot#5542

Merged
martin-frbg merged 1 commit intoOpenMathLib:developfrom
abhishek-iitmadras:abhishek_new_tt_a64fx
Nov 23, 2025
Merged

[A64FX]: add tt for a64fx dot#5542
martin-frbg merged 1 commit intoOpenMathLib:developfrom
abhishek-iitmadras:abhishek_new_tt_a64fx

Conversation

@abhishek-iitmadras
Copy link
Contributor

Performance: We can see upto 2x to 3x perf improvement on sdot and ddot from the range 10000 < size <1000000

Signed-off-by: Abhishek Kumar <abhishek.r.kumar@fujitsu.com>
@martin-frbg martin-frbg added this to the 0.3.31 milestone Nov 23, 2025
@martin-frbg martin-frbg merged commit d6b25c4 into OpenMathLib:develop Nov 23, 2025
97 of 102 checks passed
@abhishek-iitmadras abhishek-iitmadras deleted the abhishek_new_tt_a64fx branch November 24, 2025 01:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants