Skip to content

Power10: Replace vector pair loads with __builtin_vsx_lxvp#5827

Merged
martin-frbg merged 1 commit into
OpenMathLib:developfrom
amritahs-ibm:use_lxvp_builtins
Jun 9, 2026
Merged

Power10: Replace vector pair loads with __builtin_vsx_lxvp#5827
martin-frbg merged 1 commit into
OpenMathLib:developfrom
amritahs-ibm:use_lxvp_builtins

Conversation

@amritahs-ibm

Copy link
Copy Markdown
Contributor

Replace normal vector pair pointer dereferences with the optimized __builtin_vsx_lxvp builtin across DGEMM, ZGEMM, and DGEMV kernels.

Also done some identation corrections in dgemm_kernel_power10.c. This is done as part of POWER code cleanup and may not have any performance impact.

Replace normal vector pair pointer dereferences with the optimized
__builtin_vsx_lxvp builtin across DGEMM, ZGEMM, and DGEMV kernels.

Also done some identation corrections in dgemm_kernel_power10.c.
This is done as part of POWER code cleanup and may not have any
performance impact.

Signed-off-by: Amrita H S <amritahs@linux.vnet.ibm.com>
@amritahs-ibm amritahs-ibm changed the title Power10: Replace vector pair loads with __builtin_vsx_lxvp DRAFT PR: Power10: Replace vector pair loads with __builtin_vsx_lxvp Jun 4, 2026
@amritahs-ibm amritahs-ibm changed the title DRAFT PR: Power10: Replace vector pair loads with __builtin_vsx_lxvp Power10: Replace vector pair loads with __builtin_vsx_lxvp Jun 9, 2026
@amritahs-ibm

Copy link
Copy Markdown
Contributor Author

I have done functional and performance verification on a Power10 machine. There are no unit testcase failures and there is no performance regression.

@martin-frbg martin-frbg added this to the 0.3.34 milestone Jun 9, 2026
@martin-frbg

Copy link
Copy Markdown
Collaborator

Thank you

@martin-frbg martin-frbg merged commit 901c214 into OpenMathLib:develop Jun 9, 2026
98 of 100 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants