Fix definition from ENABLE_NO_UNDERSCORE_API CMake config option #41

imciner2 · 2025-08-21T13:22:30Z

The code guarding the use of underscores uses the define BLIS_ENABLE_NO_UNDERSCORE_API, but the CMake config option was actually defining ENABLE_NO_UNDERSCORE_API, so the config option was never propagating into the code. This updates the CMake build system to define the correct thing.

chandrkr · 2025-09-11T14:40:30Z

Hi @imciner2,

Thank you for your interest in contributing to AOCL-BLAS and helping improve the project.
We appreciate the changes you've proposed in this pull request — they are valid and valuable.

However, our CI/CD pipeline has detected a build error on Windows OS related to this PR. We've attached the build log and the steps used during the build process for your reference.

Kindly review the logs and update the same pull request with a new commit to address the issue.
Once again, thank you for your contribution!

build_commands.txt
build_log.txt

* Bug Fixes in FP32 Kernels: - The current implementation lets m=1 tiny cases inside LPGEMV_TINY loop, but the m=1 GEMV kernel call doesn't have the call to GEMV_M_ONE kernels. Added the m=1 path in LPGEMV_TINY loop by handling the pack A/Pack B/reorder B conditions. - Added BF16 support for BIAS, Matrix-Add and Matrix-Mul for AVX512 F32 main and GEMV kernels - Added BF16 Matrix-Add and Matrix-Mul support for AVX512_256 F32 kernels. - Modified the condition check in FP32 Zero point in AVX512 kernels, and fixed few bugs in Col-major Zero point evaluation. AMD Internal: [ CPUPL - 6748 ] * Bug Fixes in FP32 Kernels: - The current implementation lets m=1 tiny cases inside LPGEMV_TINY loop, but doesn't have the call to GEMV_M_ONE kernels. Added the m=1 path in LPGEMV_TINY loop by handling the pack A/Pack B/reorder B conditions. - Added BF16 support for BIAS, Matrix-Add and Matrix-Mul for AVX512 F32 main and GEMV kernels. - Added BF16 Downscale, BIAS, Matrix-Add and Matrix-Mul support in AVX2 GEMV_N and AVX512_256 GEMV kernels. - Added BF16 Matrix-Add and Matrix-Mul support for AVX512_256 F32 kernels. - Modified the condition check in FP32 Zero point in AVX512 kernels, and fixed few bugs in Col-major Zero point evaluation and instruction usage. AMD Internal: [ CPUPL - 6748 ] * Bug Fixes in FP32 Kernels: - The current implementation lets m=1 tiny cases inside LPGEMV_TINY loop, but doesn't have the call to GEMV_M_ONE kernels. Added the m=1 path in LPGEMV_TINY loop by handling the pack A/Pack B/reorder B conditions. - Added BF16 support for BIAS, Matrix-Add and Matrix-Mul for AVX512 F32 main and GEMV kernels. - Added BF16 Downscale, BIAS, Matrix-Add and Matrix-Mul support in AVX2 GEMV_N and AVX512_256 GEMV kernels. - Added BF16 Matrix-Add and Matrix-Mul support for AVX512_256 F32 kernels. - Modified the condition check in FP32 Zero point in AVX512 kernels, and fixed few bugs in Col-major Zero point evaluation and instruction usage. AMD Internal: [ CPUPL - 6748 ] * Bug Fixes in FP32 Kernels: - The current implementation lets m=1 tiny cases inside LPGEMV_TINY loop, but doesn't have the call to GEMV_M_ONE kernels. Added the m=1 path in LPGEMV_TINY loop by handling the pack A/Pack B/reorder B conditions. - Added BF16 support for BIAS, Matrix-Add and Matrix-Mul for AVX512 F32 main and GEMV kernels. - Added BF16 Downscale, BIAS, Matrix-Add and Matrix-Mul support in AVX2 GEMV_N and AVX512_256 GEMV kernels. - Added BF16 Matrix-Add and Matrix-Mul support for AVX512_256 F32 kernels. - Modified the condition check in FP32 Zero point in AVX512 kernels, and fixed few bugs in Col-major Zero point evaluation and instruction usage. AMD Internal: [ CPUPL - 6748 ] --------- Co-authored-by: VarshaV <varshav2@amd.com>

Fix definition from ENABLE_NO_UNDERSCORE_API CMake config option

2d61fb3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix definition from ENABLE_NO_UNDERSCORE_API CMake config option #41

Fix definition from ENABLE_NO_UNDERSCORE_API CMake config option #41

imciner2 commented Aug 21, 2025

Uh oh!

chandrkr commented Sep 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix definition from ENABLE_NO_UNDERSCORE_API CMake config option #41

Are you sure you want to change the base?

Fix definition from ENABLE_NO_UNDERSCORE_API CMake config option #41

Conversation

imciner2 commented Aug 21, 2025

Uh oh!

chandrkr commented Sep 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants