
[WIP] Add gfx1100 support to AMD pytorch build #2642

Draft
wants to merge 1 commit into base: main

Conversation

cazlo

@cazlo cazlo commented Oct 13, 2024

What does this PR do?

Fixes #2641

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Reference

@mht-sharma
Collaborator

Hi @cazlo, thanks for the PR. FYI, you would also need to modify the vLLM and flash-attention (CK) builds.

I have not looked into the support for these GPUs in Composable Kernel and vLLM yet, but let me know if you face any issues.

@lhl

lhl commented Dec 10, 2024

CK (and hence ROCm/flash-attention) does not support gfx1100, so it's probably best to disable it and set ROCM_USE_FLASH_ATTN_V2_TRITON=1. Not quite sure what vllm is used for, but to use Triton FA there, you need to set VLLM_USE_TRITON_FLASH_ATTN=1. Also, for PyTorch you may need TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1.

If you're looking to successfully build vLLM for ROCm, see: https://github.com/vllm-project/vllm/blob/main/Dockerfile.rocm
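For reference, the flags mentioned above could be collected into the build/runtime environment roughly like this. This is only a sketch based on the comment above, assuming a gfx1100 (RDNA3) setup where the CK flash-attention path is unavailable; whether each flag is needed depends on your ROCm, PyTorch, and vLLM versions.

```shell
# Hypothetical environment for running on gfx1100 without CK flash-attention.
# Flag names come from the discussion above; values are not verified for every setup.

# Use the Triton flash-attention v2 kernel instead of the (unsupported) CK one
export ROCM_USE_FLASH_ATTN_V2_TRITON=1

# Tell vLLM to use its Triton flash-attention backend
export VLLM_USE_TRITON_FLASH_ATTN=1

# Enable PyTorch's experimental AOTriton SDPA support on this GPU family
export TORCH_ROCM_AOTRITON_ENABLE_EXPERIMENTAL=1
```

In a Docker-based build these would typically become `ENV` lines in the ROCm Dockerfile rather than runtime exports.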

Development

Successfully merging this pull request may close these issues.

Add AMD gfx110* support
3 participants