Add architecture during device linking #554

Open
rzambre wants to merge 1 commit into deepseek-ai:main from rzambre:rzambre/fix-dlinking

rzambre commented Jan 3, 2026

Without this fix, compiling DeepEP for a specific architecture produces the following nvlink warnings, and test_low_latency.py then fails with runtime errors:

nvlink warning : SM Arch ('sm_75') not found in 'DeepEP/build/temp.linux-x86_64-cpython-312/csrc/kernels/internode.o'
nvlink warning : SM Arch ('sm_75') not found in 'DeepEP/build/temp.linux-x86_64-cpython-312/csrc/kernels/internode_ll.o'
nvlink warning : SM Arch ('sm_75') not found in 'DeepEP/build/temp.linux-x86_64-cpython-312/csrc/kernels/intranode.o'
nvlink warning : SM Arch ('sm_75') not found in 'DeepEP/build/temp.linux-x86_64-cpython-312/csrc/kernels/layout.o'
nvlink warning : SM Arch ('sm_75') not found in 'DeepEP/build/temp.linux-x86_64-cpython-312/csrc/kernels/runtime.o'
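
The warnings above suggest that the device-link step was not given the architecture the object files were compiled for. As a minimal sketch only (not the change made in this PR), the helper below shows one way to derive nvcc `-gencode` flags from a `TORCH_CUDA_ARCH_LIST`-style string. The function name `dlink_arch_flags` and the input format are assumptions made for illustration.

```python
import re


def dlink_arch_flags(arch_list: str) -> list[str]:
    """Turn a TORCH_CUDA_ARCH_LIST-style string into nvcc -gencode flags.

    Hypothetical helper for illustration; not taken from DeepEP's setup.py.
    """
    flags = []
    for entry in re.split(r"[;\s]+", arch_list.strip()):
        if not entry:
            continue
        # Drop an optional "+PTX" suffix; this sketch emits SASS targets only.
        version = entry.replace("+PTX", "")
        major, _, minor = version.partition(".")
        sm = f"{major}{minor}"  # "10.3" -> "103", "9.0a" -> "90a"
        flags.append(f"-gencode=arch=compute_{sm},code=sm_{sm}")
    return flags


print(dlink_arch_flags("10.3"))
```

Assuming the build passes device-link options through the `nvcc_dlink` entry of `extra_compile_args`, which `torch.utils.cpp_extension` supports, flags like these could be appended to that list so that compilation and device linking target the same architecture.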

Before this PR:

> cuobjdump -lelf DeepEP/build/lib.linux-x86_64-cpython-312/deep_ep_cpp.cpython-312-x86_64-linux-gnu.so 2>&1 | grep "sm_"
ELF file    1: deep_ep_cpp.cpython-312-x86_64-linux-gnu.1.sm_75.cubin

After this PR:

> cuobjdump -lelf DeepEP/build/lib.linux-x86_64-cpython-312/deep_ep_cpp.cpython-312-x86_64-linux-gnu.so 2>&1 | grep "sm_"
ELF file    1: deep_ep_cpp.cpython-312-x86_64-linux-gnu.1.sm_103.cubin
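
The before/after check above can also be scripted. The sketch below extracts the `sm_*` architectures from `cuobjdump -lelf` output like the lines shown here; the function name `cubin_archs` is hypothetical, and the regular expression assumes the `<name>.<index>.sm_XX.cubin` naming seen in this output.

```python
import re


def cubin_archs(cuobjdump_output: str) -> set[str]:
    """Collect the sm_* architectures named in `cuobjdump -lelf` output.

    Hypothetical helper; assumes entries end in ".sm_XX.cubin".
    """
    return set(re.findall(r"\.(sm_\d+[a-z]?)\.cubin", cuobjdump_output))


# Sample lines copied from the output in this PR description.
before = "ELF file    1: deep_ep_cpp.cpython-312-x86_64-linux-gnu.1.sm_75.cubin"
after = "ELF file    1: deep_ep_cpp.cpython-312-x86_64-linux-gnu.1.sm_103.cubin"

print(cubin_archs(before), cubin_archs(after))
```

A build script or CI step could compare the returned set against the requested architecture and fail early instead of surfacing the mismatch at runtime.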

Signed-off-by: Rohit Zambre <rzambre@nvidia.com>