Isn’t __shfl_down not recommended these days because of warp synchronization iss... | Hacker News

Hacker Timesnew | past | comments | ask | show | jobs | submit

saagarjha on Dec 15, 2024 | parent | context | favorite | on: Fast LLM Inference From Scratch (using CUDA)

Isn’t __shfl_down not recommended these days because of warp synchronization issues?

reasonableklout on Dec 16, 2024 [–]

Oops, you're right and it's a difference between my blog post and source code. It should be __shfl_down_sync as seen [here](https://github.com/andrewkchan/yalm/blob/8c908f23f5d8cc3f14c...)

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact