- cross-posted to:
- technology@lemmy.ml
- cross-posted to:
- technology@lemmy.ml
You must log in or register to comment.
But does it need Flash Attention, Sage Attention, or Scaled dot product Attention?
But does it need Flash Attention, Sage Attention, or Scaled dot product Attention?