Fix cache offset casting with low precision policies #299

jonatanklosko · 2023-12-08T16:28:23Z

Currently running bf16 llama with long left-padded sequences produces wrong results. This is because we have an integer cache.offset and casting to bf16 causes overflows for large values.

Axon used to cast integers according to model policy, but that's no longer the case as of elixir-nx/axon#547, so we can keep offset as an integer and it's all good.

Fix cache offset casting with low precision policies

f37e6aa

jonatanklosko merged commit e726e26 into main Dec 8, 2023
2 checks passed

jonatanklosko deleted the jk-cache-types branch December 8, 2023 16:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix cache offset casting with low precision policies #299

Fix cache offset casting with low precision policies #299

jonatanklosko commented Dec 8, 2023 •

edited

Loading

Fix cache offset casting with low precision policies #299

Fix cache offset casting with low precision policies #299

Conversation

jonatanklosko commented Dec 8, 2023 • edited Loading

jonatanklosko commented Dec 8, 2023 •

edited

Loading