
dllama: src/commands.cpp:102: MultiHeadAttSlice::MultiHeadAttSlice(unsigned int, unsigned int, unsigned int, slice_index_t): Assertion `nHeads % nSlices == 0' failed. #98

Open
EntusiastaIApy opened this issue Jul 7, 2024 · 3 comments

Comments

@EntusiastaIApy

Hello, @b4rtaz!

I'm trying to run the model nkpz/llama2-22b-chat-wizard-uncensored on a cluster composed of 1 Raspberry Pi 4B 8 GB and 7 Raspberry Pi 4B 4 GB, but in both inference and chat modes Distributed Llama throws the following error. Do you know why this is happening and how to fix it?

[Screenshot: llama2-22b-chat-wizard-uncensored_q40_8nodes_switch_sdcard_inference-error]

@b4rtaz
Owner

b4rtaz commented Jul 10, 2024

Hello @EntusiastaIApy,

I think the problem is this: `"num_attention_heads": 52`. The current implementation expects this number to be divisible by the number of nodes without a remainder.

52 / 8 => 6 remainder 4

This is basically a bug.
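
For reference, here is a minimal sketch of the divisibility constraint behind the assertion. The struct layout and field names are hypothetical and only mirror the message in the log, not the actual `src/commands.cpp` code:

```cpp
#include <cassert>

// Hypothetical sketch: each node (slice) receives nHeads / nSlices attention
// heads, so the division must leave no remainder.
struct MultiHeadAttSlice {
    unsigned int nHeads;
    unsigned int nSlices;
    unsigned int headsPerSlice;

    MultiHeadAttSlice(unsigned int nHeads, unsigned int nSlices)
        : nHeads(nHeads), nSlices(nSlices) {
        assert(nHeads % nSlices == 0); // fails for 52 heads on 8 nodes (52 % 8 == 4)
        headsPerSlice = nHeads / nSlices;
    }
};

int main() {
    MultiHeadAttSlice ok(52, 4);   // 52 / 4 == 13 heads per node: passes
    MultiHeadAttSlice bad(52, 8);  // 52 % 8 == 4: assertion fires, as in the log
    (void)ok; (void)bad;
    return 0;
}
```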

@Different-Pranav

I am facing a similar issue. I am trying to run TinyLlama in the dllama environment with 2 worker nodes of 8 GB RAM each, but it throws a similar error.
[Screenshot 2024-09-13 200436]

@b4rtaz
Owner

b4rtaz commented Sep 13, 2024

@Different-Pranav you are using 3 nodes (root + 2 workers). You should try with 2 nodes (1 root + 1 worker) or 4 nodes (1 root + 3 workers).
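
As a quick sanity check (assuming TinyLlama-1.1B's 32 attention heads; please verify `num_attention_heads` in the model's config.json), the same divisibility rule explains why 3 nodes trips the assertion while 2 or 4 do not:

```cpp
#include <cstdio>
#include <initializer_list>

int main() {
    // Assumed head count for TinyLlama-1.1B; check the model's config.json.
    unsigned int nHeads = 32;
    for (unsigned int nNodes : {2u, 3u, 4u}) {
        std::printf("%u nodes: %s\n", nNodes,
                    nHeads % nNodes == 0 ? "heads divide evenly" : "assertion would fail");
    }
    return 0;
}
```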
