Describe the bug
Error pops up when switching the model in the "Answer" tab.

To Reproduce
Steps to reproduce the behavior:
1. Switch the model in the lower-left corner (I tried "Llama-2-7b-chat-hf").
2. Enter a prompt and click "Generate".
3. See the error (a minimal standalone repro sketch follows below).
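For reference, a minimal standalone sketch that should hit the same code path. This is an assumption on my part: the traceback suggests AI Playground's llm_biz.py loads the model through ipex-llm's transformers-style API, and the model path and generation settings here are illustrative, not the app's actual values.

```python
from transformers import AutoTokenizer
from ipex_llm.transformers import AutoModelForCausalLM

# Hypothetical local path; AI Playground manages its own model directory.
model_path = "Llama-2-7b-chat-hf"

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, load_in_4bit=True).to("xpu")

inputs = tokenizer("what is AI?", return_tensors="pt").to("xpu")
# With a transformers release newer than what ipex-llm's patched llama
# forward expects, this generate call raises the same TypeError as the log.
output = model.generate(**inputs, max_new_tokens=32, do_sample=True)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```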
Screenshots
## Log captured below
load llm model Llama-2-7b-chat-hf finish. cost 15.6s
No chat template is defined for this tokenizer - using the default template for the LlamaTokenizerFast class. If the default is not appropriate for your model, please set tokenizer.chat_template to an appropriate template. See https://huggingface.co/docs/transformers/main/chat_templating for more information.
{'input_ids': tensor([[ 1, 1, 29961, 25580, 29962, 3532, 14816, 29903, 6778, 13,
3492, 526, 263, 8444, 13436, 20255, 29889, 3529, 3867, 9109,
29892, 11314, 936, 322, 16232, 2472, 304, 278, 1404, 29889,
3529, 3013, 278, 1962, 1426, 4086, 278, 1021, 408, 278,
1404, 1881, 29889, 13, 29966, 829, 14816, 29903, 6778, 13,
13, 5816, 338, 319, 29902, 29973, 518, 29914, 25580, 29962]],
device='xpu:0'), 'attention_mask': tensor([[1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]], device='xpu:0'), 'streamer': <transformers.generation.streamers.TextIteratorStreamer object at 0x000001C31D15D410>, 'num_beams': 1, 'do_sample': True, 'max_new_tokens': 1024, 'stopping_criteria': [<llm_biz.CustomStopCriteria object at 0x000001C31AB74E50>]}
Traceback (most recent call last):
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\service\llm_biz.py", line 69, in stream_chat_generate
model.generate(**args)
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\utils_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\ipex_llm\transformers\lookup.py", line 88, in generate
return original_generate(self,
^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\utils_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\ipex_llm\transformers\speculative.py", line 109, in generate
return original_generate(self,
^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\utils_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\ipex_llm\transformers\pipeline_parallel.py", line 241, in generate
return original_generate(self,
^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\utils_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\transformers\generation\utils.py", line 1575, in generate
result = self._sample(
^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\transformers\generation\utils.py", line 2697, in _sample
outputs = self(
^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\nn\modules\module.py", line 1518, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\nn\modules\module.py", line 1527, in _call_impl
return forward_call(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\transformers\models\llama\modeling_llama.py", line 1196, in forward
outputs = self.model(
^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\nn\modules\module.py", line 1518, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\torch\nn\modules\module.py", line 1527, in _call_impl
return forward_call(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\ipex_llm\transformers\models\llama.py", line 155, in llama_model_forward_4_38
return llama_model_forward_4_38_internal(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\byao2\AppData\Local\Programs\AI Playground\resources\env\Lib\site-packages\ipex_llm\transformers\models\llama.py", line 2590, in llama_model_forward_4_38_internal
causal_mask = self._update_causal_mask(attention_mask, inputs_embeds)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: LlamaModel._update_causal_mask() missing 1 required positional argument: 'cache_position'
exception:LlamaModel._update_causal_mask() missing 1 required positional argument: 'cache_position'
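This looks like a transformers/ipex-llm version mismatch: the ipex_llm code path is named llama_model_forward_4_38, i.e. written against transformers 4.38, but in transformers 4.39+ LlamaModel._update_causal_mask gained a required cache_position parameter that the 4.38-era call site never passes. A quick way to confirm which signature is installed (a diagnostic sketch, not a fix):

```python
import inspect
import transformers
from transformers.models.llama.modeling_llama import LlamaModel

# In transformers 4.38.x the signature is roughly
#   (self, attention_mask, input_tensor),
# while 4.39+ adds cache_position, which is exactly the argument
# the ipex_llm 4.38 code path fails to supply.
print(transformers.__version__)
print(inspect.signature(LlamaModel._update_causal_mask))
```

If that is the cause, pinning transformers to a 4.38.x release matching ipex-llm's patched forward (e.g. pip install "transformers==4.38.2") may work around it, though I have not verified this against AI Playground's bundled environment.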
Environment (please complete the following information):
OS: Win 11 23H2
GPU: iGPU
CPU: Core Ultra 7 155H
Version: v1.01b-MTL-H
Additional context
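One incidental note from the log: the tokenizer warning means this Llama-2-7b-chat-hf checkpoint ships without a chat_template in its tokenizer config, so transformers falls back to the default Llama template. If the fallback is ever wrong for a model, it can be set explicitly. A sketch using the standard transformers API; the Jinja template below is deliberately minimal and illustrative, not the official Llama-2 one:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Llama-2-7b-chat-hf")

# Minimal illustrative template: concatenate turns as "role: content" lines.
tokenizer.chat_template = (
    "{% for message in messages %}"
    "{{ message['role'] }}: {{ message['content'] }}\n"
    "{% endfor %}"
)

messages = [{"role": "user", "content": "what is AI?"}]
print(tokenizer.apply_chat_template(messages, tokenize=False))
```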