Is it possible to increase the Baichuan2-13b default ctx length to 4096? #131

Open · trekrollercoaster opened this issue on Sep 22, 2023 · 0 comments

trekrollercoaster commented on Sep 22, 2023

【BUG: CUDA error invalid configuration argument】when the token length exceeds 2048 with Baichuan2-13B-Chat

When using the Baichuan2-13B-Chat GGML model with chatglm-cpp and the token length exceeds 2048, the process is killed with this error:

CUDA error 9 at /tmp/pip-install-alzfik5u/chatglm-cpp_304776c77dfd4f94a8265b20b0fe43e0/third_party/ggml/src/ggml-cuda.cu:6047: invalid configuration argument
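Until the default context length is raised, one workaround is to cap requests at the model's 2048-token limit on the caller's side so an oversized launch never reaches the CUDA kernels. A minimal sketch, assuming the chatglm-cpp Python bindings circa v0.2, where `Pipeline.chat()` takes a history list plus `max_length`/`max_context_length` keyword arguments (parameter names may vary by version, and the model path is hypothetical):

```python
# Minimal sketch of a caller-side guard, assuming the chatglm-cpp Python
# bindings (pip install chatglm-cpp). The keyword names follow the library's
# documented generation options but may differ across versions; the model
# path below is hypothetical.
import chatglm_cpp

pipeline = chatglm_cpp.Pipeline("baichuan2-13b-chat-ggml.bin")

history = ["<long user prompt here>"]
output = pipeline.chat(
    history,
    max_context_length=2048,  # stay within the compiled 2048-token ctx limit
    max_length=2048,          # cap total tokens (prompt + generation)
)
print(output)
```

This only avoids the crash; it does not extend the usable context to 4096, which would additionally need support in the converted model and the ggml CUDA kernels.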
trekrollercoaster changed the title from 【BUG: CUDA error invalid configuration argument】When token length more than 1000 use Baichuan2-13B-Chat to Is it possible to increase the Baichuan2-13b default ctx length to 4096? on Sep 25, 2023