-
Notifications
You must be signed in to change notification settings - Fork 1.1k
llama_tokenize: too many tokens (Requested tokens exceed context window of 512) #416
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Is this error from langchain or llama-cpp-python? |
@gjmulder as far as i can tell from llama-cpp-python. Langchain is just an orchestration layer. And the vector store (chromadb) works just fine---it reports the number of hits before llama gives this error. |
Updated with failure logs. Note: i'm getting a |
Why not increase |
Did not know this parameter was for token size :-) Changing this to Thanks @gjmulder |
Note that llama models support a maximum context size of 2048. |
Delete this for now to avoid confusion since it contains some wrong checksums from the old tokenizer format Re-add after abetlen#374 is resolved
* Revert "Delete SHA256SUMS for now (abetlen#416)" This reverts commit 8eea5ae. * Remove ggml files until they can be verified * Remove alpaca json * Add also model/tokenizer.model to SHA256SUMS + update README --------- Co-authored-by: Pavol Rusnak <[email protected]>
Uh oh!
There was an error while loading. Please reload this page.
Prerequisites
Please answer the following questions for yourself before submitting an issue.
Expected Behavior
ggml-vic13b-q4_0.bin
RecursiveCharacterTextSplitter
.stuff
'ed chained queries... specially when there are only a few matching chunks.See the notebook below.
Current Behavior
Environment and Context
Please provide detailed information about your computer setup. This is important in case the issue is not reproducible except for under certain specific conditions.
mac m2 pro
Darwin UAVALOS-M-NR30 22.5.0 Darwin Kernel Version 22.5.0: Mon Apr 24 20:53:19 PDT 2023; root:xnu-8796.121.2~5/RELEASE_ARM64_T6020 arm64
Failure Information (for bugs)
Please help provide information about the failure if this is a bug. If it is not a bug, please remove the rest of this template.
Steps to Reproduce
Here is the jupyter notebook:
Error Logs
The text was updated successfully, but these errors were encountered: