-
Notifications
You must be signed in to change notification settings - Fork 14.2k
Webui/prompt processing progress #18300
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Webui/prompt processing progress #18300
Conversation
|
Just a nits improvement, I think showing percentage + ETA instead of elapsed time can be more useful: |
tools/server/webui/src/lib/components/app/chat/ChatMessages/ChatMessageAssistant.svelte
Outdated
Show resolved
Hide resolved
Address review feedback from ngxson
|
It can still be improved; I don't know if people have prompts that take several minutes, but adding the minutes might be a good idea! (and also we calculate the tokens/s we can display them, but it will bloat, and we already have the final value) |
ngxson
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very nice feature!
(May need approval from @allozaur too)


Make sure to read the contributing guidelines before submitting a PR
WebUI: Display prompt preprocessing progress
Integrates existing backend 'return_progress' feature into WebUI to show real-time token processing during prompt preprocessing.
What it does
Displays processing progress before generation starts:
Implementation
Testing
Progress updates are sent at batch boundaries. Use smaller batch sizes to see more frequent updates:
Then send a long prompt (500+ tokens) via WebUI to observe progress.
Notes
Tested with -b = 128 on GPU and large prompt
PR-18300.mp4
Close #17079