We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 674edca commit b48a15cCopy full SHA for b48a15c
docs/source/design/v1/metrics.md
@@ -279,7 +279,7 @@ every 5 seconds with some key metrics:
279
seconds
280
- The number of new tokens generated per second over the past 5
281
282
-- The prefix cache hit rate over the most recent 1 queries
+- The prefix cache hit rate over the most recent 1k kv-cache block queries
283
284
### Metrics Publishing - Prometheus
285
0 commit comments