[core][optimization] use a pool of numpy ndarray to hold seq data #5877

youkaichao · 2024-06-26T23:53:46Z

similar to #5584

the same benchmark command:

python benchmarks/benchmark_throughput.py --output-len 256 --input 256 --model meta-llama/Llama-2-7b-hf -tp 8

the same machine: 8*H100

before (current main): Throughput: 38.07 requests/s, 19493.23 tokens/s

after (this PR): Throughput: 38.94 requests/s, 19939.65 tokens/s

let's see if it breaks anything. we need to make sure, we only use python list when receiving/sending user's request. elsewhere, we should keep numpy array, where slicing is only a view operation. Never copy the whole sequence.

youkaichao · 2024-06-27T23:59:04Z

close as it is separated into #5882 and #5942

youkaichao added 12 commits June 26, 2024 15:42

seqdata pool

b3eeedd

fix hash_prefix_token_ids

cfc941e

update block to use numpy too

78c1a8d

use ndarray in sequence

6fe6cee

reduce np -> list conversion

abaf03d

reduce np -> list conversion

30f9188

remove function

020a6b2

remove type

20ab3cb

remove logical token block?

a6aff03

restore outputs

060c1f3

fix output

900477c

keep prompt_token_ids

ba4df45

This was referenced Jun 27, 2024

[core][misc] remove logical block #5882

Merged

[core][optimization] use a pool of numpy ndarray to hold seq data #5942

Closed

youkaichao closed this Jun 27, 2024

youkaichao deleted the seq_data_pool branch June 27, 2024 23:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[core][optimization] use a pool of numpy ndarray to hold seq data #5877

[core][optimization] use a pool of numpy ndarray to hold seq data #5877

Uh oh!

youkaichao commented Jun 26, 2024 •

edited

Loading

Uh oh!

youkaichao commented Jun 27, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

[core][optimization] use a pool of numpy ndarray to hold seq data #5877

[core][optimization] use a pool of numpy ndarray to hold seq data #5877

Uh oh!

Conversation

youkaichao commented Jun 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

youkaichao commented Jun 27, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

youkaichao commented Jun 26, 2024 •

edited

Loading