File tree
1,523 files changed
+118227
-96749
lines changed- .buildkite
- nightly-benchmarks/scripts
- .github
- benchmarks
- kernels/deepgemm
- cmake
- csrc
- cutlass_extensions
- moe/marlin_moe_wna16
- quantization
- gptq_marlin
- machete
- docs
- deployment/integrations
- design
- features
- mkdocs/hooks
- examples
- others
- requirements
- tests
- basic_correctness
- benchmarks
- compile
- piecewise
- config
- cuda
- detokenizer
- distributed
- engine
- entrypoints
- llm
- offline_mode
- openai
- correctness
- tool_parsers
- pooling
- correctness
- llm
- openai
- evals
- gpt_oss
- gsm8k
- configs
- kernels
- attention
- core
- mamba
- moe
- modular_kernel_tools
- quantization
- kv_transfer
- lora
- model_executor
- model_loader
- fastsafetensors_loader
- runai_model_streamer
- tensorizer_loader
- models
- language
- generation_ppl_test
- generation
- pooling_mteb_test
- pooling
- multimodal
- generation
- vlm_utils
- pooling
- processing
- quantization
- multimodal
- plugins_tests
- plugins
- lora_resolvers
- prithvi_io_processor_plugin/prithvi_io_processor
- vllm_add_dummy_model
- vllm_add_dummy_model
- vllm_add_dummy_platform
- vllm_add_dummy_platform
- quantization
- reasoning
- samplers
- speculative_decoding/speculators
- standalone_tests
- tokenization
- tool_use
- mistral
- tools
- tpu
- lora
- transformers_utils
- utils_
- v1
- attention
- core
- cudagraph
- distributed
- e2e
- engine
- entrypoints
- llm
- openai
- responses
- executor
- generation
- kv_connector
- nixl_integration
- unit
- kv_offload
- logits_processors
- metrics
- sample
- shutdown
- spec_decode
- structured_output
- tpu
- worker
- tracing
- worker
- vllm_test_utils
- vllm_test_utils
- weight_loading
- tools
- pre_commit
- profiler
- nsys_profile_tools
- vllm
- assets
- attention
- backends
- layers
- ops
- utils
- benchmarks
- lib
- compilation
- config
- device_allocator
- distributed
- device_communicators
- eplb
- kv_transfer
- kv_connector
- v1
- p2p
- kv_lookup_buffer
- kv_pipe
- engine
- entrypoints
- cli
- benchmark
- openai
- tool_parsers
- executor
- inputs
- logging_utils
- lora
- layers
- ops
- ipex_ops
- torch_ops
- triton_ops
- xla_ops
- punica_wrapper
- model_executor
- layers
- fla/ops
- fused_moe
- mamba
- ops
- quantization
- compressed_tensors
- schemes
- transform
- schemes
- kernels
- mixed_precision
- scaled_mm
- quark
- schemes
- utils
- rotary_embedding
- model_loader
- models
- warmup
- multimodal
- platforms
- plugins
- io_processors
- lora_resolvers
- profiler
- ray
- reasoning
- transformers_utils
- chat_templates
- configs
- speculators
- processors
- tokenizers
- triton_utils
- usage
- utils
- v1
- attention/backends
- mla
- core
- sched
- engine
- executor
- kv_offload
- backends
- worker
- metrics
- pool
- sample
- logits_processor
- ops
- tpu
- spec_decode
- structured_output
- worker
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
1,523 files changed
+118227
-96749
lines changedLines changed: 1 addition & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
368 | 368 |
| |
369 | 369 |
| |
370 | 370 |
| |
371 |
| - | |
| 371 | + | |
372 | 372 |
| |
373 | 373 |
| |
374 | 374 |
| |
|
This file was deleted.
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
477 | 477 |
| |
478 | 478 |
| |
479 | 479 |
| |
| 480 | + | |
480 | 481 |
| |
481 | 482 |
| |
482 | 483 |
| |
| |||
834 | 835 |
| |
835 | 836 |
| |
836 | 837 |
| |
837 |
| - | |
| 838 | + | |
838 | 839 |
| |
839 | 840 |
| |
840 | 841 |
| |
841 |
| - | |
| 842 | + | |
842 | 843 |
| |
843 | 844 |
| |
844 | 845 |
| |
| |||
865 | 866 |
| |
866 | 867 |
| |
867 | 868 |
| |
| 869 | + | |
| 870 | + | |
| 871 | + | |
| 872 | + | |
| 873 | + | |
| 874 | + | |
| 875 | + | |
| 876 | + | |
| 877 | + | |
| 878 | + | |
868 | 879 |
| |
869 | 880 |
| |
870 | 881 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
23 | 23 |
| |
24 | 24 |
| |
25 | 25 |
| |
| 26 | + | |
26 | 27 |
| |
27 | 28 |
| |
28 | 29 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
6 | 6 |
| |
7 | 7 |
| |
8 | 8 |
| |
9 |
| - | |
10 |
| - | |
11 |
| - | |
12 |
| - | |
13 |
| - | |
14 |
| - | |
15 |
| - | |
16 | 9 |
| |
17 |
| - | |
| 10 | + | |
18 | 11 |
| |
19 |
| - | |
| 12 | + | |
20 | 13 |
| |
21 | 14 |
| |
22 |
| - | |
23 | 15 |
| |
24 | 16 |
| |
25 | 17 |
| |
26 | 18 |
| |
27 |
| - | |
28 |
| - | |
29 |
| - | |
30 |
| - | |
31 | 19 |
| |
32 | 20 |
| |
33 | 21 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
2 | 2 |
| |
3 | 3 |
| |
4 | 4 |
| |
| 5 | + | |
5 | 6 |
| |
6 | 7 |
| |
7 |
| - | |
8 | 8 |
| |
9 | 9 |
| |
10 | 10 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
5 | 5 |
| |
6 | 6 |
| |
7 | 7 |
| |
| 8 | + | |
8 | 9 |
| |
9 | 10 |
| |
10 |
| - | |
11 | 11 |
| |
12 | 12 |
| |
13 | 13 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
37 | 37 |
| |
38 | 38 |
| |
39 | 39 |
| |
40 |
| - | |
41 |
| - | |
42 |
| - | |
43 | 40 |
| |
44 | 41 |
| |
45 | 42 |
| |
46 | 43 |
| |
47 | 44 |
| |
| 45 | + | |
| 46 | + | |
48 | 47 |
| |
49 | 48 |
| |
50 | 49 |
| |
|
0 commit comments