Running the examples/offline_inference_neuron.py script with vllm past runs into an assertion error in the NeuronExecutor. The cause is a type mismatch: scheduler outputs such as blocks_to_swap_in are now empty lists, but the assertion compares them against empty dictionaries, and in Python an empty list is never equal to an empty dictionary.
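
The mismatch can be reproduced in isolation. The sketch below is not the actual NeuronExecutor code: check_strict and check_lenient are hypothetical helpers, and the argument names other than blocks_to_swap_in are assumed for illustration. It only shows why an equality check against {} fails for an empty list, and one possible way to make the check independent of the container type.

```python
def check_strict(blocks_to_swap_in, blocks_to_swap_out, blocks_to_copy):
    """Hypothetical stand-in for a check that expects empty dicts."""
    # [] == {} evaluates to False, so this raises AssertionError
    # when the scheduler hands over empty lists.
    assert (blocks_to_swap_in == {}
            and blocks_to_swap_out == {}
            and blocks_to_copy == {})


def check_lenient(blocks_to_swap_in, blocks_to_swap_out, blocks_to_copy):
    """Hypothetical alternative that accepts any empty container."""
    # Empty lists and empty dicts are both falsy, so truthiness
    # works regardless of which type the scheduler produces.
    assert (not blocks_to_swap_in
            and not blocks_to_swap_out
            and not blocks_to_copy)
```

Calling check_strict([], [], []) raises AssertionError, while check_lenient([], [], []) and check_lenient({}, {}, {}) both pass. Whether a truthiness check or an explicit comparison against [] is the better fix depends on how strictly the executor wants to pin the type.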