Improve attention performance for qwen2.5 & deepseek #46

orionpapadakis · 2025-09-01T10:25:10Z

This PR adds a new Tornado kernel for parallel attention and improves grid sizes calculations.

mikepapadim · 2025-09-01T10:28:02Z

src/main/java/org/beehive/gpullama3/tornadovm/Qwen2TornadoVMLayerPlanner.java

        rmsNormWorker.setGlobalWork(config.dim(), 1, 1);  // Set global work size to total dimension
        rmsNormWorker.setLocalWork(32, 1, 1);         // Set local work size to 256 (standard efficient size)

        // Parallel attention worker configuration


put it into a seperate method:
and make 64 constant etc

public static int computeOptimalLocalSize(int headSize) { int optimalLocalSize = Math.min(headSize, 64); if (headSize % optimalLocalSize != 0) { for (int size = optimalLocalSize; size >= 1; size--) { if (headSize % size == 0) { return size; } } } return optimalLocalSize; }

Improve attention performance for qwen2.5 & deepseek

a8aeaf8

orionpapadakis assigned mikepapadim Sep 1, 2025

orionpapadakis added the enhancement New feature or request label Sep 1, 2025

orionpapadakis unassigned mikepapadim Sep 1, 2025

orionpapadakis requested a review from mikepapadim September 1, 2025 10:25

mikepapadim approved these changes Sep 4, 2025

View reviewed changes

mikepapadim merged commit efbe261 into beehive-lab:main Sep 4, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve attention performance for qwen2.5 & deepseek #46

Improve attention performance for qwen2.5 & deepseek #46

Uh oh!

orionpapadakis commented Sep 1, 2025

Uh oh!

mikepapadim Sep 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Improve attention performance for qwen2.5 & deepseek #46

Improve attention performance for qwen2.5 & deepseek #46

Uh oh!

Conversation

orionpapadakis commented Sep 1, 2025

Uh oh!

mikepapadim Sep 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants