-
Notifications
You must be signed in to change notification settings - Fork 567
Pull requests: google/gemma.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Added flash attention, with both a single-q function, and a register-tiled function.
#698
opened Sep 5, 2025 by
copybara-service
bot
Loading…
Implement the matmul op with Onednn to leverage AMX optimization.
#413
opened Oct 8, 2024 by
copybara-service
bot
Loading…
Add configurables for norm/rope/activation/scale/residual connection.
#287
opened Jul 3, 2024 by
copybara-service
bot
Loading…
ProTip!
Follow long discussions with comments:>50.