Skip to content

Copy-in embeddings in reduced precision and handle precision conversion during inference#73

Open
mikepapadim wants to merge 3 commits intomainfrom
feat/fp16-emb
Open

Copy-in embeddings in reduced precision and handle precision conversion during inference#73
mikepapadim wants to merge 3 commits intomainfrom
feat/fp16-emb

Commits