
Conversation


@pdufour pdufour commented Nov 3, 2024

No description provided.

ahnjj and others added 30 commits October 8, 2024 17:08
…33894)

* docs: ko: gpt_neox_japanese.md

* Update _toctree.yml

* fix: manual edits

* Update docs/source/ko/model_doc/gpt_neox_japanese.md

Co-authored-by: Sungmin Oh <[email protected]>

* Update docs/source/ko/model_doc/gpt_neox_japanese.md

Co-authored-by: Sungmin Oh <[email protected]>

* Update docs/source/ko/model_doc/gpt_neox_japanese.md

Co-authored-by: Sungmin Oh <[email protected]>

---------

Co-authored-by: Sungmin Oh <[email protected]>
* fix: toctree edits

* feat: nmt-draft

* fix: edit Inline TOC
…ingface#33959)

* docs: ko: main_classes/quantization.md

* feat: nmt draft

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

---------

Co-authored-by: Ahnjj_DEV <[email protected]>
…gingface#33952)

* docs: ko: main_classes/configuration.md

* feat: nmt draft
)

* docs: ko: model_doc/mamba.md

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

---------

Co-authored-by: Ahnjj_DEV <[email protected]>
…ce#33574)

* docs: ko: model_doc/autoformer.md

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestions
…face#33587)

* docs: ko: model_doc/patchtsmixer.md

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestions

Co-authored-by: HyeokJun SHIN <[email protected]>

* fix: resolve suggestions

---------

Co-authored-by: HyeokJun SHIN <[email protected]>
)

* docs: ko: model_doc/clip.md

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: HyeokJun SHIN <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: HyeokJun SHIN <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

---------

Co-authored-by: Ahnjj_DEV <[email protected]>
Co-authored-by: HyeokJun SHIN <[email protected]>
…e#33612)

* docs: ko: model_doc/paligemma.md

* feat: nmt draft

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

---------

Co-authored-by: Ahnjj_DEV <[email protected]>
…3635)

* docs: ko: model_doc/llama3.md

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: Chaewon Song <[email protected]>

* fix: resolve suggestions

Co-authored-by: HyeokJun SHIN <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: Chaewon Song <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

---------

Co-authored-by: Chaewon Song <[email protected]>
Co-authored-by: HyeokJun SHIN <[email protected]>
Co-authored-by: Ahnjj_DEV <[email protected]>
…33648)

* docs: ko: model_doc/mistral.md

* feat: nmt draft

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>
Co-authored-by: Chaewon Song <[email protected]>
Co-authored-by: HyeokJun SHIN <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: HyeokJun SHIN <[email protected]>

---------

Co-authored-by: Ahnjj_DEV <[email protected]>
Co-authored-by: Chaewon Song <[email protected]>
Co-authored-by: HyeokJun SHIN <[email protected]>
…3885)

* docs: ko: model_doc/cohere.md

* feat: nmt draft

* fix: resolve suggestions

Co-authored-by: HyeokJun SHIN <[email protected]>
Co-authored-by: SeongWooChoi <[email protected]>

* fix: resolve suggestions

---------

Co-authored-by: HyeokJun SHIN <[email protected]>
Co-authored-by: SeongWooChoi <[email protected]>
* docs: ko: model_doc/dbrx.md

* feat: nmt draft

* fix: resolve suggestions

Co-authored-by: SeongWooChoi <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

---------

Co-authored-by: SeongWooChoi <[email protected]>
…ce#33968)

* docs: ko: model_doc/deberta-v2.md

* feat: nmt draft

* fix: resolve suggestions

Co-authored-by: Chaewon Song <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

---------

Co-authored-by: Chaewon Song <[email protected]>
…33601)

* docs: ko: main_classes/onnx.md

* feat: nmt draft

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: SeongWooChoi <[email protected]>

* fix: resolve suggestions

Co-authored-by: SeongWooChoi <[email protected]>

* fix: resolve suggestions

Co-authored-by: Ahnjj_DEV <[email protected]>

---------

Co-authored-by: Ahnjj_DEV <[email protected]>
Co-authored-by: SeongWooChoi <[email protected]>
…#33813)

* docs: ko: tokenization_utils.md

* feat: nmt draft

* fix: manual edits
* ko: doc: model_doc/swin.md

* feat: nmt draft

* fix: manual edits

* fix: manual edits

* fix: manual edits

* fix: manual edits

* fix: manual edits

* Update docs/source/ko/model_doc/swin.md

Co-authored-by: Yijun Lee <[email protected]>

* resolve conflicts

* resolve conflicts - 2

---------

Co-authored-by: Yijun Lee <[email protected]>
* docs: ko: file_utils.md

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestions

Co-authored-by: Jiwook Han <[email protected]>

---------

Co-authored-by: Jiwook Han <[email protected]>
* docs: ko: openai-gpt.md

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestions

Co-authored-by: Jiwook Han <[email protected]>
Co-authored-by: Chulhwa (Evan) Han <[email protected]>

* fix: resolve suggestions

* fix: resolve suggestions

---------

Co-authored-by: Jiwook Han <[email protected]>
Co-authored-by: Chulhwa (Evan) Han <[email protected]>
* docs: ko: biogpt.md

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestion

Co-authored-by: Chulhwa (Evan) Han <[email protected]>

---------

Co-authored-by: Chulhwa (Evan) Han <[email protected]>
* docs: ko: model_doc/blip

* feat: nmt draft

* Apply suggestions from code review

Co-authored-by: Jiwook Han <[email protected]>

* Update docs/source/ko/model_doc/blip.md

Co-authored-by: Woojun Jung <[email protected]>

---------

Co-authored-by: Jiwook Han <[email protected]>
Co-authored-by: Woojun Jung <[email protected]>
* nmt draft

* fix toctree

* minor fix

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

Co-authored-by: boyunJang <[email protected]>
Co-authored-by: wony617 <[email protected]>

* Apply suggestions from code review

* Apply suggestions from code review

* Update docs/source/ko/main_classes/output.md

* Update docs/source/ko/_toctree.yml

Co-authored-by: Steven Liu <[email protected]>

---------

Co-authored-by: boyunJang <[email protected]>
Co-authored-by: wony617 <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
…face#33804)

* docs: ko: image_processing_utils.md

* feat: nmt draft

* fix: manual edits
…ce#33772)

* docs: ko: modular_transformers.md

* feat: nmt draft

* fix inline TOC

* fix: manual edits

* fix: resolve suggestions

* fix: resolve suggestions

Co-authored-by: Jiwook Han <[email protected]>
Co-authored-by: Chulhwa (Evan) Han <[email protected]>

* fix: resolve suggestions

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/ko/_toctree.yml

Co-authored-by: Steven Liu <[email protected]>

---------

Co-authored-by: Jiwook Han <[email protected]>
Co-authored-by: Chulhwa (Evan) Han <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
* add stablelm gguf architecture support

* add additional quantization tests

* resolve merge conflict, add weight conversion tests for fp16
…e#33950)

* Fix Failed tests with mobile bert

* Cast to the correct dtype

* Code fixup

* Fix padding_idx larger that embedding_size

* Reduce covariance more. use 1e-7 instead of 1e-5

* Comment fix

* Reduce covariance more. use 1e-9 instead of 1e-7

* Copy new config

* all but MRA fixed

* fix mra

* very flaky

* skip instead

* make fixup

---------

Co-authored-by: Joao Gante <[email protected]>
* fix tests

* don't need this

* style
VladOS95-cyber and others added 23 commits October 30, 2024 16:52
* add mamba architecture for gguf

* add logic for weights conversion, some fixes and refactoring

* add lm_head layers, unit test refactoring

* more fixes for tests

* remove lm_head creation

* remove unused comments
* add fast image processor rtdetr

* add gpu/cpu test and fix docstring

* remove prints

* add to doc

* nit docstring

* avoid iterating over images/annotations several times

* change torch typing

* Add image processor fast documentation
…Languages.(Changes made) (huggingface#34226)

* Update TRANSLATING.md

* Apply suggestions from code review

Co-authored-by: Steven Liu <[email protected]>

* Update TRANSLATING.md

---------

Co-authored-by: Steven Liu <[email protected]>
* enable QA bf16 pipeline

* add tests
* replace total_batched_samples with step while counting grad accum step

* remove unused variable

* simplify condition for update step

* fix format by ruff

* simplify update step condition using accelerator.sync_gradients

* simplify update condition using do_sync_step

* remove print for test

---------

Co-authored-by: Zach Mueller <[email protected]>
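The update-step simplification described in the commits above can be sketched as follows. This is a minimal mock under assumed names (`do_sync_step`, a fixed accumulation window), not the Trainer's actual code; it shows how a per-step sync flag replaces counting `total_batched_samples`:

```python
# Minimal sketch, assuming a fixed accumulation window: an optimizer update
# fires when the current micro-batch is the last one in the window, mirroring
# what accelerator.sync_gradients would report.
gradient_accumulation_steps = 4

def do_sync_step(step: int) -> bool:
    # True on the last micro-batch of each accumulation window.
    return (step + 1) % gradient_accumulation_steps == 0

update_steps = [step for step in range(8) if do_sync_step(step)]
print(update_steps)  # [3, 7]
```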
* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <[email protected]>
…e tests (huggingface#34518)

* fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests

Signed-off-by: Phillip Kuznetsov <[email protected]>

* [run_slow] dpt, depth_anything

---------

Signed-off-by: Phillip Kuznetsov <[email protected]>
* feat: add benchmarks pg indexes

* refactor: remove debug `df -h`
* try

* try

* try

* try

* try

* try

* update

* update

* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <[email protected]>
Update SiglipVisionEmbeddings.forward to cast input to correct dtype before embedding it.
* Standardize image-text-to-text-models-output

add post_process_image_text_to_text to chameleon and cleanup

Fix legacy kwarg behavior and deprecation warning

add post_process_image_text_to_text to qwen2_vl and llava_onevision

Add post_process_image_text_to_text to idefics3, mllama, pixtral processor

* nit var name post_process_image_text_to_text udop

* nit fix deprecation warnings

* Add image-text-to-text pipeline

* add support for image url in chat template for pipeline

* Reformat to be fully compatible with chat templates

* Add tests chat template

* Fix imports and tests

* Add pipeline tag

* change logic handling of single prompt and multiple images

* add pipeline mapping to models

* fix batched inference

* fix tests

* Add manual batching for preprocessing

* Fix outputs with nested images

* Add support for all common processing kwargs

* Add default padding when multiple text inputs (batch size>1)

* nit change version deprecation warning

* Add support for text only inference

* add chat_template warnings

* Add pipeline tests and add copied from post process function

* Fix batched pipeline tests

* nit

* Fix pipeline tests blip2

* remove unnecessary max_new_tokens

* revert processing kosmos2 and remove unnecessary max_new_tokens

* fix pipeline tests idefics

* Force try loading processor if pipeline supports it

* revert load_processor change

* hardcode loading only processor

* remove unnecessary try except

* skip imagetexttotext tests for kosmos2 as tiny model causes problems

* Make code clearer

* Address review comments

* remove preprocessing logic from pipeline

* fix fuyu

* add BC resize fuyu

* Move post_process_image_text_to_text to ProcessorMixin

* add guard in post_process

* fix zero shot object detection pipeline

* add support for generator input in pipeline

* nit

* change default image-text-to-text model to llava onevision

* fix owlv2 size dict

* Change legacy deprecation warning to only show when True
…34419)

* Remove interpolate_pos_encoding

* Make fixup

* Make interpolate_pos_encoding default to True

* Reuse existing interpolation

* Add integration test
* update doc

* Update docs/source/en/perf_train_cpu.md

Co-authored-by: Steven Liu <[email protected]>

* delete closing tip

---------

Co-authored-by: Steven Liu <[email protected]>
…bic (huggingface#33048)

* Add docs/source/ar/multilingual.md to Add_docs_source_ar_multilingual.md

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <[email protected]>

* Update _toctree.yml

* Update _toctree.yml

* Add translated files to branch for merge

* Update _toctree.yml

* Update _toctree.yml

* Update custom_models.md

* Update chat_templating.md

* Update docs/source/ar/create_a_model.md

Co-authored-by: Steven Liu <[email protected]>

* Update create_a_model.md

* Update gguf.md

* Update gguf.md

* Update gguf.md

* Update gguf.md

---------

Co-authored-by: Abdullah Mohammed <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
* set-get embeds

* add tests

* fix tests

* remove

* return dict True

* fix tests

* why did i remove this

* enable torchscript tests
* blip2 tests

* instructblips

* copies

* fix slow tests

* fix

* uncomment this

* clean up after rebase

* should be model main input

* fix overwritten tests

* oops len should be multiple of frame number

* style

* fix some tests
…emma2 config (huggingface#34540)

* fix query_pre_attn_scalar different from num_heads in default config

* propagate modular changes

* fix copies

* fix modular copies

* fix copies?

* correct copies fix
* rework converter

* Update modular_model_converter.py

* Update modular_model_converter.py

* Update modular_model_converter.py

* Update modular_model_converter.py

* cleaning

* cleaning

* finalize imports

* imports

* Update modular_model_converter.py

* Better renaming to avoid visiting same file multiple times

* start converting files

* style

* address most comments

* style

* remove unused stuff in get_needed_imports

* style

* move class dependency functions outside class

* Move main functions outside class

* style

* Update modular_model_converter.py

* rename func

* add augmented dependencies

* Update modular_model_converter.py

* Add types_to_file_type + tweak annotation handling

* Allow assignment dependency mapping + fix regex

* style + update modular examples

* fix modular_roberta example (wrong redefinition of __init__)

* slightly correct order in which dependencies will appear

* style

* review comments

* Performance + better handling of dependencies when they are imported

* style

* Add advanced new classes capabilities

* style

* add forgotten check

* Update modeling_llava_next_video.py

* Add priority list ordering in check_conversion as well

* Update check_modular_conversion.py

* Update configuration_gemma.py
* [i18n-HI] Translated accelerate page to Hindi

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <[email protected]>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <[email protected]>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <[email protected]>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <[email protected]>

---------

Co-authored-by: Kay <[email protected]>
Co-authored-by: K.B.Dharun Krishna <[email protected]>
@pdufour pdufour closed this Nov 3, 2024
zucchini-nlp pushed a commit that referenced this pull request Jan 16, 2025
* gptqmodel

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* update readme

Signed-off-by: jiqing-feng <[email protected]>

* gptqmodel need use checkpoint_format (#1)

* gptqmodel need use checkpoint_format

* fix quantize

* Update quantization_config.py

* Update quantization_config.py

* Update quantization_config.py

---------

Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>

* Revert quantizer_gptq.py (#2)

* revert quantizer_gptq.py change

* pass **kwargs

* limit gptqmodel and optimum version

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* fix warning

Signed-off-by: jiqing-feng <[email protected]>

* fix version check

Signed-off-by: jiqing-feng <[email protected]>

* revert unrelated changes

Signed-off-by: jiqing-feng <[email protected]>

* enable gptqmodel tests

Signed-off-by: jiqing-feng <[email protected]>

* fix requires gptq

Signed-off-by: jiqing-feng <[email protected]>

* Fix Transformer compat (#3)

* revert quantizer_gptq.py change

* pass **kwargs

* add meta info

* cleanup

* cleanup

* Update quantization_config.py

* hf_select_quant_linear pass checkpoint_format and meta

* fix GPTQTestCUDA

* Update test_gptq.py

* gptqmodel.hf_select_quant_linear() now does not select ExllamaV2

* cleanup

* add backend

* cleanup

* cleanup

* no need check exllama version

* Update quantization_config.py

* lower checkpoint_format and backend

* check none

* cleanup

* Update quantization_config.py

* fix self.use_exllama == False

* spell

* fix unittest

* fix unittest

---------

Co-authored-by: LRL <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* fix format again

Signed-off-by: jiqing-feng <[email protected]>

* update gptqmodel version (huggingface#6)

* update gptqmodel version

* update gptqmodel version

* fix unit test (huggingface#5)

* update gptqmodel version

* update gptqmodel version

* "not self.use_exllama" is not equivalent to "self.use_exllama==False"

* fix unittest

* update gptqmodel version

* backend is loading_attributes (huggingface#7)

* fix format and tests

Signed-off-by: jiqing-feng <[email protected]>

* fix memory check

Signed-off-by: jiqing-feng <[email protected]>

* fix device mismatch

Signed-off-by: jiqing-feng <[email protected]>

* fix result check

Signed-off-by: jiqing-feng <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* update tests

Signed-off-by: jiqing-feng <[email protected]>

* review: update docs (huggingface#10)

* review: update docs (huggingface#12)

* review: update docs

* fix typo

* update tests for gptqmodel

Signed-off-by: jiqing-feng <[email protected]>

* update document (huggingface#9)

* update overview.md

* cleanup

* Update overview.md

* Update overview.md

* Update overview.md

* update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

---------

Co-authored-by: Qubitium-ModelCloud <[email protected]>

* typo

* doc note for asymmetric quant

* typo with apple silicon(e)

* typo for marlin

* column name revert: review

* doc rocm support

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
Co-authored-by: LRL-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: LRL <[email protected]>
Co-authored-by: Marc Sun <[email protected]>
Co-authored-by: Mohamed Mekkouri <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
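One commit note above observes that `not self.use_exllama` is not equivalent to `self.use_exllama == False`. A quick plain-Python illustration (hypothetical variable) of why the two conditions diverge when the attribute is unset:

```python
# When the attribute is None (unset), truthiness and equality diverge:
# `not None` is True, but `None == False` is False.
use_exllama = None
print(not use_exllama)       # True
print(use_exllama == False)  # False

# Only for an explicit False do the two agree.
use_exllama = False
print(not use_exllama)       # True
print(use_exllama == False)  # True
```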
zucchini-nlp pushed a commit that referenced this pull request Feb 14, 2025
* Resolve vptq conflict

* Rename spqr package to spqr_quant

* Get rid of aqlm mention

* Start working on tests

* Resolve ruff code checks

* Ruff format

* Isort

* Test updates

* Add gpu tag

* Rename to modules_to_not_convert

* Config update

* Docs and config update

* Docs and config update

* Update to update_torch_dtype

* spqr config parameter validation

* Ruff update

* Apply ruff fixes

* Test fixes

* Ruff update

* Mark tests as @slow again; Ruff; Docstring update

* Ruff

* Remove absolute path

* Resolve typo

* Remove redundant log

* Check accelerate/spqr availability

* Ruff fix

* Check if the config contains proper shapes

* Ruff test

* Documentation update

* overview update

* Ruff checks

* Ruff code quality

* Make style

* Update docs/source/en/quantization/spqr.md

Co-authored-by: Steven Liu <[email protected]>

* Update spqr.md

* Enable gptqmodel (huggingface#35012)

* gptqmodel

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* update readme

Signed-off-by: jiqing-feng <[email protected]>

* gptqmodel need use checkpoint_format (#1)

* gptqmodel need use checkpoint_format

* fix quantize

* Update quantization_config.py

* Update quantization_config.py

* Update quantization_config.py

---------

Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>

* Revert quantizer_gptq.py (#2)

* revert quantizer_gptq.py change

* pass **kwargs

* limit gptqmodel and optimum version

Signed-off-by: jiqing-feng <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* fix warning

Signed-off-by: jiqing-feng <[email protected]>

* fix version check

Signed-off-by: jiqing-feng <[email protected]>

* revert unrelated changes

Signed-off-by: jiqing-feng <[email protected]>

* enable gptqmodel tests

Signed-off-by: jiqing-feng <[email protected]>

* fix requires gptq

Signed-off-by: jiqing-feng <[email protected]>

* Fix Transformer compat (#3)

* revert quantizer_gptq.py change

* pass **kwargs

* add meta info

* cleanup

* cleanup

* Update quantization_config.py

* hf_select_quant_linear pass checkpoint_format and meta

* fix GPTQTestCUDA

* Update test_gptq.py

* gptqmodel.hf_select_quant_linear() now does not select ExllamaV2

* cleanup

* add backend

* cleanup

* cleanup

* no need check exllama version

* Update quantization_config.py

* lower checkpoint_format and backend

* check none

* cleanup

* Update quantization_config.py

* fix self.use_exllama == False

* spell

* fix unittest

* fix unittest

---------

Co-authored-by: LRL <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>

* fix format

Signed-off-by: jiqing-feng <[email protected]>

* fix format again

Signed-off-by: jiqing-feng <[email protected]>

* update gptqmodel version (huggingface#6)

* update gptqmodel version

* update gptqmodel version

* fix unit test (huggingface#5)

* update gptqmodel version

* update gptqmodel version

* "not self.use_exllama" is not equivalent to "self.use_exllama==False"

* fix unittest

* update gptqmodel version

* backend is loading_attributes (huggingface#7)

* fix format and tests

Signed-off-by: jiqing-feng <[email protected]>

* fix memory check

Signed-off-by: jiqing-feng <[email protected]>

* fix device mismatch

Signed-off-by: jiqing-feng <[email protected]>

* fix result check

Signed-off-by: jiqing-feng <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <[email protected]>

* update tests

Signed-off-by: jiqing-feng <[email protected]>

* review: update docs (huggingface#10)

* review: update docs (huggingface#12)

* review: update docs

* fix typo

* update tests for gptqmodel

Signed-off-by: jiqing-feng <[email protected]>

* update document (huggingface#9)

* update overview.md

* cleanup

* Update overview.md

* Update overview.md

* Update overview.md

* update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

---------

Co-authored-by: Qubitium-ModelCloud <[email protected]>

* typo

* doc note for asymmetric quant

* typo with apple silicon(e)

* typo for marlin

* column name revert: review

* doc rocm support

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <[email protected]>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <[email protected]>

---------

Signed-off-by: jiqing-feng <[email protected]>
Co-authored-by: LRL-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: LRL <[email protected]>
Co-authored-by: Marc Sun <[email protected]>
Co-authored-by: Mohamed Mekkouri <[email protected]>
Co-authored-by: Steven Liu <[email protected]>

* Fix : Nemotron Processor in GGUF conversion (huggingface#35708)

* fixing nemotron processor

* make style

* Update docs/source/en/quantization/spqr.md

Co-authored-by: Arthur <[email protected]>

* Add missing TOC to doc

---------

Signed-off-by: jiqing-feng <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
Co-authored-by: jiqing-feng <[email protected]>
Co-authored-by: LRL-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: Qubitium-ModelCloud <[email protected]>
Co-authored-by: ZX-ModelCloud <[email protected]>
Co-authored-by: LRL <[email protected]>
Co-authored-by: Marc Sun <[email protected]>
Co-authored-by: Mohamed Mekkouri <[email protected]>
Co-authored-by: Arthur <[email protected]>
zucchini-nlp pushed a commit that referenced this pull request Mar 17, 2025
…uggingface#36457)

Fixed 2 issues regarding `tests/trainer/test_data_collator.py::TFDataCollatorIntegrationTest::test_all_mask_replacement`:
1. I got the error `RuntimeError: "bernoulli_tensor_cpu_p_" not implemented for 'Long'`. This happens because `mask_replacement_prob=1` produces a probability tensor of dtype `torch.long`, which `torch.bernoulli` doesn't accept. I fixed this by manually casting the probability arguments in the `__post_init__` function of `DataCollatorForLanguageModeling`.
2. I also got the error `tensorflow.python.framework.errors_impl.InvalidArgumentError: cannot compute Equal as input #1(zero-based) was expected to be a int64 tensor but is a int32 tensor [Op:Equal]` from the line `tf.reduce_all((batch["input_ids"] == inputs) | (batch["input_ids"] == tokenizer.mask_token_id))` in `test_data_collator.py`. This occurs because the `inputs` variable has dtype `tf.int32`. I solved this by manually casting it to `tf.int64` in the test, since the expected return type of `batch["input_ids"]` is `tf.int64`.
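The first fix, casting probability arguments in `__post_init__`, can be sketched with a plain dataclass. This is a hypothetical stand-in (not the real `DataCollatorForLanguageModeling`), showing only the casting pattern:

```python
from dataclasses import dataclass

@dataclass
class CollatorSketch:
    # Field names mirror the collator's probability arguments.
    mlm_probability: float = 0.15
    mask_replacement_prob: float = 0.8

    def __post_init__(self):
        # Manual casts: an integer like mask_replacement_prob=1 becomes 1.0,
        # so downstream code builds a float probability tensor rather than a
        # Long one, which torch.bernoulli rejects.
        self.mlm_probability = float(self.mlm_probability)
        self.mask_replacement_prob = float(self.mask_replacement_prob)

collator = CollatorSketch(mask_replacement_prob=1)  # int input
print(type(collator.mask_replacement_prob).__name__)  # float
print(collator.mask_replacement_prob)  # 1.0
```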
zucchini-nlp pushed a commit that referenced this pull request Jul 23, 2025
* updated mistral3 model card (#1)

* updated mistral3 model card

* applying suggestions from code review

Co-authored-by: Steven Liu <[email protected]>

* made all changes to mistral3.md

* adding space between paragraphs in docs/source/en/model_doc/mistral3.md

Co-authored-by: Steven Liu <[email protected]>

* removing duplicate in mistral3.md

---------

Co-authored-by: Steven Liu <[email protected]>

* adding 4 backticks to preserve formatting

---------

Co-authored-by: Steven Liu <[email protected]>
zucchini-nlp pushed a commit that referenced this pull request Sep 30, 2025
* Fix EXAONE-4.0 dummy id

* Fix exaone4 dummy (#1)

* fix

* fix

* fix

* fix

* fix

---------

Co-authored-by: ydshieh <[email protected]>

---------

Co-authored-by: Yih-Dar <[email protected]>
Co-authored-by: ydshieh <[email protected]>