Eval bug: <think> tag with DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M.gguf #11325

@gnusupport

Name and Version

llama-cli --version
ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 CUDA devices:
  Device 0: NVIDIA GeForce GTX 1050 Ti, compute capability 6.1, VMM: yes
version: 159 (80d0d6b)
built with Debian clang version 14.0.6 for x86_64-pc-linux-gnu

The output contains a <think> block, which I should not see:

<think>
Okay, the user just sent "Hello". I should respond in a friendly and welcoming manner. Maybe say hello back and ask how I can assist them today. Keep it simple and open-ended so they can express what they need help with.
</think>

Hello! How can I assist you today?

This happens with DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M.gguf.

Operating systems

Linux

GGML backends

CUDA

Hardware

i5 + GTX 1050 Ti 4 GB

Models

DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M.gguf

Problem description & steps to reproduce

<think>
Okay, the user just sent "Hello". I should respond in a friendly and welcoming manner. Maybe say hello back and ask how I can assist them today. Keep it simple and open-ended so they can express what they need help with.
</think>

Hello! How can I assist you today?
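
Until this is handled inside llama-cli, one possible workaround is to strip the reasoning block on the client side. The snippet below is only a minimal sketch: it assumes the model always wraps its reasoning in literal <think> and </think> markers, as in the output above, and the helper name strip_think is hypothetical, not part of llama.cpp.

```python
import re

# Matches one <think>...</think> block, including newlines inside it,
# plus any whitespace that follows the closing tag.
_THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", re.DOTALL)


def strip_think(text: str) -> str:
    """Remove every <think>...</think> block from model output.

    Hypothetical helper for post-processing; assumes the tags appear
    literally and are properly closed.
    """
    return _THINK_BLOCK.sub("", text).strip()


raw = (
    "<think>\n"
    "Okay, the user just sent \"Hello\". I should respond in a friendly manner.\n"
    "</think>\n"
    "\n"
    "Hello! How can I assist you today?"
)

print(strip_think(raw))
```

Text without any <think> block passes through unchanged, so the filter can be applied to every response. An unclosed <think> tag (for example when generation is cut off) is not handled by this sketch.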

First Bad Commit

No response

Relevant log output

none
