Use the recommended settings for QwQ 32B to avoid endless thinking loops

According to Qwen, these are the recommended settings for inference:

  • Temperature of 0.6

  • Top_K of 40 (or 20 to 40)

  • Min_P of 0.1 (optional, but works well)

  • Top_P of 0.95

  • Repetition Penalty of 1.0 (1.0 means disabled in llama.cpp and transformers)

  • Chat template: <|im_start|>user\nCreate a Flappy Bird game in Python.<|im_end|>\n<|im_start|>assistant\n<think>\n

https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-effectively
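
The settings above can be sketched in code. This is a minimal illustration (not taken from the linked docs): the flag names match llama.cpp's CLI samplers (`--temp`, `--top-k`, `--top-p`, `--min-p`, `--repeat-penalty`), while the helper names and the way they are combined are my own.

```python
# Recommended QwQ-32B sampling settings, keyed by llama.cpp flag name.
QWQ_SETTINGS = {
    "temp": 0.6,
    "top-k": 40,
    "top-p": 0.95,
    "min-p": 0.1,
    "repeat-penalty": 1.0,  # 1.0 disables the penalty in llama.cpp
}

def llama_cli_args(settings: dict) -> list[str]:
    """Flatten the settings dict into llama.cpp command-line flags."""
    args: list[str] = []
    for name, value in settings.items():
        args += [f"--{name}", str(value)]
    return args

def build_prompt(user_msg: str) -> str:
    """Wrap a user message in the ChatML template from the bullet list,
    ending with the <think> opener so the model starts its reasoning block."""
    return (
        f"<|im_start|>user\n{user_msg}<|im_end|>\n"
        "<|im_start|>assistant\n<think>\n"
    )

print(" ".join(llama_cli_args(QWQ_SETTINGS)))
print(build_prompt("Create a Flappy Bird game in Python."))
```

The first print produces something like `--temp 0.6 --top-k 40 ...`, which can be appended to a `llama-cli` or `llama-server` invocation.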
This model would behave a lot better if these were the default settings, compared to what I have tested locally. Thank you for the wonderful product!

Status: Completed
Board: 🐛 Bug Reports