Noticeably slow thinking

I noticed that GLM 4.6 thinking was only writing about 0.5 tps (tokens per second) during its thinking. But once the thinking ended, it went back to a reasonable pace.

So, I tested out Deepseek 3.2 exp thinking, and noticed it also dropped to 1 tps while thinking.

I then tested Kimi K2 thinking, and yes, it’s definitely the thinking that is slower: K2 was only writing about 7-10 tps while thinking, but shot up to its normal tps when the thinking ended.
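
For anyone trying to reproduce this, here is a rough sketch of how the per-phase tps could be measured from a streamed response. It assumes you can log, for each streamed chunk, a timestamp, whether it belongs to the thinking or the answer, and its token count. The `Chunk` type, the phase labels, and the sample numbers are all hypothetical and synthetic, not taken from any provider's API or from my actual runs.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    t: float      # seconds since the request started (hypothetical log field)
    phase: str    # "thinking" or "answer" (hypothetical label)
    tokens: int   # number of tokens in this streamed chunk


def tps_by_phase(chunks):
    """Return {phase: tokens per second} for each phase.

    The rate is measured between the first and last chunk of a phase.
    The first chunk's tokens are excluded because they were generated
    before the first timestamp of that phase was recorded.
    """
    stats = {}
    for c in chunks:
        if c.phase not in stats:
            stats[c.phase] = {"first": c.t, "last": c.t, "tokens": 0}
        else:
            stats[c.phase]["last"] = c.t
            stats[c.phase]["tokens"] += c.tokens
    rates = {}
    for phase, s in stats.items():
        duration = s["last"] - s["first"]
        # A phase with a single chunk has no measurable duration.
        rates[phase] = s["tokens"] / duration if duration > 0 else float("nan")
    return rates


# Synthetic example data: slow thinking chunks, then faster answer chunks.
sample = [
    Chunk(0.0, "thinking", 1),
    Chunk(2.0, "thinking", 1),
    Chunk(4.0, "thinking", 1),
    Chunk(6.0, "thinking", 1),
    Chunk(7.0, "answer", 1),
    Chunk(7.1, "answer", 1),
    Chunk(7.2, "answer", 1),
    Chunk(7.3, "answer", 1),
    Chunk(7.4, "answer", 1),
    Chunk(7.5, "answer", 1),
]

rates = tps_by_phase(sample)
```

Comparing `rates["thinking"]` against `rates["answer"]` for the same request should show whether the slowdown is specific to the thinking phase, which is what I saw with all three models.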

Status: Investigating / Needs Repro
Board: Bug Reports
