No idea on feasibility given reliance on external APIs, but it would be convenient if each model understood which messages it did or didn’t send within a thread.
I select GPT-4o as the model, and I ask “Why is the sky blue?”
I wait for, and eventually vet GPT-4o’s response.
Skeptical of GPT-4o’s response, I switch the model to DeepSeek within the same thread, and ask “what do you think about GPT-4o’s answer?”
From what I can tell, DeepSeek doesn’t understand that GPT-4o sent the previous response. Each model assumes ownership of every reply sent in response to user messages within a thread, irregardless of which model actually sent each reply.
This ostensibly has a negative effect on each model’s reasoning capabilities, as they spend time defending/justifying/correcting something they never said.
Please authenticate to join the conversation.
Gathering Interest
Feature Request
Get notified by email when there are changes.
Gathering Interest
Feature Request
Get notified by email when there are changes.