Combine models for improve reasoning | Voters

Combine models for improve reasoning

under review

Endre

Here's a tweet about DeepClaude: https://x.com/Saboo_Shubham_/status/1885167873615945893
It combines Claude Sonnet 3.5 with DeepSeek R1 CoT reasoning to outperform OpenAI o1, DeepSeek R1, Claude Sonnet 3.5. (After we set our Anthropic and DeepSeek API keys.)
You could practically do the same, once R1 model arrives to Merlin. And I bet you could cook it over. 😏
https://deepclaude.com/

February 1, 2025

How the heck does that even work? Does it bounce responses between the bots in some kind of coordinated conversation before responding to the user?

Endre

X I guess it uses these models as agents, so yes, it processes one's output with the other. No other idea. :)

alan

I saw this on Aider's website with a benchmark showing that the combination beats both Claude and DeepSeek R1 independently for the dream team of coding

https://aider.chat/2025/01/24/r1-sonnet.html

endu

updated the status to

under review

endu

Thanks a ton for sharing this.. We’ve also been thinking about using multiple models for different purposes, either simultaneously or in steps. This approach looks super interesting, and we’ll definitely check it out..

Endre

endu Though I'm not sure it truly should be done this literally with DeepSeek. Maybe rather o3-mini?

Siddhartha

Endre: Our deepseek is via fireworks AI. So its hosted in the US.

Endre

Siddhartha Ohh, that's cool!

Photo Viewer

Photo Viewer