I was working on a project using Claude 4.0 and felt that the chat was getting too long, so I prompted Claude 4.0 to generate a context document summarising the chat, but it failed three times because the responses kept getting cut off. The last two times I tried to get it to generate an output, it used up 25-30% of my monthly fair usage limit. I am willing to share the chat with someone at Merlin, but two prompts using up this much of my fair usage limit is not acceptable at all.
This issue has been occurring frequently since the new agentic flow feature was introduced, and there’s no way to disable this feature. I can share the chat if necessary, but I can't do so here, as it contains information I prefer not to make public.