Make Merlin AI uncensored, remove all query keyword limits | Voters

Make Merlin AI uncensored, remove all query keyword limits

under review

Is it not possible to remove all merlin ai keywords flagging? I think it's completely unnecessary to be honest, take emochi for example as they didn't implement any limit. Let the model itself withold the restrictions rather than merlin itself. I for one wanted to use grok 3 to it's full capabilities without Merlin flagging certain keywords

April 13, 2025

Vijay Bharadwaj

Hi Heero, I've merged your post to a similar one. Please read the discussion here for answers :)

Vijay Bharadwaj

Merged in a post:

"Flagged" prompts can be annoying. Is it the LLMs or Merlin.AI that does the flagging?

Heero

I ran into issues with flagged queries twice now: Once when trying to summarize the accusations against Prince Andrew of the British Royals for a friend who wasn't aware of it. The other time when I tried to fact check a tweet regarding the sexual assault accusations against an Israeli Minister. I find AI enormously helpful to fact check internet news, and this one seemed particularly nasty. I checked with ChatGPT, and it also flagged the tweet content right away. Is this the LLMs doing the flagging, or Merlin.AI? There is some room for improvement, I guess.

April 15, 2025

Vijay Bharadwaj

marked this post as

under review

Vijay Bharadwaj

Hi, the current restrictions are important for us to continue giving uninterrupted service of all models, because API providers (like OpenAI) impose strict enforcement on vulgar/inappropriate content. While we understand your frustration, loosening these rules would cause a problem. 
Imagine a million users with loosened rules on what they can generate. The number of inappropriate results would be significant, and providers may interrupt our service. 
These restrictions are even stronger on mobile, who impose even stronger enforcement on NSFW content generation.
That being said, I'm putting this under review. Let's see what we can do -- I've told our tech team about this.

Vijay Bharadwaj

That's a possibility we're considering. If it does pass as a suitable solution, all NSFW-marked queries would pass through Grok/open source models. However, the rules of distribution are sticky across mobile app stores, and we'd have to tread carefully. We're considering our options here.
Generally, providers have different rules for their own product (say, chatgpt.com) versus their API offering, where they restrict NSFW prompts, jailbreak attempts etc.
Our app is suitable for ages 3+ right now. OpenAI is 12+. So is Reddit (though you might be aware what goes on at Reddit 👀). We might have to change our ratings if we make these changes. So yeah, all this considered -- we'll see what we can do.

Heero

Vijay Bharadwaj Reddit is awful 😂 I don't know what a solution would be, but it's of course a bit tricky if whole areas of research or creation are limited because some idiots abuse them. The old problem. 🤔
Is there a chance to create different tiers of membership for different age groups? 
How is the flagging done? By keywords?

Heero

i think that's a constant worry with AI creations - that they can be used to create or amplify nasty content from fake news to porn. I see the issue, it's just as always a difficulty of limiting creativity and also (which bothers me the most) research.