Arguments for a rollback.

perchance · Perchance - Create a Random Text Generator · by Lucalis

After using the new model for over a month, mostly on AI Story Generator, and looking into the old and new AI models, I've reached a conclusion that, in my opinion, makes sense.

The old model was Llama 2. Llama 2 (and Llama 3) are models fed on books, as in lots of literature. Meta licensed a LOT of them to train the models.

The new model is DeepSeek, or at least it seems to be. We'll assume it is, though to be fair, it doesn't change the argument much. DS has an issue: it is trained on general content, say the internet, some books obviously, interactions, etc.

Now, what's the issue with this?

Llama is a model that knows WAY better how a story works, having hundreds of them in its dataset and having processed them during its training. DS doesn't. DS is a more generalist model, designed more as an assistant than a story creator.

For the kind of usage done here, essentially either chatting with characters in AI-Character-Chat or writing a story with AI-Story Generator, the improvement in context and general knowledge DS gives is not worth the decrease in narrative quality and understanding of story writing. That's not to mention all the hallucinations, the total disregard for context and prompting, and similar problems the new model has.

Llama 2 is a way better option for the kind of usage we have. Yes, we would be losing some general knowledge. Yes, it may not be the best AI model out there. But all things considered, it's a matter of choosing the best option for our use case.

I understand the dev does all this work alone, and I appreciate his effort. That's why, as a really active user of this platform and service, I consider the best choice here is to return to the old model.

If you have more arguments for it, please add them in the comments. Thanks everyone for your time.

-Lucalis.

View original on lemmy.world

-3

Comments (11)

Garth01

lemmy.world

I agree. The new model isn't very good. The main issue I have with it is that it tends to break immersion, saying stuff like "BREAKING_BAD_PATTERNS" or talking about "breaking bad patterns" for no reason whatsoever and with no connection to the roleplay at hand. Characters also tend to speak like this all the time:

John nods, "I think we've got it under control," he said "but watch your back just in case."

Even if the initial message has asterisk roleplay such as this, characters added to that thread will still talk like the above example. Asterisks have to be incorporated a LOT into the initial message (at least from what I observed) for the new characters to RP like that.

The new model isn't that bad though, as it is able to portray fictional characters almost perfectly. I don't really use the AI Story Generator much, I only use the AI Character Chat and occasionally the image generator. I'm saying this based on what I've observed from there.

Lucalis reply

lemmy.world

I don't have as much experience with character chats, but the old model was perfectly capable of handling fictional characters. I mostly use it for story creation, and the decrease in narration quality is staggering.

Almaumbria reply

lemmy.world

Randomize reply

lemmy.world

Good luck, I hope you're successful! I also like the new model for rp (when it works properly once in a while); it's much smarter and doesn't need me to hold its hand for every little detail. I immediately noticed the difference. The old model often didn't understand that chars can't know what happens in places far away unless there is some kind of stable connection. My user isn't with the char 24/7, so that irked me quite a bit. The new model knew without a prompt. <3

Lucalis reply

lemmy.world

it’s much smarter and doesn’t need me to hold its hand for every little detail

Maybe I'm the unluckiest guy ever, but on my end, the AI just hallucinates whatever it wants when I do something with a character. I MUST actively guide it to obvious things, and it still just completely ignores them, something that never happened before.

Randomize reply

lemmy.world

That's why I said when it works once in a while. There are certain hours where I think dev is working on the model constantly (like right now) and yes, then it's dumb af. But like... uh... I don't know, like 10 hours ago or so, it worked perfectly fine for me. I was able to have a real flow of back-and-forth messages for the 10 mins I used it, without much rerolling or needing to prompt real-life mechanics like "char can't see what user does while texting" (from across the city)

Those 10 mins I got more story done than in two hours yesterday. And this wasn't the first time, that's why I think the dev might work on it at certain more or less fixed times.

Randomize reply

lemmy.world

Uhhh... damn, I jinxed it. Maybe because it's the weekend.

Lucalis reply

lemmy.world

Those 10 mins I got more story done than in two hours yesterday. And this wasn’t the first time, that’s why I think the dev might work on it at certain more or less fixed times.

The issue is... I use it all the time; it helps with my Asperger's and ADHD.

Randomize reply

lemmy.world

Maybe see it like your doc being on vacation. If there's a serious issue, you need to find a substitute, otherwise you need to wait. It might not be perfect, but once the doc comes back, they are (hopefully) better than before.

It will probably take more time, days, weeks, maybe even months, idk much about programming. But complaining right now always seems to me like shouting at a surgeon mid-surgery about why there's so much blood.

Lucalis reply

lemmy.world

Those 10 mins I got more story done than in two hours yesterday. And this wasn’t the first time, that’s why I think the dev might work on it at certain more or less fixed times.

Following your own analogy, the problem is: this didn't need any surgery.

Lucalis reply

lemmy.world

More on-topic, I’ve been testing the new model a lot, writing prompts for it from scratch, and the results are amazing: it can consistently understand complex, structured instructions, so one can more reliably make little ‘programs’ with it, not just narrative stuff. But you have to understand that generators using old prompts will more than likely not work out of the box, you have to tinker with them to get the results you want.

Being brutally honest, no. The AI just does whatever it wants. How long are your stories? Cos the old model used to handle my 300k-word ones with ease (around 2.1MB as the downloaded JSON), and the new model can't even understand what point of the story it is at. Like, its consistency is horrid; it just becomes idiotic after 50 paragraphs, sometimes even fewer.

The whole point of AI-Story-Generator is to be a model capable of creating a long story, and the situation now is: it can't.

Almaumbria reply

lemmy.world

Lucalis reply

lemmy.world

Patience will be more rewarding than a rollback

Really, REALLY doubt it. I'm struggling right now to get it to write about alternate history, and not even the entire AH, just fucking single dialogues. It hallucinates that the character is drunk and tired when nothing similar was even mentioned, it ignores already written paragraphs, and it does whatever it wants. It is not getting better, unless you consider getting stupider better.

https://perchance.org/story-ai#data=uup1%3A7c498bf05802fc5b74f5e9eb85becacf.gz

Here's the story as an example.

Almaumbria reply

lemmy.world