from the mod log:

The output from the LLM was provided as proof that someone needed to be banned.
I didn’t want to do this but my hand was forced by their dissembling, minimization and bullshit.
I didn’t catch the previous post and only gave it a quick skim now. My thoughts have more to do with how LLM-based moderation is viewed by users.
It’s not a new thing, since sentiment analysis based moderation has been around for a long while. Where it becomes a problem is
I also don’t agree with the privacy angle since all content here is public by nature, but I do see value in discussing these other problems since that’s what this community is for?
Also, while Rimu can defederate, letting people discuss it first is better. Best case scenario, the groups find some kind of compromise. Otherwise, it at least lets people weigh in on the platform policies and federation status, instead of having admins make that call on their own.
This is a good take, and I appreciate it.
The cold hard fact is that there are a lot of bad actors, even here on Lemmy. Most users are sheltered from it because of good mods and admins. It’s easy to shit on them, but these people sort through the absolute muck. There’s a reason we don’t have racists and gore and CP here: they keep all of it away from everyone’s eyes. I know this because I have seen some of the permanently scarring shit firsthand, and it was only a small fraction of what mods on world deal with.
So I don’t blame moderators for searching out tools that help them not see it.
However, each admin and mod has to decide for themselves and their users what is best for them, and that is purely an admin-level decision for their server. It’s very cool that on the fediverse everyone can make their own rules, and I like that nuance.
The thing is, nothing is automated. From what I can see it only flags potential posts for a mod/admin to verify and act on.
It’s not like this is a bot that scans everyone’s posts and automatically bans them, it’s an entirely manual process to download a comment history and AI ctrl-f for stuff.
There has been some deceptive information spread about what this tool actually seems to do, which is especially concerning after the recent (proven) misinformation about Nazis infiltrating our instances.