I was playing around with Lemmy statistics the other day, and I decided to take the number of comments per post. Essentially a measure of engagement – the higher the number the more engaging the post is. Or in other words how many people were pissed off enough to comment, or had something they felt like sharing. The average for every single Lemmy instance was 8.208262964 comments per post.
So I modeled that with a Poisson distribution, in stats terms X~Po(8.20826), then found the critical regions assuming that anything that had a less than 5% chance of happening, is important. In other words 5% is the significance level. The critical regions are the region either side of the distribution where the probability of ending up in those regions is less than 5%. These critical regions on the lower tail are, 4 comments and on the upper tail is 13 comments, what this means is that if you get less than 4 comments or more than 13 comments, that’s a meaningful value. So I chose to interpret those results as meaning that if you get 5 or less comments than your post is “a bad post”, or if you get 13 or more than your post is “a good post”. A good post here is litterally just “got a lot of comments than expected of a typical post”, vice versa for “a bad post”.
You will notice that this is quite rudimentary, like what about when the Americans are asleep, most posts do worse then. That’s not accounted for here, because it increases the complexity beyond what I can really handle in a post.
To give you an idea of a more sweeping internet trend, the adage 1% 9% 90%, where 1% do the posting, 9% do the commenting, and 90% are lurkers – assuming each person does an average of 1 thing a day, suggests that c/p should be about 9 for all sites regardless of size.
Now what is more interesting is that comments per post varies by instance, lemmy.world for example has an engagement of 9.5 c/p and lemmy.ml has 4.8 c/p, this means that a “good post” on .ml is a post that gets 9 comments, whilst a “good post” on .world has to get 15 comments. On hexbear.net, you need 20 comments, to be a “good post”. I got the numbers for instance level comments and posts from here
This is a little bit silly, since a “good post”, by this metric, is really just a post that baits lots and lots of engagement, specifically in the form of comments – so if you are reading this you should comment, otherwise you are an awful person. No matter how meaningless the comment.
Anyway I thought that was cool.
EDIT: I’ve cleared up a lot of the wording and tried to make it clearer as to what I am actually doing.
Write a few comments, go to sleep and see for yourself what happens when you wake up. This is how I do it.
I sometimes comment, just for the sake of comment and to support this community. For example now.
Your post is good!
I feel like we shouldn’t do that or stray off on a tangent, like commenting about comments.
Yes, or even continuing such discussions for no apparent reason, but to feel a bit of the normal human interaction… Or to have fun playing some meta together :3
Oof. That would be problematic for sure. I’m glad we’re cooperating to oppose that kind of nonsense.
At this depth, I consider you a friend, however you are :) Thank you for the good time!
I think one needs to include parameters like how soon after the topic was created the comment was made and how deep is it in the comment tree. If you for instance consistently comment on 1 month old topics or reply on comments ten levels deep you will get very few interactions.
I guess those without ego stroking don’t care.
It makes sense that it would be highly dependent on comments because for one, Lemmy’s default filter is activity based so the more activity a new post has, the higher it will rank, until displaced by a newer post. The second part is that if there aren’t any comments there people might be less likely to leave comments and the post is more likely to do poorly as it’ll get bumped down by posts with higher activity. Obviously not everyone uses the activity sort feature, some sort by new, top, or scaled, but since activity is the default most will use that. Especially since it shows posts with the most discussion and activity, the ones most likely to find other people interacting on.
I think the community matters a lot more than the instance. Hexbear has a bunch of coping bubble communities but they keep posting the same low-quality comments, so that’s probably why the threshold of 20 comments is so high. Another example, I make posts to my own blog community !dginovker_blog@lemmy.ml, but there’s no subscribers so there’s never gonna be any comments.
Basically I’m saying you should do this same analysis across a sample of random communities ^^
Similarly a “good post”, one that gets lots of comments, would be any post that gets more than 13 comments.
By my count, this comment will take your post from one with 12 comments to one with 13 comments, therefore I’m conferring on you the title of “good post”. Congratulations!!
However, I’m assuming that you’re including your own comments in the comment tally. If you’re not, then your 2 comments so far to this post don’t count, and you’ll only be at 11, and therefore “not good”.
If you are counting your own comments on your own post, can you juice the numbers by adding lots of comments? In other words, can you make a post good by interacting with the people who are interacting with the post? Like some kind of um… conversation? Sounds like cheating to me.
Any details you could share about how you obtained and processed the data? It seems like there’s a lot of interesting things that could be done with this but I’m not sure where the best place to start would be
I’m a bit confused but I think I liked this post
Hey, just tossing in a comment here, I think this post is a good post!
If average comments/post is lowest on .ml, medium on .world and highest on hexbear, it might correlate those instances with post meaningfulness, or with the innate tendency of their users to comment. Or with both, or some other thing entirely. All I can really say about it is, “Huh, interesting.” Not interesting because it leads to any particular conclusion, but interesting that there’s a pattern.
that could be because it is an AMAZING post – it covered all the points and no one has anything left to say
Finally, I know why.
This does happen with comments sometimes. I go into a post and someone has already eloquently said what I would have said (often better than I would have). So I upvote it and move along
You need a factor for niche communities. A post with 4 comments in a backpacking community with 20 subscribers is way “gooder” than 40 comments in a 5k subscriber news community.
I.E. add a community size factor.
You know, I’m really just waiting for the day voyager supports gif insertion.
Another likely cause: you’re posting to a non-local community and you got hit by federation issues, while your instance thinks the post got created, the target instance doesn’t know about it.
Happened to me a few times.