CNBC shared this from a Google all-hands this month: One tool to try and help with that is Google’s new Perspectives feed that’s designed to show results from humans. But now that many of the protesting subreddits have opened up, the Reddit trick isn’t as nerfed as it used to be.
Yes, but it’s not as easy as appending “lemmy”. Here, the content is spread over a wide number of URLs, and unlike reddit, they don’t rank super high in the results.
But thanks to federation, search engines should eventually pick up most content, even when concentrating on just a few big instances. I’m guessing that eventually, you’ll be able to do a search with site:lemmy.ml and get most results (as with that many users, chances are high any other post has been copied via federation). Until then, you might need a more widespread search, I created a lens for kagi (paid search engine), that searches the biggest instances: kbin.social, lemmy.ml, lemmy.world, beehaw.org, sh.itjust.works, lemm.ee, programming.dev
The result is this:
That does not work with site:lemmy.ml and would instead require site:lemmy.world on google.
Caveat: Duplicate content detection might really throw things off. Normally, if two sites have the same content, but different URLs and no canonical html reference, then one will be detected as spam (or should be), I’m not sure how search engine indexing will be affected by this in the future.
It’s a subscription-based search engine. From what I’ve seen, the results seem pretty good compared to everything else at this point, but IDK if I’d want to pay monthly for search (though honestly I’m seriously considering it at this point).
As someone who’s been a paid user for a few months now, it is totally worth it. It has its moments where I need to try another engine, but the vast majority of the time it’s way better. The ability to rank sites is a game changer, and as a bonus I can block Pinterest and the usual fluff from results and never see them again.
That’s interesting that lemmy doesn’t generate canonicals. I would have thought that the original instance something is posted on would set the canonical, and other instances can point back to that - it really seems like this sort of problem is exactly what canonicals are made for. Does anyone know if there’s a reason for not using them (other than dev time, which is 100% a good reason)?
Yes, but it’s not as easy as appending “lemmy”. Here, the content is spread over a wide number of URLs, and unlike reddit, they don’t rank super high in the results.
But thanks to federation, search engines should eventually pick up most content, even when concentrating on just a few big instances. I’m guessing that eventually, you’ll be able to do a search with
site:lemmy.ml
and get most results (as with that many users, chances are high any other post has been copied via federation). Until then, you might need a more widespread search, I created a lens for kagi (paid search engine), that searches the biggest instances: kbin.social, lemmy.ml, lemmy.world, beehaw.org, sh.itjust.works, lemm.ee, programming.devThe result is this:
That does not work with
site:lemmy.ml
and would instead requiresite:lemmy.world
on google.Caveat: Duplicate content detection might really throw things off. Normally, if two sites have the same content, but different URLs and no
canonical
html reference, then one will be detected as spam (or should be), I’m not sure how search engine indexing will be affected by this in the future.Thank you for this. I also use Kagi so im totally gonna set the lens up.
What’s Kagi?
It’s a subscription-based search engine. From what I’ve seen, the results seem pretty good compared to everything else at this point, but IDK if I’d want to pay monthly for search (though honestly I’m seriously considering it at this point).
As someone who’s been a paid user for a few months now, it is totally worth it. It has its moments where I need to try another engine, but the vast majority of the time it’s way better. The ability to rank sites is a game changer, and as a bonus I can block Pinterest and the usual fluff from results and never see them again.
I bet it still respects search modifiers? Both DDG and Google frequently ignore mine.
I checked, the
!lemmy
bang is not in use ;)That’s interesting that lemmy doesn’t generate canonicals. I would have thought that the original instance something is posted on would set the canonical, and other instances can point back to that - it really seems like this sort of problem is exactly what canonicals are made for. Does anyone know if there’s a reason for not using them (other than dev time, which is 100% a good reason)?
Oh snap, + 1 for Kagi usage, I’ll try your implementation