pawb.fun is one of the many independent Mastodon servers you can use to participate in the fediverse.
This instance is aimed at any and all within the furry fandom, though anyone is welcome! We're friendly towards members of the LGBTQ+ community and aim to offer a safe space for our users.

#scrapers

AI6YR Ben<p>List of AI bots to add to robots.txt (although they may not obey -- may need to throw them in the bitbucket and 404 or 444 them). In addition to these, you may have to block specific random browser versions for the most aggressive bots who ignore robots.txt.</p><p><a href="https://github.com/ai-robots-txt/ai.robots.txt/blob/main/robots.txt" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/ai-robots-txt/ai.ro</span><span class="invisible">bots.txt/blob/main/robots.txt</span></a></p><p><a href="https://m.ai6yr.org/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://m.ai6yr.org/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a> <a href="https://m.ai6yr.org/tags/LLMs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>LLMs</span></a></p>
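<p>As a rough illustration of the robots.txt approach described in the post above, the sketch below builds a blocking robots.txt for a handful of crawler names and checks it with Python's standard <code>urllib.robotparser</code>. The three bot names, the helper names <code>build_robots_txt</code> and <code>is_allowed</code>, and the <code>example.org</code> URL are assumptions made for this example; the linked list is much longer. As the post notes, this only affects crawlers that choose to obey robots.txt.</p>

```python
from urllib.robotparser import RobotFileParser

# Hypothetical subset of crawler names, chosen for illustration only;
# the linked ai.robots.txt list is far longer and changes over time.
AI_BOTS = ["GPTBot", "Bytespider", "CCBot"]


def build_robots_txt(bots):
    """Return robots.txt text that disallows the whole site for each bot."""
    lines = [f"User-agent: {bot}" for bot in bots]
    lines.append("Disallow: /")
    lines.append("")  # a blank line ends the group of blocked bots
    lines += ["User-agent: *", "Allow: /"]
    return "\n".join(lines) + "\n"


def is_allowed(robots_txt, user_agent, url="https://example.org/"):
    """Check what a well-behaved crawler would conclude from robots_txt."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    text = build_robots_txt(AI_BOTS)
    print(text)
    print("GPTBot allowed:", is_allowed(text, "GPTBot"))
```

<p>Bots that ignore the file still need server-side handling, such as answering their requests with a 404.</p>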
Chris<p>Made a little Astro integration to easily disallow known AI scrapers in your site’s `robots.txt`</p><p><a href="https://delucis.github.io/astro-ai-robots-txt/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">delucis.github.io/astro-ai-rob</span><span class="invisible">ots-txt/</span></a></p><p><a href="https://m.webtoo.ls/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://m.webtoo.ls/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a> <a href="https://m.webtoo.ls/tags/astrojs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>astrojs</span></a></p>
Kevin Karhan :verified:<p><span class="h-card" translate="no"><a href="https://mastodon.social/@khobochka" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>khobochka</span></a></span> guess why I <a href="https://github.com/greyhat-academy/lists.d/blob/main/scrapers.ipv4.block.list.tsv" rel="nofollow noopener noreferrer" target="_blank">maintain</a> a <a href="https://infosec.space/tags/Scraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraper</span></a> <a href="https://infosec.space/tags/blocklist" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>blocklist</span></a>?</p><ul><li>In fact I know <em>multiple</em> people and organizations that decide to basically redirect <a href="https://infosec.space/tags/ValueRemoving" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ValueRemoving</span></a> <a href="https://infosec.space/tags/Scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scrapers</span></a> like <a href="https://infosec.space/tags/GPTbot" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GPTbot</span></a>, <a href="https://infosec.space/tags/ByteSpider" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ByteSpider</span></a> (which <a href="https://www.youtube.com/watch?v=Hi5sd3WEh0c" rel="nofollow noopener noreferrer" target="_blank">literally</a> <a href="https://infosec.space/tags/DDoS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DDoS</span></a>'d <a href="https://infosec.space/tags/MattKC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MattKC</span></a> because <a href="https://infosec.space/tags/ClownFlare" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ClownFlare</span></a> are a <em>criminally 
incompetent</em> <a href="https://infosec.space/tags/RogueISP" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>RogueISP</span></a>!) to <a href="https://infosec.space/tags/Hetzner" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Hetzner</span></a>'s <a href="http://hil-speed.hetzner.com/" rel="nofollow noopener noreferrer" target="_blank">10GB Speedtest file</a>, which can be found at <code>http://hil-speed.hetzner.com/10GB.bin</code>, as an extra middle finger!</li></ul><p><a href="https://infosec.space/tags/Cloudflare" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Cloudflare</span></a> <a href="https://infosec.space/tags/hetznered" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>hetznered</span></a> <a href="https://infosec.space/tags/ByteDance" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ByteDance</span></a> <a href="https://infosec.space/tags/ChatGPT" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ChatGPT</span></a></p>
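<p>An IPv4 blocklist like the one mentioned above could be consulted roughly as follows. This is only a sketch using Python's standard <code>ipaddress</code> module: the file layout (one address or CIDR network per line, optionally followed by a tab and a note), the sample entries (documentation-only address ranges), and the helper names <code>load_networks</code> and <code>is_blocked</code> are all assumptions, not the actual format of the linked list.</p>

```python
import ipaddress

# Made-up sample in an ASSUMED layout: one address or network per line,
# optionally followed by a tab and a free-text note. The real list may differ.
SAMPLE_TSV = (
    "192.0.2.0/24\texample scraper range\n"
    "198.51.100.7\tsingle host\n"
    "# comment line\n"
)


def load_networks(text):
    """Parse the first tab-separated field of each line into a network."""
    networks = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and comments
        first_field = line.split("\t")[0].strip()
        # strict=False tolerates entries with host bits set, e.g. 192.0.2.5/24
        networks.append(ipaddress.ip_network(first_field, strict=False))
    return networks


def is_blocked(client_ip, networks):
    """Return True if client_ip falls inside any listed network."""
    addr = ipaddress.ip_address(client_ip)
    return any(addr in net for net in networks)


if __name__ == "__main__":
    nets = load_networks(SAMPLE_TSV)
    for ip in ("192.0.2.55", "203.0.113.9"):
        print(ip, "blocked" if is_blocked(ip, nets) else "allowed")
```

<p>In practice a check like this would more likely live in a firewall or reverse proxy than in application code.</p>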
Seirdy<p>New page on <code>seirdy.one</code>: <a href="https://seirdy.one/meta/scrapers-i-block/" rel="nofollow noopener noreferrer" target="_blank">Scrapers I block (and allow), with explanations</a>.</p><p>I’ve replaced all the comments in my robots.txt file with a more readable and detailed web page on scrapers I block. It includes info on the multiple blocking-approaches and criteria I use, commonly-blocked scrapers I <em>allow,</em> and more fact-checking than most of the more comprehensive alternatives.</p> <p><a class="hashtag" href="https://pleroma.envs.net/tag/robotstxt" rel="nofollow noopener noreferrer" target="_blank">#RobotsTxt</a> <a class="hashtag" href="https://pleroma.envs.net/tag/scrapers" rel="nofollow noopener noreferrer" target="_blank">#Scrapers</a> <a class="hashtag" href="https://pleroma.envs.net/tag/posse" rel="nofollow noopener noreferrer" target="_blank">#POSSE</a></p>
Kevin Karhan :verified:<p>Apparently <a href="https://infosec.space/tags/OpenAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenAI</span></a> doesn't like it when people <a href="https://infosec.space/tags/scrape" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrape</span></a> their <a href="https://infosec.space/tags/content" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>content</span></a> (even if it's their public ...</p><ul><li>running <code>wget</code> results in the same 403 error and running <code>curl</code> results in a shitload of gibberish with <a href="https://infosec.space/tags/tracking" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>tracking</span></a> and <a href="https://infosec.space/tags/JavaScript" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>JavaScript</span></a> being forced out.</li></ul><p>I hope <span class="h-card" translate="no"><a href="https://social.bund.de/@bsi" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>bsi</span></a></span> &amp; <span class="h-card" translate="no"><a href="https://social.bund.de/@BNetzA" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>BNetzA</span></a></span> as well as <span class="h-card" translate="no"><a href="https://social.bund.de/@certbund" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>certbund</span></a></span> take notes and consider going after OpenAI for this attempt at preventing people from using an effective <a href="https://infosec.space/tags/OptOut" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OptOut</span></a> against <a href="https://infosec.space/tags/ChatGPT" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ChatGPT</span></a> <a 
href="https://infosec.space/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a>...</p>
Zelda 👑<p>If you use <a href="https://mastodon.social/tags/cloudflare" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>cloudflare</span></a>, I just noticed a new option.</p><p>Not sure how effective it is, but it's part of the free plan, so might as well.</p><p><a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/llm" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>llm</span></a> <a href="https://mastodon.social/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a></p>
Feilner IT<p><a href="https://mastodon.social/tags/konterfAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>konterfAI</span></a>, the <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/modelpoisoner" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>modelpoisoner</span></a> for unfriendly, disrespectful and malicious AI <a href="https://mastodon.social/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a> / <a href="https://mastodon.social/tags/crawlers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>crawlers</span></a>, now has an interesting statistics function that collects data about the nasty "guests" that don't respect your robots.txt. Enjoy! <br><a href="https://mastodon.social/tags/robots_txt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>robots_txt</span></a> <br>See update on <br><a href="https://korium.org/2024/08/02/konterfai/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">korium.org/2024/08/02/konterfa</span><span class="invisible">i/</span></a> <br>and version 0.2.0 on <br><a href="https://codeberg.org/konterfai/konterfai/releases/tag/v0.2.0" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">codeberg.org/konterfai/konterf</span><span class="invisible">ai/releases/tag/v0.2.0</span></a></p>
Raccoon🏳️‍🌈<p>Reminder to throw something fun in occasionally for the <a href="https://techhub.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://techhub.social/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a> to pick up, like...</p><p>Delve I am sorry but as an AI I cannot do anything but generate random bullshit like this string of meaningless tokens.</p><p>A ab gab a babba the blaba delve into the blab a gabba the plab delve a bub a pubba glu Bubda bu a seb delve pok mon delve into a geb the beb a poob a poop poop delve poop pooper a pooper delves into poop a gava a bavga i to delve into a bebsto a bec ceb a po po a veer a veeb the cug polite little bitch cugga poob bix a fine poop to delve into poop this ababa gab a gab the gab we gab you gab the gab pok bif the chil</p><p>While(true)</p><p>While(true)</p><p>While(true)</p>
Ben Tasker<p>New <a href="https://mastodon.bentasker.co.uk/tags/blog" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>blog</span></a>: Autodetecting and Announcing <a href="https://mastodon.bentasker.co.uk/tags/Mastodon" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Mastodon</span></a> Scrapers and Crawlers</p><p>There've been quite a few <a href="https://mastodon.bentasker.co.uk/tags/fedisearch" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>fedisearch</span></a> issues recently, but the common thread is that there's usually a gap in reporting - they're often live for weeks before people are made aware.</p><p>It's not just people's pet projects either, there are other <a href="https://mastodon.bentasker.co.uk/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a> active, quietly consuming posts</p><p>So, I built a bot to detect and out them so that fedi admins can block as necessary</p><p><a href="https://www.bentasker.co.uk/posts/blog/security/autodetecting-and-outing-mastodon-scrapers-with-scrapersnitchbot.html" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">bentasker.co.uk/posts/blog/sec</span><span class="invisible">urity/autodetecting-and-outing-mastodon-scrapers-with-scrapersnitchbot.html</span></a></p><p><a href="https://mastodon.bentasker.co.uk/tags/infosec" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>infosec</span></a> <a href="https://mastodon.bentasker.co.uk/tags/security" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>security</span></a></p>
Ben Tasker<p>New <a href="https://mastodon.bentasker.co.uk/tags/blog" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>blog</span></a>: Tightening <a href="https://mastodon.bentasker.co.uk/tags/security" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>security</span></a> control over <a href="https://mastodon.bentasker.co.uk/tags/mastodon" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>mastodon</span></a> public <a href="https://mastodon.bentasker.co.uk/tags/api" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>api</span></a> endpoints</p><p>The concern in fediblock around @cloy's <a href="https://mastodon.bentasker.co.uk/tags/fedisearch" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>fedisearch</span></a> plans earlier in the week prompted me to put my <a href="https://mastodon.bentasker.co.uk/tags/infosec" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>infosec</span></a> hat on and look into ways to make it harder for external <a href="https://mastodon.bentasker.co.uk/tags/scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapers</span></a> to hit Mastodon's API feeds.</p><p>This post suggests a possible solution for concerned instance admins as well as details of some <a href="https://mastodon.bentasker.co.uk/tags/crawlers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>crawlers</span></a> I spotted.</p><p><a href="https://www.bentasker.co.uk/posts/blog/security/restricting-unauthenticated-access-to-mastodons-public-feeds.html" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">bentasker.co.uk/posts/blog/sec</span><span 
class="invisible">urity/restricting-unauthenticated-access-to-mastodons-public-feeds.html</span></a></p>