Gytis Repečka<p>Attention server admins! Yesterday I've read <a href="https://mastodon.scot/@simon_brooke/114618257884522043" rel="nofollow noopener noreferrer" target="_blank">a post</a> by <span class="h-card"><a href="https://mastodon.scot/@simon_brooke" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>simon_brooke</span></a></span> how nasty AI scraper bots are attacking his self-hosted <span class="h-card"><a href="https://floss.social/@forgejo" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>forgejo</span></a></span> instance. Soon after I'm seeing unusual, periodic traffic spikes on <a href="https://source.gyt.is/" rel="nofollow noopener noreferrer" target="_blank">mine</a> and again - dominated by OpenAI, but some other freeloaders too:</p><pre><code>20.171.207.41 GPTBot/1.2
85.208.96.211 SemrushBot/7~bl
54.36.148.64 AhrefsBot/7.0
114.119.139.53 PetalBot
</code></pre><p>With <code>GPTBot</code> and <code>SemrushBot</code> attacking hardest :blobcatscared:</p><p>They've been hammering my little server periodically today as well, slowing down my instance dramatically as if I was experiencing malicious DDoS attack :blobcatfearful: Well, in a sense it is one :blobcatnotlikethis:</p><p>Watch out - it seems corporate AI techbros learned to scrape :forgejo: content and starts doing it on a massive scale :blobcatoutage: Remember when <span class="h-card"><a href="https://social.anoxinon.de/@Codeberg" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>Codeberg</span></a></span> was (and repeatedly is) hit?</p><p>For now blocked IP ranges and <code>User-Agent</code> combinations, not sure for how long that will be enough :blobcatumm:</p><p>Please boost for visibility and be prepared!</p><p><a href="https://social.gyt.is/tags/forgejo" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>forgejo</span></a> <a href="https://social.gyt.is/tags/developerlife" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>developerlife</span></a> <a href="https://social.gyt.is/tags/coding" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>coding</span></a> <a href="https://social.gyt.is/tags/attack" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>attack</span></a> <a href="https://social.gyt.is/tags/techbros" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>techbros</span></a> <a href="https://social.gyt.is/tags/aislop" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>aislop</span></a> <a href="https://social.gyt.is/tags/openai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>openai</span></a> <a href="https://social.gyt.is/tags/bots" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>bots</span></a> <a href="https://social.gyt.is/tags/ddos" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ddos</span></a></p>